You can easily extract sentences from an EPUB, PDF, Office, or plain-text file thanks to the open-source projects [Apache Tika](https://tika.apache.org/) and Apache OpenNLP, with a command like:
```
java -jar tika-app-1.14.jar -t ebook.epub | opennlp SentenceDetector it-sent.bi