It’s difficult enough for a computer to understand a simple sentence. So how do you train it to recognize even the most representative keywords of a document that might run 10 or 20 pages in length?