Internet major Google has secured a patent in India for phrase identification in an information retrieval system.
The system is adapted to identify phrases that have sufficient frequency and distinct use in a document collection. Good phrases are predictive of other phrases, and are not merely sequences of words that appear in the lexicon.
"For instance, 'President of United States' predicts other phrases such as 'George Bush' and 'Bill Clinton'. However, other phrases are not predictive, such as 'fell down the stairs', or 'top of the morning', or 'out of the blue'," Google said.
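The report does not say how the system measures whether one phrase predicts another. As a rough illustration only, the sketch below assumes a simple measure: two phrases count as predictive of each other when they share documents noticeably more often than their individual frequencies would suggest. The function name `predictive_phrases`, the thresholds `min_docs` and `gain_threshold`, and the sample documents are all hypothetical and are not taken from the patent.

```python
from collections import Counter
from itertools import combinations


def predictive_phrases(docs, min_docs=2, gain_threshold=1.5):
    """Map each phrase to the phrases it appears to predict.

    docs is a list of sets, one set of candidate phrases per document.
    The scoring rule (observed / expected co-occurrence) is an assumed
    stand-in, not the method described in the patent.
    """
    n = len(docs)
    doc_freq = Counter()   # number of documents containing each phrase
    pair_freq = Counter()  # number of documents containing both phrases
    for phrases in docs:
        doc_freq.update(phrases)
        pair_freq.update(combinations(sorted(phrases), 2))

    predicts = {}
    for (a, b), together in pair_freq.items():
        # "Sufficient frequency": skip phrases seen in too few documents.
        if doc_freq[a] < min_docs or doc_freq[b] < min_docs:
            continue
        # Co-occurrence count expected if the two phrases were independent.
        expected = doc_freq[a] * doc_freq[b] / n
        if together / expected >= gain_threshold:
            predicts.setdefault(a, set()).add(b)
            predicts.setdefault(b, set()).add(a)
    return predicts


# Hypothetical toy collection, already reduced to candidate phrases.
docs = [
    {"president of united states", "george bush"},
    {"president of united states", "bill clinton"},
    {"president of united states", "george bush", "out of the blue"},
    {"president of united states", "bill clinton"},
    {"out of the blue", "stock market"},
    {"out of the blue", "football"},
    {"out of the blue", "weather"},
    {"out of the blue", "recipe"},
]
result = predictive_phrases(docs)
print(result)
```

In this toy collection, "out of the blue" is the most frequent phrase but turns up alongside unrelated topics, so it ends up with no predicted phrases, while "president of united states" is linked to both names.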
Phrase identification avoids