TFIDF In Libraries: Part I of III (For Librarians)
This is the first of a three-part series called TFIDF In Libraries, where “relevancy ranking” will be introduced. In this part, term frequency/inverse document frequency (TFIDF) — a common...
TFIDF In Libraries: Part II of III (For programmers)
This is the second of a three-part series called TFIDF In Libraries, where relevancy ranking techniques are explored through a set of simple Perl programs. In Part I relevancy ranking was introduced...
Automatic metadata generation
I have been having a great deal of success extracting keywords and two-word phrases from documents and assigning them as “subject headings” to electronic texts — automatic metadata generation. In many...