TF-IDF

Term frequency-inverse document frequency (TF-IDF) is a statistical measure used in information retrieval and text mining to evaluate how important a word is to a document in a collection or corpus. It combines term frequency (TF), which measures how often a term appears in a document, with inverse document frequency (IDF), which measures how rare the term is across the collection as a whole. A term therefore scores highly when it occurs often in one document but appears in few of the others, while words common to nearly every document score close to zero.
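
As a minimal sketch of how the two parts combine, the Python below assumes one common formulation: TF is the term's count divided by the document's length, and IDF is the natural logarithm of N / df, where N is the number of documents and df is the number of documents containing the term. Other weightings (smoothed IDF, log-scaled TF) are also in use. The function name `tf_idf`, the whitespace tokenization, and the sample sentences are illustrative assumptions, not part of any standard definition.

```python
import math
from collections import Counter


def tf_idf(corpus):
    """Return one {term: score} dict per document in `corpus`.

    Assumes TF = count / document length and IDF = ln(N / df).
    """
    # Naive tokenization: lowercase and split on whitespace.
    tokenized = [doc.lower().split() for doc in corpus]
    n_docs = len(tokenized)

    # Document frequency: number of documents that contain each term.
    df = Counter()
    for tokens in tokenized:
        df.update(set(tokens))

    scores = []
    for tokens in tokenized:
        counts = Counter(tokens)
        total = len(tokens)
        scores.append({
            term: (count / total) * math.log(n_docs / df[term])
            for term, count in counts.items()
        })
    return scores


docs = [
    "the cat sat on the mat",
    "the dog sat on the log",
    "cats and dogs are pets",
]
scores = tf_idf(docs)

# Terms of the first document, highest score first.
for term, score in sorted(scores[0].items(), key=lambda kv: -kv[1]):
    print(f"{term}: {score:.3f}")
```

In the first document, "cat" appears once but in no other document, so it outranks "the", which appears twice but is shared with the second document. Under this unsmoothed IDF, a term present in every document gets a score of exactly zero.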