Word Frequency Blog - Word Frequency Blog

Understanding Corpus Tools: An Introduction

On

October 4, 2023

A trip through the linguistic isn’t complete without stumbling upon the term “corpus.” As we delve deeper into language studies […]
Continue reading
Apache OpenNLP – Tokenization

On

September 8, 2022

Tokenization is a process of segmenting strings into smaller parts called tokens(say sub-strings). Usually, these tokens are words, numbers, or […]
Continue reading
NLP – Natural language processing

On

April 27, 2022

From voice-activated assistants like Siri and Alexa to chatbots on customer service websites, there’s a hidden technology working behind the […]
Continue reading
English Lemmatization: Simplifying Words in NLP

On

April 25, 2022

Language, in all its complexity, offers multiple ways to express similar concepts. We have “running”, “ran”, and “runner” — all […]
Continue reading
Understanding the Text Corpus

On

March 15, 2022

In the realm of linguistics and natural language processing, you might have come across the term “text corpus.” For many […]
Continue reading
Bound Morphemes

On

March 14, 2022

Language is a captivating domain, filled with depth and complexity. Each word we speak or pen reflects the profoundness of […]
Continue reading