Code for Counting Sentences in Text using tokenizers Package

R




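# Install tokenizers on first use, then load it
if (!require(tokenizers)) {
  install.packages("tokenizers")
  library(tokenizers)
}

# Sample text containing three sentences
text <- "This is an example gfg sentence. Another gfg sentence! this is last example."

# tokenize_sentences() returns a list; unlist() flattens it to a character vector
sentences <- unlist(tokenize_sentences(text))

# One element per sentence, so the length is the sentence count
num_sentences <- length(sentences)

cat("Number of sentences using tokenizers:", num_sentences, "\n")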


Output:

Number of sentences using tokenizers: 3 

  • Store the text in the text variable.
  • Call tokenize_sentences() to split the text into sentences; it returns a list.
  • Use unlist() to flatten that list into a character vector and store it in sentences.
  • Use length() to count the sentences and cat() to display the result.

The text variable contains three sentences: two end with a full stop (.) and one ends with an exclamation mark (!), so the count is 3.
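The breakdown above can be cross-checked without any extra package. The following is a minimal base R sketch, assuming every sentence in the sample ends with ., ! or ?; the variable names terminators, by_mark, and total are illustrative and not part of the original example.

```r
# Same sample text as in the tokenizers example
text <- "This is an example gfg sentence. Another gfg sentence! this is last example."

# Extract every sentence-ending mark (., ! or ?) found in the text
terminators <- regmatches(text, gregexpr("[.!?]", text))[[1]]

# Tally the marks by type and in total
by_mark <- table(terminators)
total <- length(terminators)

print(by_mark)
cat("Total sentence terminators:", total, "\n")
```

This only counts punctuation marks, so it agrees with tokenize_sentences() on simple text like this sample but can differ on text with abbreviations or decimals.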



How to count the number of sentences in a text in R

Counting the number of sentences in a text is a common task in text analysis and natural language processing. Sentence counts feed into many applications, including language modelling, sentiment analysis, and text summarization. In this article, we'll look at several techniques and R packages for counting the sentences in a given text quickly and accurately.


Related Concepts:

  • Regular Expressions: A regular expression specifies a pattern that is used to identify sentences.
  • Functions in R: Various string-related functions will be used for counting sentences...

Steps Required for Counting Sentences in R:

  • First, write an R script in RStudio that will count the sentences.
  • Store the text in a variable as a string.
  • Match a regular expression against the text to find the sentences.
  • Use one of the examples below to get the sentence count.
  • Finally, display the count on the console....

Code for Counting Sentences in Text using stringr Package

R

text <- "This is R program for counting number of sentences in text. This program is for GFG article . And it is using stringr package for counting."

sentences <- unlist(strsplit(text, "[.!?]"))

num_sentences <- length(sentences)

cat("Number of sentences using unlist and strsplit :", num_sentences)
...
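Splitting on single punctuation marks can leave empty pieces when marks appear back to back (for example "?!"), and each empty piece inflates the count. The sketch below is one possible way to guard against that in base R; the function name count_sentences_split and the sample string are hypothetical, chosen only for illustration.

```r
# Hypothetical helper: split on ., ! or ? and ignore empty pieces
count_sentences_split <- function(text) {
  pieces <- unlist(strsplit(text, "[.!?]"))
  # Trim surrounding whitespace so blank-only pieces become empty strings
  pieces <- trimws(pieces)
  # Count only the pieces that still contain text
  sum(nzchar(pieces))
}

sample_text <- "Really?! Yes."

# Plain split: the empty piece between "?" and "!" is counted too
naive_count <- length(unlist(strsplit(sample_text, "[.!?]")))

# Filtered split: empty pieces are dropped before counting
filtered_count <- count_sentences_split(sample_text)

cat("Plain split count:", naive_count, "\n")
cat("Filtered split count:", filtered_count, "\n")
```

The filter is a design choice for tidy input; it still treats every ., ! or ? as a boundary, so abbreviations such as "e.g." would be split.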

Counting Sentences in Text using R and strcount()

...

Code for Counting Sentences in Text using openNLP Package

R

if (!require(stringr)) {
  install.packages("stringr")
  library(stringr)
}

text <- "This is R program for counting number of sentences in text. This program is for GFG article . And it is using stringr package for counting. And is it working ?"

sentence_pattern <- "[.!?]"

num_sentences <- str_count(text, sentence_pattern)

cat("Number of sentences using stringr :", num_sentences, "\n")
...
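Because the pattern "[.!?]" matches every single terminator character, a decimal point or an ellipsis adds to the total. The following base R sketch shows one stricter pattern, assuming a sentence ends with a run of terminators followed by whitespace or the end of the string; the helper name count_sentence_ends and the sample string are assumptions made for this illustration.

```r
# Hypothetical helper: count runs of terminators that actually end a sentence
count_sentence_ends <- function(text) {
  # [.!?]+ treats "..." or "?!" as one terminator; the lookahead requires
  # whitespace or end of string, so the "." inside "3.14" is skipped
  matches <- gregexpr("[.!?]+(?=\\s|$)", text, perl = TRUE)
  length(regmatches(text, matches)[[1]])
}

sample_text <- "Pi is about 3.14. Really... Is it working?"

# Count every terminator character, as the single-character pattern does
char_count <- length(regmatches(sample_text, gregexpr("[.!?]", sample_text))[[1]])

# Count only terminator runs followed by whitespace or end of string
strict_count <- count_sentence_ends(sample_text)

cat("Terminator characters:", char_count, "\n")
cat("Sentence endings:", strict_count, "\n")
```

This remains a heuristic: abbreviations followed by a space, such as "Dr. Smith", would still be counted as sentence endings.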
