1 Answer

Best answer
In text normalization, the text goes through several steps that reduce it to a simpler, more uniform form. This processing usually covers text from multiple documents, and the complete body of text from all of those documents taken together is known as a corpus. Put more formally, a corpus is a large, structured set of machine-readable texts that have been produced in a natural communicative setting. Put more simply, a corpus is a collection of text documents: it can be thought of as a bunch of text files in a directory, often alongside many other directories of text files.
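To make the "text files in a directory" picture concrete, here is a minimal Python sketch. The function names `load_corpus` and `normalize` are hypothetical, and the normalization shown (lowercasing and splitting into word tokens) is only an assumed example of a normalization step, since the answer does not list the steps themselves.

```python
import re
from pathlib import Path


def load_corpus(directory):
    """Read every .txt file in `directory` into a dict of filename -> text.

    The returned dict as a whole plays the role of the corpus.
    """
    corpus = {}
    for path in sorted(Path(directory).glob("*.txt")):
        corpus[path.name] = path.read_text(encoding="utf-8")
    return corpus


def normalize(text):
    """Assumed example of normalization: lowercase, then keep word tokens."""
    return re.findall(r"[a-z0-9']+", text.lower())
```

Calling `load_corpus` on a folder of `.txt` files and then applying `normalize` to each document gives one list of tokens per document, which is the usual starting point for later processing of the corpus.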
