Words and language VS digit and info
Language and mathematics are created for the same purpose --
Recording and dissemination of information
information
information
--encoding--> information
--decoding--> information
words and digit
Human needs to record large amounts of information efficiently : words
Text clustering according to meaning, will eventually bring some ambiguity, how to know a polysemous word what is the meaning in a certain environment ? the solution is to rely on the context
Rosetta Stone
Redundancy of information is the guarantee of information security
Language data, we refer to as the corpus, multilingual compare-corpus is important to the translation, it is we are engaged in the research foundation of machine translation
商博良破译
Rosetta Stone
Mathematics behind words and language
The problem of linguistic research, language right or grammar right?
NLP's achievement is the language right.
Summary
communication principle and model of information transmission
(info-source) encoding and shortest encoding
decoding rules, syntax
clustering
check bit
bilingual contrast text, corpus and Machine-Translation
polysemy, Using context to eliminate ambiguity