《The beauty of mathematics 》 Chap 1

824 查看

Words and language VS digit and info

Language and mathematics are created for the same purpose -- Recording and dissemination of information

information

information --encoding--> information --decoding--> information

words and digit

Human needs to record large amounts of information efficiently : words

Text clustering according to meaning, will eventually bring some ambiguity, how to know a polysemous word what is the meaning in a certain environment ? the solution is to rely on the context

Rosetta Stone

  1. Redundancy of information is the guarantee of information security

  2. Language data, we refer to as the corpus, multilingual compare-corpus is important to the translation, it is we are engaged in the research foundation of machine translation

商博良破译 Rosetta Stone

Mathematics behind words and language

The problem of linguistic research, language right or grammar right?
NLP's achievement is the language right.

Summary

  • communication principle and model of information transmission

  • (info-source) encoding and shortest encoding

  • decoding rules, syntax

  • clustering

  • check bit

  • bilingual contrast text, corpus and Machine-Translation

  • polysemy, Using context to eliminate ambiguity