Jump to content

Topic on Talk:TextCat

Related article - How to identify any language at a glance

2
Quiddity (talkcontribs)
TJones (WMF) (talkcontribs)

Cool! This is actually a much lower resolution version of the ideas behind TextCat. Certain letters, combinations of letters, and the relative frequencies of the letters or combinations is how TextCat identifies languages. It's made more complicated by the length of the text we're dealing with; queries are often very short, and not all of the distinguishing features of a language will not be present in a short query string.

Reply to "Related article - How to identify any language at a glance"