Product – Name Ethnicity Classifier

See how incredibly diverse the world is, in full color #bigdata #opendata #datamining #names

NamSor™ is our specialized data mining software. Given a person's name, it can predict the likely country or region of origin, the cultural and linguistic classification, the gender and the spelling variants. Our innovative machine learning algorithm offers unmatched accuracy at a fine-grained level, together with the flexibility and integration capability to

filter through large databases and extract names,
recognize which language or culture stands behind a name,
accommodate all countries / regions / languages / alphabets.

For each name, in addition to classification by name origin, our unique algorithm provides several scores (strength, doubt, synthetic) that enable advanced sorting according to specific needs. The output of our software can integrate with other data management tools for additional data enrichment: data mining, predictive analytics, social graph analysis, semantic analysis …
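To illustrate how per-name scores could drive sorting and filtering downstream, here is a minimal Python sketch. It is not NamSor's API or output format: the `ClassifiedName` record, the `rank` helper, the field names, the assumed meaning of each score (higher strength means a stronger signal, higher doubt means more ambiguity, synthetic is a combined score) and all sample values are hypothetical placeholders.

```python
from dataclasses import dataclass


@dataclass
class ClassifiedName:
    """Hypothetical record for one classified name (not NamSor's real schema)."""
    name: str
    origin: str       # predicted country / region of origin
    strength: float   # assumption: higher means a stronger signal
    doubt: float      # assumption: higher means a more ambiguous result
    synthetic: float  # assumption: a single combined score


def rank(records, min_strength=0.0, max_doubt=float("inf")):
    """Drop records outside the thresholds, then sort by synthetic score, best first."""
    kept = [
        r for r in records
        if r.strength >= min_strength and r.doubt <= max_doubt
    ]
    return sorted(kept, key=lambda r: r.synthetic, reverse=True)


# Made-up placeholder data, for illustration only.
records = [
    ClassifiedName("Name A", "Region X", strength=2.0, doubt=1.5, synthetic=0.8),
    ClassifiedName("Name B", "Region Y", strength=0.5, doubt=1.0, synthetic=0.3),
    ClassifiedName("Name C", "Region X", strength=3.0, doubt=1.1, synthetic=1.9),
]

for r in rank(records, min_strength=1.0):
    print(r.name, r.origin, r.synthetic)
```

Keeping the thresholds as parameters lets each use case choose its own trade-off, for example a strict cut-off for a mailing list and a looser one for exploratory analysis.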

Names are meaningful: we use sociolinguistics to extract their semantics and deliver actionable intelligence. We proudly invented the onomastic millefeuille, a data visualization that demonstrates the importance of names, languages and culture in all human activities. For example, in business and investment: this is a visualization of the demographics of the 500,000 top executives in Europe (the 5,000 largest companies in each EU country plus Switzerland).