GGG & AngelList – a making-of

The tools, methodology, data sources and data output used to produce the article GenderGapGrader: AngelList.

We’ve opened the free GendRE API, which extracts gender from names. To make it usable by everyone, we’ve built an extension for RapidMiner, a leading open-source data mining and predictive analytics package.

So you can run your own gender gap analysis, where and when it matters to you!

GGG_Make_your_own_gendergap_study_vF

Data Sources:

Data Mining Tools:

  • RapidMiner v6
  • RapidMiner Onomastics Extension (Extract Gender Operator) v0.0.4
    • Get it from RapidMiner Market Place, OR
    • Get it from GitHub
    • Documentation and video tutorial
  • GendRE API v0.0.15/v0.0.16
  • Plus, special thanks to: MonetDB, PostgreSQL, Apache Tomcat, OpenJDK, Python, Ubuntu… and others.

Data Output:

  • AngelList_Genderized.zip (TXT DELIMITED, UTF-8, ZIP)
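
For readers who would rather explore a genderized, delimited text file outside RapidMiner, here is a minimal Python sketch of the core of a gender gap tally: counting gender labels and turning them into shares. The column name `gender`, the tab delimiter, the label values and the sample rows are all assumptions made for illustration. They are not taken from the actual AngelList file, so adjust them to match the real header.

```python
import csv
import io
from collections import Counter


def gender_shares(delimited_text, gender_column="gender", delimiter="\t"):
    """Return the share of each gender label found in a delimited text table.

    The column name and delimiter are assumptions; adapt them to the real file.
    """
    reader = csv.DictReader(io.StringIO(delimited_text), delimiter=delimiter)
    counts = Counter(row[gender_column].strip().lower() for row in reader)
    total = sum(counts.values())
    if total == 0:
        return {}
    return {label: count / total for label, count in counts.items()}


# Hypothetical sample rows, not taken from the real data set.
sample = (
    "name\tgender\n"
    "Alice Martin\tfemale\n"
    "Bob Smith\tmale\n"
    "Carol Jones\tfemale\n"
    "Dan Brown\tmale\n"
    "Eve White\tunknown\n"
)

shares = gender_shares(sample)
for label, share in sorted(shares.items()):
    print(f"{label}: {share:.0%}")
```

Keeping the "unknown" label in the tally, rather than dropping it, makes it visible how much of the data could not be genderized.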

Estimates:

  • 201409_AngelList_Preanalysis_v003_Preview_vF.zip (EXCEL, ZIP)

Tutorial:
