User:LouisDang
Hi, I'm Louis Dang, and I'm volunteering with the Analytics team. I use this page to index my work and reference material so that I and others can find them easily.
My Work
- https://github.com/louisdang/kraken/tree/master/src/main/java/org/wikimedia/analytics/kraken/pig/RegexMatch.java Pig UDFs for regular expression matching and simple IPv4 and IPv6 address validation.
- https://github.com/louisdang/kraken/blob/master/src/main/java/org/wikimedia/analytics/kraken/pig/ParseWikiUrl.java Parse Wiki URL UDF.
- https://github.com/louisdang/kraken/tree/master/src/test/java/org/wikimedia/analytics/kraken/pig Various tests.
- https://github.com/louisdang/kraken/tree/master/src/main/java/org/wikimedia/analytics/kraken/hue Hue API.
References
Pig
- http://pig.apache.org/docs/r0.10.0/ Pig 0.10 Documentation
- https://cwiki.apache.org/confluence/display/PIG/Index Pig Wiki
- http://www.cloudera.com/blog/2009/06/analyzing-apache-logs-with-pig/ Cloudera tutorial on geocoding with Pig
Oozie
- http://nosql.mypopescu.com/post/8436633131/a-detailed-guide-to-oozie InfoQ guides on Oozie.
Misc
- https://docs.google.com/folder/d/0B1unTxaXLQeARGhBTjlySDlUVFk/edit Useful research papers.
- http://meta.wikimedia.org/wiki/User:Stu/comScore_data_on_Wikimedia comScore data analysis by user Stu.