Wikimedia Product/Data dictionary
Definitions for core and other essential metrics have been moved to a separate Data Glossary. For documentation of the datasets behind these derived tables, and the pipelines that generate that data, see the Data Platform docs on Wikitech.
Druid Data Tables in Superset/Turnilo
- edit_hourly
- Table contains edits data, aggregated hourly.
- mediawiki_history_reduced
- A lighter version of mediawiki_history. Table contains only data on revisions that have not been removed by page deletion.
- virtualpageviews_hourly
- Table contains virtual pageviews data, aggregated hourly.
- pageviews_hourly
- Table contains pageviews data, aggregated hourly.
- pageviews_daily
- Table contains pageviews data, aggregated daily.
- unique_devices_per_domain_daily
- Table contains unique devices counts per domain, aggregated daily.
- unique_devices_per_domain_monthly
- Table contains unique devices counts per domain, aggregated monthly.
- unique_devices_per_project_family_daily
- Table contains unique devices counts per project family, aggregated daily.
- unique_devices_per_project_family_monthly
- Table contains unique devices counts per project family, aggregated monthly.
- mediawiki_geoeditors_monthly
- Table contains private data on editor counts by country region, aggregated monthly.
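Several of the tables above come in pairs that differ only in time granularity (for example pageviews_hourly and pageviews_daily). As a rough illustration of that relationship, the sketch below rolls hourly counts up to daily totals. The rows, field names, and the `roll_up_to_daily` helper are hypothetical and are not taken from the actual table schemas.

```python
from collections import defaultdict
from datetime import datetime

# Hypothetical hourly rows: (hour timestamp, project, view count).
# These field names are illustrative only, not the real Druid schema.
hourly_rows = [
    ("2024-01-01T00:00:00", "en.wikipedia", 120),
    ("2024-01-01T01:00:00", "en.wikipedia", 80),
    ("2024-01-01T00:00:00", "de.wikipedia", 30),
    ("2024-01-02T00:00:00", "en.wikipedia", 50),
]


def roll_up_to_daily(rows):
    """Sum hourly counts into one total per (date, project)."""
    daily = defaultdict(int)
    for timestamp, project, views in rows:
        # Truncate the hour timestamp to its calendar date.
        day = datetime.fromisoformat(timestamp).date().isoformat()
        daily[(day, project)] += views
    return dict(daily)


daily_totals = roll_up_to_daily(hourly_rows)
```

This kind of summing only makes sense for additive counts such as pageviews or edits. Unique device counts are not additive across time periods, because the same device can appear on more than one day, which is presumably why the unique devices tables are provided separately at daily and monthly granularity.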
Hive Tables in Superset
- session_length_daily
- Table contains session length and session counts data, aggregated daily.
- content_interactions
- Table contains interactions data, aggregated monthly.
- active_editors
- Table contains active editors data, aggregated monthly.
- content_edit_daily
- Table contains edit topic data, aggregated daily.
- content_pv
- Table contains pageview topic data, aggregated daily.
References
Reconcile datasets in Superset with Key Product Metrics: documents the differences between the data available for exploration in Superset and our monthly Key Product Metrics.