Wikimedia Product/Data dictionary/mediawiki_history_reduced
Appearance
This page describes the data set mediawiki_history_reduced
that stores on Druid Datasources, which can be accessed via Superset/Turnilo. It's directly loaded from wmf.mediawiki_history_reduced
on Hive. wmf.mediawiki_history_reduced
on Hive is a reduced version of wmf.mediawiki_history
.
Schema
[edit]Field name | Data type | Description | Data example | Source schema | Source field |
---|---|---|---|---|---|
event_entity | string | event entity | revision, user, page | wmf.mediawiki_history | event entity |
event_type | string | create, move, delete, etc with specific digest types. Detailed explanation in the docs under #Event_types | monthly_digest, create, etc. | wmf.mediawiki_history | event_type |
events | int | Number of events | 1 | wmf.mediawiki_history | COUNT(1) |
other_tags | array<string> | Can contain: deleted (and deleted_day, deleted_month, deleted_year if deleted within the given time period), revetered and revert (for revisions), self_created (for users), user_first_24_hours if a revision is made during the first 24 hours of a user registration, redirect (for pages) | ["redirect"], ["self_created"] | wmf.mediawiki_history | page_is_redirect,user_is_created_by_self |
page_namespace | int | The page namespace of the event | 0 | wmf.mediawiki_history | page_namespace |
page_title | string | The page_title of the event, prefixed with canonical namespace if any | Galileo Galilei | wmf.mediawiki_history | page_title, page_title_historical |
page_type | string | content or non_content based on namespace being in content space or not | content | wmf.mediawiki_history | page_namespace_is_content, page_namespace_is_content_historical |
project | string | The project this event belongs to | en.wikipedia | wmf_raw.mediawiki_
project_namespace_map |
hostname |
revisions | bigint | 1 if the event is entity revision, or sum of revisions in case of digests | 1 | wmf.mediawiki_history | event_entity |
revisions_sum | bigint | Number of revisions | 100 | wmf.mediawiki_history | COUNT(1) |
text_bytes_diff | bigint | The text-bytes difference of the event | 4077 | wmf.mediawiki_history | revision_text_bytes_diff |
text_bytes_diff_abs | bigint | The absolute value of text-bytes difference for the event | 4077 | wmf.mediawiki_history | revision_text_bytes_diff |
text_bytes_diff_abs_sum | bigint | Sum of the text-bytes difference of the event | 4077 | wmf.mediawiki_history | revision_text_bytes_diff |
text_bytes_diff_sum | bigint | Sum of the absolute value of text-bytes difference for the event | 4077 | wmf.mediawiki_history | revision_text_bytes_diff |
user_text | string | user_text of user performing the event, whether registered or anonymous (IP) | user_12345 | wmf.mediawiki_history | event_user_text, event_user_text_historical |
user_type | string | anonymous, group_bot, name_bot or user | all | wmf.mediawiki_history | event_user_is_anonymous, event_user_is_bot_by, event_user_is_bot_by_historical |
Dashboards which use this table
[edit](Collecting...)