Wikimedia Release Engineering Team/DataDataData Sync Up/2019-05-21
Appearance
2019-05-21
[edit]Phab task
[edit]Last time
[edit]- Previous meeting was long ago...
Today's Agenda
[edit]- Vacation and not much movement
- Review requirements
- Go over email draft
What data we have currently or are planning to collect
[edit]- Schema
- Data samples
How we might want to query that data
[edit]- Our data is highly structured (see schemas)
- Is Hadoop or ES more appropriate for that? Would we lose structure by putting it in Hadoop?
- How much do we have to know about how data's structure before we put it in ES?
- Can relationships/schema be changed after data is stored?
TODOs (by next meeting)
[edit]- Dan to send email to Analytics and set up meeting