Wikimedia Release Engineering Team/Checkin archive/20160725
2016-07-25
Vacations/Important dates
How to do it: https://www.mediawiki.org/wiki/Wikimedia_Release_Engineering_Team/Time_off
- July 25 - August 15: Željko vacation. Will have laptop with me. Reachable via phone.
- July 30 - August 21: Antoine vacation. At home 1st week.
- August 1st - 5th: Mukunda - vacation: Concert & relaxation
...
- January 9-11: Dev Summit
- January 12-13: All Hands
Team Business
Rotating positions and absences
Maniphest query for deployment blocker tasks: https://phabricator.wikimedia.org/u/blockers
weeks of July 25 and Aug 1
- Train: Tyler
- SoS: Mukunda / Tyler
- Out:
- Zeljko: July 25 - Aug 15
- Antoine: July 30 - Aug 21
- Mukunda: Aug 1-5
weeks of Aug 8 and Aug 15
- Train: Mukunda
- SoS: Chad
- Out:
- Zeljko: July 25 - Aug 15
- Antoine: July 30 - Aug 21
Time spent spreadsheet
Actions from last meeting
- TODO: file task re upgrading MW-Vagrant guests to Jessie
- done by bryan :)
- https://phabricator.wikimedia.org/T136429
Scrum of Scrums
- https://phabricator.wikimedia.org/project/board/64/
- Blocked on us: https://phabricator.wikimedia.org/maniphest/query/h7YTCBTJsepS/#R
This week
- Blocking
- Blocked
- Updates
- Labs update (with CI disruption)
- Proposed 2016-08-02. What time is best?
- Andrew still wondering about https://phabricator.wikimedia.org/T139771
- If everything is fine w/CI response then it's all good :)
Last week
- Blocking
- Android to differential
- Blocked
- None
- Updates
- Zuul upgraded this week, should address a bunch of issues
- New SWAT deploy process going ok, reminder to install https://wikitech.wikimedia.org/wiki/X-Wikimedia-Debug if you're putting things up for SWAT
Other Team Business
Project tech leads
- https://phabricator.wikimedia.org/T139540#2485589
- tl;dr: how about we just add an explicit "Lead" for each quarterly goal?
Offsite
- agenda/purpose :)
- https://phabricator.wikimedia.org/T138437
- Will be meeting with Kristen Lans from TPG re potential TPG support in 2 hours :)
- Last offsite writeup with lessons learned: https://docs.google.com/document/d/17C6x_Sys21DcEZ_HxgLA7FkCYeiTUAzZX1XbEcZTNfw/edit#
Q1 goal/project check-in
Phase out Ubuntu Precise
Replace primary production Continuous Integration host (gallium) - task T95757
- NEXT: https://phabricator.wikimedia.org/T139771 - "Identify metric (or metrics) that gives a useful indication of user-perceived (Wikimedia developer) service of CI"
- Tyler and Hashar reply to Faidon's comment, keeping focused on getting off of gallium for now
Upgrade Phabricator database servers to Maria10/Jessie - task T138460
- Done
Upgrade Beta Cluster database servers to Maria10/Jessie - task T138778
- Previously waiting on Jaime to prioritize. Priority is "this quarter" (not "this month" or "next week")
- up to us to schedule, should be no more than an hour for Jaime
- NEXT: Needs an owner
- DAN!
Reduce Technical Debt
Perform a technical debt analysis of software and services maintained by WMF Release Engineering - task T138225
- Original mega sheet: https://docs.google.com/spreadsheets/d/1Kxj9p4fKVNo2h23yAQVoOGg77dZ4FLxeXuYrH-1CrPA/edit#gid=0
- Already tracks specific 'things' that need to be addressed
- Redux: https://docs.google.com/spreadsheets/d/1Ncbgbg-ZPSSScOaGswQSJRtreuJOlizRCFln4KyfMWI/edit#gid=0
- Simply severity+importance.
- Redux Redux: https://docs.google.com/spreadsheets/d/1btVdLuV59GZkQax8Hk0jkWDeyF5O_M5HkylWL0WxHxo/edit#gid=0
- Just severity
- Next steps?
- Fill out Redux Redux
- Based on Redux Redux identify the one thing to focus on
- then plan accordingly in Phabricator
Streamline deployments (long-lived branches)
Key result task:
- Convert our production deployment strategy to use long-lived branches - task T89945
project view: https://phabricator.wikimedia.org/project/view/2117/
Non-Quarterly goal work
SWAT deploy changes
- European SWAT deploys next steps (task T137970)
- NEXT: stalled pending finding people to do the SWAT window while Antoine and Željko are on vacation
- Week of 20th August: let's gogogo
CI Scaling/Nodepool
- Wait time for Nodepool instances https://grafana.wikimedia.org/dashboard/db/releng-kpis
- Zuul repackaged with latest upstream. Will upgrade the whole fleet early this week.
- debian-glue job enhanced
- TODO: Zuul packaging tutorial
- TODO: Android job. Move to Jessie; hacked together over the weekend, has to be polished https://phabricator.wikimedia.org/T139137
- MySQL on CI slaves either shuts down randomly or does not start on boot :(
Browser tests
Differential migration
Differential weekly (https://etherpad.wikimedia.org/p/diffuerential-weekly ) TODOs:
- Mukunda had questions for Antoine re Puppet (keys into the private store, production or other? for CI image builder)
- Update documentation on creating/renaming of repos in Diffusion
- Update task with discussion about ACLs?
- Announce plan to migrate MW-Vagrant to Differential
- https://phabricator.wikimedia.org/T131419#2439362
- Outstanding patches should be merged, abandoned, or migrated to Differential revisions.
Beta Cluster
- "deployment-fluorine becomes unresponsive frequently" - https://phabricator.wikimedia.org/T140313
- TODO: Submit patch ( https://gerrit.wikimedia.org/r/#/c/299672/ ) for PuppetSWAT?
Other
- Figure out how to help Jaime with the DB schema inconsistencies issue:
- https://phabricator.wikimedia.org/T132416 and https://phabricator.wikimedia.org/T104459 (see also: https://www.mediawiki.org/wiki/Development_policy#Database_patches )
- What can we do in CI to help prevent, mostly?
- Chad will lick this cookie :)
Scap querying logstash now for the canaries:
- Email to review: https://etherpad.wikimedia.org/p/scap-announce-2016-07-25
People status updates
Antoine
Last week
- Gerrit upgrade / Zuul upgrade
- Target host to replace gallium
- Sync up with Tyler for CI / gallium phase out
- Moaar maintenance
- Offsite site/date
This week
- Zuul upgrade to latest upstream
- Zuul packaging doc
- Vacations backup plan
Chad
Last week
- Moar Gerrit. Train. Choo choo.
This week
- Gerrit. Remove precise remnants from puppet, tune cache stuff, CSS tweaks for crap UI. Triaging old bugs to see which are fixed / invalid / still fixable.
- DB consistencies thingie for Jaime. I owe him one.
Dan
Last week
- Getting back
This week
- Start poking at MW-Vagrant jessie base image https://phabricator.wikimedia.org/T136429
- Figure out where we're at with Malu
Mukunda
Last week
- Get the merge-wmf-branch script cleaned up and shared with the team for feedback
- Brainstorm improvements / other ideas around branch merging / cherry-picking
This week
- T141278: Decide how ReleaseTaggerBot fits into the brave new world of long-lived-branches https://phabricator.wikimedia.org/T141278
Tyler
Last week
- MW Canary work
This week
Željko
Last week
- trying to do the first SWAT (depending on https://phabricator.wikimedia.org/T140264 MediaWiki deployment shell access request for zfilipin)
- Analyze (and share analysis of) the browser testing feedback survey https://phabricator.wikimedia.org/T139247
- Run language screenshots script for VisualEditor in Jenkins https://phabricator.wikimedia.org/T139613
This week
Vacation