Jump to content

Data Platform Engineering/Data Platform SRE/Priorities

From mediawiki.org

Here are the high level priorities of the DPE SRE team. The detailed backlog can be found on our main Phabricator board. Our current work can be followed on our "milestone" Phabricator board (there is no stable link to the current milestone, but it can be found as a link in the menu of our main board).

Current main projects

[edit]

To simplify operations and increase availability, we are migrating Airflow to k8s.

[edit]

To support the deprecation and removal of Graphite

[edit]

To support work by the Search Platform team. In particular, DPE SRE is focused on migration of the internal WDQS clients and the operational support of the underlying servers / platform.

[edit]

Archiva is our current solution for artifact hosting for Java / Scala projects and mirroring of external Maven repositories. It is unsupported and as a critical piece of our development and deployment infrastructure needs to be replaced. Gitlab is a component that provides the functionality that we need and is already deployed in our infrastructure, it is the obvious solution.

This project is driven by DPE SRE, but most of the implementation work is done by Search Platform, Data Engineering and Data Products. It is prioritized on top of the usual work for those teams and thus is slow moving.

[edit]

Usual operational work

[edit]
  • Incidents
  • Various minor software upgrades
  • Access requests
  • SPARQL Federation requests

High level backlog of projects

[edit]
  • Migration of the Search cluster from Elasticsearch to OpenSearch: T370147
  • Kafka upgrade: design doc
  • Mutualized OpenSearch cluster: T362105 & design doc
  • Hadoop upgrade: T379385
  • Kubernetes upgrade
  • Spark upgrade
  • Migration of additional services to k8s
    • Presto
    • JupyterHub