Onboarding Checklist

NOTE
This is currently a Work In Progress

This is a curated task list aimed at developers joining the team. It covers general tasks that everyone should complete, followed by role-specific checklists for software developers, data engineers, and DevOps engineers.

General (everyone)

People and access

Meet with your technical lead to understand the team structure and current priorities
Meet with the product manager to understand the product vision and roadmap
Create a GitHub account (or use an existing one) and ask your technical lead to get you added to the organisation with the correct permissions — remove any personal access tokens from an existing account before using it
Get access to the relevant Slack channels — ask your technical lead if you are unsure which ones

Reading

Read the Planning Data Handbook
Read the key development principles
Review the deploy and release procedure
Review the Pull Request best practices
Review the CI/CD strategy
Read through Onboarding in Data Operations for a broad overview of the platform and the questions worth asking as you get started

Software Developer (frontend and/or backend)

Machine setup

[ ] Run through the Set Up for Mac tutorial to install the base-level dependencies needed across most of our repositories

Key repositories to clone

[ ] Planning Data platform — the main website
[ ] Submit service — tools to help data providers submit URLs for collection

Get the applications running

[ ] Follow the Submit Service tutorial for an overview of the application architecture and instructions to get it running locally

Best practices and conventions

[ ] Review the best practices documentation — all projects should follow these
[ ] Confirm you can start a service using the standard make targets:
```
make init
make run
```

Data Engineer

Machine setup

[ ] Run through the Set Up for Mac tutorial to install the base-level dependencies needed across most of our repositories

Key reading

[ ] Review the data architecture overview
[ ] Work through the Data Operations Manual — pay particular attention to the Key Concepts and How-To Guides sections

Key repositories to clone

[ ] Digital Land Python — the core CLI and data processing library
[ ] Makerules — shared make targets used across data pipeline repositories
[ ] Specification — dataset specifications and schemas
[ ] AWS Batch Docker — Dockerfiles and entrypoints for batch processing tasks

Get oriented

[ ] Run through the Running a Collection tutorial
[ ] Confirm you can set up a repository locally using:
```
make init
make test
```

DevOps

Key reading

[ ] Review the infrastructure documentation in full, including deployment diagrams, code structure, security approach, and how-to guides
[ ] Review the CI/CD strategy in depth — you will own and maintain these pipelines
[ ] Review the alerting and monitoring strategy
[ ] Review the architecture checklist

Key repositories

[ ] Clone the infrastructure repository and review the structure with your technical lead — this is the primary repo you will work in
[ ] Review the Makerules repository as it underpins the CI/CD workflows across all other repositories
[ ] Review the AWS Batch Docker repository

Access and tooling

[ ] Set up AWS Vault for secure credential management
[ ] Confirm access to the relevant AWS environments with your technical lead