<?xml version="1.0" encoding="utf-8"?>
<feed xmlns="http://www.w3.org/2005/Atom" xml:base="https://digital-land.github.io/extract-alpha/">
  <title>Weeknotes for Extract planning data</title>
  <link href="https://digital-land.github.io/extract-alpha/weeknotes/feed.xml" rel="self"/>
  <updated>2025-12-23T00:00:00Z</updated>
  <id>https://digital-land.github.io/extract-alpha/</id>
  <entry>
    <title>The unit of delivery is the team</title>
    <link href="https://digital-land.github.io/extract-alpha/weeknotes/the-unit-of-delivery-is-the-team/"/>
    <updated>2025-10-24T00:00:00Z</updated>
    <id>https://digital-land.github.io/extract-alpha/weeknotes/the-unit-of-delivery-is-the-team/</id><author>
  <name>Steve Messer</name>
</author>
<content xml:lang="en" type="html">
      <![CDATA[
<p class="govuk-body">‘Ello.</p>
<p class="govuk-body">This is our first weeknote about working on the <a href="https://www.gov.uk/service-manual/agile-delivery/how-the-alpha-phase-works" class="govuk-link">alpha phase</a> of Extract. It’s a <a href="https://doingweeknotes.com/" class="govuk-link">regular, weekly communication</a> of what we’ve done, what we’re thinking about, ideas, decisions, scrappy thoughts, and stories about our work designing and testing Extract. It’s also how we can remember what we’ve done. Written for us, but shared with you.</p>
<p class="govuk-body">This week’s note comes from the people on the <a href="/extract-alpha/team/" class="govuk-link">team</a> working at Ministry of Housing, Communities and Local Government (MHCLG). In future weeks, we’ll feature work from the Incubator for AI (i.‌AI) too.</p>
<p class="govuk-body">We’ve written about Extract and where it came from on the <a href="https://mhclgdigital.blog.gov.uk/2025/06/12/extract-using-ai-to-unlock-historic-planning-data/" class="govuk-link">MHCLG Digital blog</a>, if you’re not familiar with it. But since you’re here, we’ll presume you know what it’s about.</p>
<p class="govuk-body">So, what did we get up to this week?</p>
<h2 id="forming-as-a-unit" tabindex="-1" class="govuk-heading-l"><a class="app-link--heading govuk-link" href="#forming-as-a-unit"><span>Forming as a unit</span></a></h2>
<p class="govuk-body">As a wise man once said (and <a href="https://www.flickr.com/photos/psd/with/26868697057/" class="govuk-link">another man once drew</a>), ‘the unit of delivery is the team’. This week we kicked off the alpha, held our first planning session, and started looking at risky assumptions. New people joining the team has meant we can get started, so we’ve focused on forming as a unit.</p>
<figure><img src="the-unit-of-delivery-is-the-team.jpg" alt="Paul Downey’s doodle of Jamie Arnold’s mantra, ‘the unit of delivery is the team’."></figure>
<h3 id="guiding-principles" tabindex="-1" class="govuk-heading-m"><a class="app-link--heading govuk-link" href="#guiding-principles"><span>Guiding principles</span></a></h3>
<p class="govuk-body">In the kick-off, Steve shared some principles to guide us through the alpha. The alpha phase is an opportunity to try out different solutions to the problem you’ve identified, finding out what works for users (and what doesn’t). To help create variety, we’ll <strong>start small, learn fast and iterate</strong> towards the design that meets users’ needs.</p>
<p class="govuk-body">We looked at the principles from <a href="https://leanuxbook.com" class="govuk-link"><em>Lean UX</em></a> too, as the alpha phase is all about velocity of learning, not velocity of delivery. The <em>Lean UX</em> principles are:</p>
<ol class="govuk-list govuk-list--number">
<li>Moving from doubt to certainty</li>
<li>Outcomes, not output</li>
<li>Removing waste</li>
<li>Shared understanding</li>
<li>No rock stars, gurus, or ninjas</li>
<li>Permission to fail</li>
</ol>
<h2 id="progress-is-better-than-perfection" tabindex="-1" class="govuk-heading-l"><a class="app-link--heading govuk-link" href="#progress-is-better-than-perfection"><span>Progress is better than perfection</span></a></h2>
<p class="govuk-body">After the kick-off, Jenny and Chanelle lead our first planning session. As we’re still waiting for key roles to join the team, the purpose was to get started on preparatory tasks and start planning ahead for testing with local planning authorities down the line.</p>
<p class="govuk-body">It was also an opportunity to test out a basic delivery model and change aspects of it now (rather than waiting for a retro). We refined how we do estimation, and we thought about how best to frame the units of work. Instead of trying to craft the perfect delivery model on paper, we dived right in and gave it a go. Progress is better than perfection.</p>
<p class="govuk-body">Steve and Gordon also chatted about how we might best keep i.‌AI’s progress aligned with MHCLG’s sprints. As i.‌AI’s work is experimental, rooted in the practices of continuous improvement, estimations don’t work too well. They’re going to <a href="https://basecamp.com/shapeup/3.4-chapter-13" class="govuk-link">use hill charts to show progress</a> instead.</p>
<h2 id="mapping-out-what-we-know-and-what-we-don-t-know" tabindex="-1" class="govuk-heading-l"><a class="app-link--heading govuk-link" href="#mapping-out-what-we-know-and-what-we-don-t-know"><span>Mapping out what we know…and what we don’t know</span></a></h2>
<p class="govuk-body">On Friday, Fabia lead a workshop to map out our assumptions about Extract, joined by Hannah (our new frontend developer), Jenny, and Steve.</p>
<p class="govuk-body">Calling out your assumptions is an essential part of any <a href="https://public.digital/pd-insights/blog/2024/12/just-what-is-test-and-learn" class="govuk-link">test &amp; learn</a> approach. Testing our assumptions allows us to find out whether we’re right or wrong, creating a feedback loop. If we’re wrong, we can change our minds, change the designs and ensure the solution we deliver works for users. If we’re right, we crack on.</p>
<p class="govuk-body">Fabia asked some questions which helped us list out everything we thought we knew:</p>
<ul class="govuk-list govuk-list--bullet">
<li>Who are the users?</li>
<li>Who won’t be a user?</li>
<li>How technical are they?</li>
<li>What are their current methods [for extracting data from documents]?</li>
<li>How do they edit shapes on a map?</li>
<li>What technologies are they using?</li>
<li>What are the inputs?</li>
<li>What are the outputs?</li>
<li>Why are they using it?</li>
<li>What happens after?</li>
</ul>
<p class="govuk-body">We wrote up what we know so far, from speaking to local planning authorities. Then we placed those on a scale of known to unknown: which facts we’re confident in, which assumptions we need more evidence to rely on.</p>
<p class="govuk-body">There’s some more work to do on that next week, and once we’ve onboarded a user researcher, we’ll start prioritising what we need to learn first and form a research plan.</p>
<h2 id="on-to-the-next-one" tabindex="-1" class="govuk-heading-l"><a class="app-link--heading govuk-link" href="#on-to-the-next-one"><span>On to the next one</span></a></h2>
<p class="govuk-body">All in all, a pretty good first week. The best way to build up steam is to light the stove, and sooner rather than later we’ll be chugging along.</p>
<p class="govuk-body"><a href="/extract-alpha/weeknotes/feed.xml" class="govuk-link">Subscribe to the feed</a> if you’d like to follow along, or check back here next week. (<a href="https://aboutfeeds.com/" class="govuk-link">Read more about how to use feeds</a>.)</p>
<p class="govuk-body"><em>Ciao!</em></p>

      ]]>
    </content>
  </entry>
  <entry>
    <title>“A strong week, a solid 9”</title>
    <link href="https://digital-land.github.io/extract-alpha/weeknotes/a-strong-week/"/>
    <updated>2025-10-31T00:00:00Z</updated>
    <id>https://digital-land.github.io/extract-alpha/weeknotes/a-strong-week/</id><author>
  <name>Steve Messer</name>
</author>
<content xml:lang="en" type="html">
      <![CDATA[
<p class="govuk-body">As Jenny said, this was a strong week. A solid 9 out of 10.</p>
<p class="govuk-body">This is our second weeknote about working on the <a href="https://www.gov.uk/service-manual/agile-delivery/how-the-alpha-phase-works" class="govuk-link">alpha phase</a> of Extract. It’s a <a href="https://doingweeknotes.com/" class="govuk-link">regular, weekly communication</a> of what we’ve done, what we’re thinking about, ideas, decisions, scrappy thoughts, and stories about our work designing and testing Extract. It’s also how we can remember what we’ve done. Written for us, but shared with you. (Read <a href="/extract-alpha/weeknotes/the-unit-of-delivery-is-the-team/" class="govuk-link">last week’s note</a> if you’d like to catch up.)</p>
<p class="govuk-body">The key theme of the week was knowledge-sharing, living up to the guiding principle of sharing understanding that we <a href="/extract-alpha/weeknotes/the-unit-of-delivery-is-the-team/#guiding-principles" class="govuk-link">established last week</a>. The more a team collectively understands what they’re doing and why, the less they need to debate what happened or why and can quickly move to utilising that learning.</p>
<h2 id="assumptions-to-validate" tabindex="-1" class="govuk-heading-l"><a class="app-link--heading govuk-link" href="#assumptions-to-validate"><span>Assumptions to validate</span></a></h2>
<p class="govuk-body">Monday started with a workshop to finish off mapping our assumptions, which we had started last week. These are the assumptions we’ll be validating or invalidating throughout the alpha, so we’re glad we got those out of our heads early.</p>
<p class="govuk-body">Though this isn’t everything we’ll be exploring, some of our assumptions include the following.</p>
<ul class="govuk-list govuk-list--bullet">
<li><strong>The need for efficiency:</strong> there’s a strong assumption that these users desperately need to reduce manual effort in extracting data from planning documents, as they’re currently spending significant time on this task.</li>
<li><strong>The need for accuracy and consistency:</strong> users require high accuracy in data extraction; inconsistencies are a major pain point.</li>
<li><strong>Integration with existing systems:</strong> there’s an implied need for the system to potentially integrate with existing planning systems – though this isn’t fully fleshed out yet.</li>
<li><strong>PDF is the dominant format:</strong> though we know that some local planning authorities have TIFF files of scanned documents, PDFs are more popular.</li>
</ul>
<p class="govuk-body">Each of these assumptions has an impact on the functionality and user experience of Extract, hence why we’re testing out different options during the alpha phase.</p>
<h2 id="re-using-code-from-other-parts-of-government" tabindex="-1" class="govuk-heading-l"><a class="app-link--heading govuk-link" href="#re-using-code-from-other-parts-of-government"><span>Re-using code from other parts of government</span></a></h2>
<p class="govuk-body">Make things open, it makes things better. That’s a <a href="https://www.gov.uk/guidance/government-design-principles#make-things-open-it-makes-things-better" class="govuk-link">government design principle</a> that encourages us to share code, share learnings, and makes it possible to get a head start by re-using other people’s work.</p>
<p class="govuk-body">Fabia, Hannah and Kevin have been looking at the <a href="https://github.com/alphagov/govuk-design-system/discussions/4531#discussioncomment-13887453" class="govuk-link">map components we can borrow from HM Land Registry and Environment Agency</a>, rather than building our own. They’ve done some great work over the years, especially on the accessibility of maps, and we’re keen to share back anything we learn too – returning the favour.</p>
<figure><img src="edit-shape-on-map.png" alt="HM Land Registry’s map component includes features our users will find useful, like editing a shape on a map. This screenshot shows their map component displaying a London street map. A sidebar offers options to search by address, draw areas, and edit the map’s features."></figure>
<figure><img src="keyboard-navigation-on-a-map.png" alt="HM Land Registry’s map component also includes accessibility features, like navigating a map with a keyboard rather than the mouse. "></figure>
<p class="govuk-body">The team started documenting where we might re-use these tools in the user journey of Extract, so we can hit the ground running when we do more prototyping.</p>
<h2 id="scribbles-diagrams-flows" tabindex="-1" class="govuk-heading-l"><a class="app-link--heading govuk-link" href="#scribbles-diagrams-flows"><span>Scribbles, diagrams, flows</span></a></h2>
<p class="govuk-body">Wednesday was a busy day, but our best collaboration session yet. Jenny, Chanelle and Fabia came down to London from Manchester, and Gavin, Gordon and Kevin left the Whitechapel Building to head over to 2 Marsham Street, the MHCLG offices. We camped out at the team table all day to work through a packed agenda.</p>
<p class="govuk-body">To kick things off, Owen introduced the technical stack of planning.data.gov.uk to the team. There were lots of similarities in approaches, which was really good to hear. Though the current prototype of Extract is built in Next.js, the team agreed that porting it over to an Express.js app now would reap rewards in the near future, helping with quicker prototyping.</p>
<p class="govuk-body">Throughout the meeting, the team members shared their expertise and experiences, aiming to align on the product’s architecture, and deployment strategies. This collaborative discussion is essential in the early stages of forming a team to build a digital product, as it fosters understanding, consensus, and efficient division of tasks among team members.</p>
<figure><img src="team-in-the-office-october.jpg" alt="Gav draws a diagram of Extract’s current architecture on a whiteboard, while Jenny shares a framework for keeping track of success criteria on another whiteboard."></figure>
<p class="govuk-body">Next up, Fabia gave everyone a run-through of the user journey she’d been drafting. It’s an early idea – and obviously not validated with users yet – but sharing feature ideas and early thinking on the user experience is always good.</p>
<p class="govuk-body">We talked about bulk uploads, per-document processing, how users would refine the outputted data, the need to undo and redo actions, how to keep data linked up with documents, human-in-the-loop versus human-over-the-loop, accessibility challenges and pre-existing workflows.</p>
<p class="govuk-body">Steve and Jenny also scoped out the plan for research and testing, optimising it for our time-poor colleagues in local planning authorities. They’re stretched and we want to make best use of the time they kindly offer us.</p>
<p class="govuk-body">It was a great day and we got a heck of a lot done.</p>
<h2 id="welcoming-a-user-researcher" tabindex="-1" class="govuk-heading-l"><a class="app-link--heading govuk-link" href="#welcoming-a-user-researcher"><span>Welcoming a user researcher</span></a></h2>
<p class="govuk-body">Our user researcher, Rob, joined us on Thursday, and Steve spent the afternoon onboarding him. That means stuffing his head with lots of context, the programme’s mission, our strategy, and how Extract will play a part.</p>
<p class="govuk-body">On Friday, Rob, Fabia, Jenny and Steve shared ideas for the first round of research, which we’ll be able to start planning. If you’re from a local planning authority and you’ve contacted us about being involved, we’ll be getting messages out very shortly!</p>
<h2 id="looking-at-red-line-boundaries" tabindex="-1" class="govuk-heading-l"><a class="app-link--heading govuk-link" href="#looking-at-red-line-boundaries"><span>Looking at red-line boundaries</span></a></h2>
<p class="govuk-body">Steve visited West Oxfordshire District Council way back in August, and despite taking lots of videos and detailing processes and hearing about pain points, one thing he didn’t manage to document was what happens when a new planning application arrives.</p>
<p class="govuk-body">Arthur and Sean demonstrated how they use Uniform and ArcGIS Pro to get red-line boundaries into their back-office systems. It was helpful not only to see the process but also to see which tools were available to the team already. It’s all valuable context which can highlight where Extract can be most useful for local planning authorities.</p>
<h2 id="next-week" tabindex="-1" class="govuk-heading-l"><a class="app-link--heading govuk-link" href="#next-week"><span>Next week</span></a></h2>
<p class="govuk-body">Next week, we’ll be</p>
<ul class="govuk-list govuk-list--bullet">
<li>prioritising risky assumptions</li>
<li>writing hypothesis statements based on those assumptions</li>
<li>planning the first round of research, and</li>
<li>recruiting research participants.</li>
</ul>
<p class="govuk-body"><a href="/extract-alpha/weeknotes/feed.xml" class="govuk-link">Subscribe to the feed</a> if you’d like to follow along, or check back here next week. (<a href="https://aboutfeeds.com/" class="govuk-link">Read more about how to use feeds</a>.)</p>

      ]]>
    </content>
  </entry>
  <entry>
    <title>Forming the research plan, and extracting trees</title>
    <link href="https://digital-land.github.io/extract-alpha/weeknotes/forming-the-research-plan-and-extracting-trees/"/>
    <updated>2025-11-07T00:00:00Z</updated>
    <id>https://digital-land.github.io/extract-alpha/weeknotes/forming-the-research-plan-and-extracting-trees/</id><author>
  <name>Steve Messer</name>
</author>
<content xml:lang="en" type="html">
      <![CDATA[
<p class="govuk-body">It’s the third week in our alpha phase and it’s been a really busy week, pulling plans together at quite a pace. If you want to catch up on the story so far, go and take a look at <a href="/extract-alpha/weeknotes/" class="govuk-link">previous weeknotes</a>.</p>
<p class="govuk-body">Here’s the highlights of what we got up to this week.</p>
<h2 id="burning-questions" tabindex="-1" class="govuk-heading-l"><a class="app-link--heading govuk-link" href="#burning-questions"><span>Burning questions</span></a></h2>
<p class="govuk-body">On Monday we prioritised our risky assumptions, identifying the key things we need to learn in order for the product to be a success. We wrote hypothesis statements to <a href="https://www.dxw.com/2020/02/using-hypotheses-to-hold-together-your-learning-thinking-and-making/" class="govuk-link">hold together our learning, thinking and making</a>, pairing clear outcomes with measures of success. These statements help us know when we’ve validated an assumption, so they’re a core part of the alpha phase.</p>
<p class="govuk-body">Our hypothesis statements took this format:</p>
<blockquote data-govspeak="blockquote">
<p class="govuk-body"><strong>We believe that</strong> [doing this/building this feature/creating this experience] <strong>for</strong> [these people/personas]<br>
<strong>will achieve</strong> [this outcome]. <br>
<strong>We will know this is true when we see</strong> [this market feedback, quantitative measure, or qualitative insight].</p>
</blockquote>
<p class="govuk-body">We also spent time calling out all our burning questions, the nuance we need to understand in order to design and deliver a product that meets users’ needs. These fell into five groups:</p>
<ul class="govuk-list govuk-list--bullet">
<li>Their process, and the end-to-end journey</li>
<li>Accuracy of output and time spent on task</li>
<li>Ownership of the output</li>
<li>Incentives to convert documents into data</li>
<li>Satisfaction with the output</li>
</ul>
<p class="govuk-body">This work is moving into a <a href="https://userresearch.blog.gov.uk/2017/02/16/how-a-knowledge-kanban-board-can-help-your-user-research/" class="govuk-link">knowledge kanban</a> to track the progress of our assumptions and learnings. We will continue to map our evolving assumptions, hypothesis and questions as we learn from the upcoming user research.</p>
<p class="govuk-body">As Erika Hall reminds us, enthusiasm isn’t a substitute for knowledge. Asking these questions will help us learn about and mitigate the <a href="https://www.svpg.com/four-big-risks/" class="govuk-link">four big risks</a> with product and service development.</p>
<h2 id="end-of-sprint-demo" tabindex="-1" class="govuk-heading-l"><a class="app-link--heading govuk-link" href="#end-of-sprint-demo"><span>End of sprint demo</span></a></h2>
<p class="govuk-body">During the end-of-sprint demo, Jordy showcased a new feature for extracting and identifying individual trees (as points) in Tree Preservation Order documents (TPOs). Previously only capable of handling area-like objects, this was achieved by integrating Gemini AI to recognise and label each tree, even providing details like species. Jordy generalised the method, meaning it can be applied to other point-like objects in future.</p>
<figure><img src="jordy-shares-tpo-feature.png" alt="Jordy demonstrates the new TPO feature."></figure>
<p class="govuk-body">Aydin also worked on adding versioning to track and changes that users make to shapes, allowing them to undo and redo changes while they’re refining the output. We expect this will be really useful for users.</p>
<p class="govuk-body">Gavin presented innovations in the georeferencing technology he’d developed. The system now performs smarter lookups for addresses, using an LLM to choose the correct address when ambiguity arises, significantly improving accuracy. He also showed improvements to pinpoint locations on maps. It can now more accurately identify and place shapes (polygons) over maps, even in challenging formats like old-style coastal maps.</p>
<figure><img src="rob-shares-research-plan.jpg" alt="Rob shares his research plan and how it came from mapping our assumptions."></figure>
<p class="govuk-body">Rob presented his research roadmap for the alpha phase. The roadmap showed how we would produce valuable, robust research outputs within five weeks, leading to an well-defined MVP.</p>
<p class="govuk-body">Rob proposed a high-level research timeline with specific focus areas for each week:</p>
<ul class="govuk-list govuk-list--bullet">
<li>Week 2: Metrics pilot (almost like prototype testing)</li>
<li>Week 3 and 4: Evaluative studies on user-centred design questions (the burning questions)</li>
<li>Week 5: Another look at metrics work, incorporating learnings from weeks 3 and 4 (aiming to show an improvement)</li>
<li>Week 6: Final documentation and handover of next steps</li>
</ul>
<p class="govuk-body">The metrics pilot (and follow-up study) aims to capture baseline data and surface necessary measures or logs for future benchmarking and success metric development. The evaluative studies will assess speed, accuracy, and overall experience using Extract compared to GIS software, with follow-up surveys to gather quantitative and qualitative data.</p>
<h2 id="a-new-sprint" tabindex="-1" class="govuk-heading-l"><a class="app-link--heading govuk-link" href="#a-new-sprint"><span>A new sprint</span></a></h2>
<p class="govuk-body">The team got together to plan the development of key features ready for testing. Though some of these have been in existence for a while, they were built into an inaccessible frontend. These are being ported over to an Express.js frontend by Kevin and Hannah at great speed, which can utilise the Extract backend.</p>
<p class="govuk-body">Fabia documented the <a href="/extract-alpha/design-history/design-thinking-for-extract/" class="govuk-link">design thinking for Extract</a>, covering some of the build-up to the start of the alpha.</p>
<p class="govuk-body">Fabia also started design research for one of the trickiest features: evaluating the AI’s output alongside the original document, in an accessible way. This falls right in the middle of the user’s journey and will set a core design constraint, helping us eliminate a big uncertainty early on. <a href="https://basecamp.com/shapeup/3.2-chapter-11#start-in-the-middle" class="govuk-link">Starting in the middle</a> can help us address design problems early on, making it quicker to design and develop features in successive weeks.</p>
<p class="govuk-body">Hannah and Kevin started looking at the who, what, where, when, why and how of frontend development, exploring where code will live, where it will be deployed, and other details. This operating model will pay dividends in coming weeks.</p>
<p class="govuk-body">Steve and Owen made a start on one of the assurance tasks, calling out how to meet the principles in the <a href="https://www.gov.uk/government/publications/ai-playbook-for-the-uk-government/artificial-intelligence-playbook-for-the-uk-government-html" class="govuk-link">AI Playbook for the UK Government</a>, which was mentioned in a <a href="https://questions-statements.parliament.uk/written-questions/detail/2025-10-22/84457" class="govuk-link">Parliamentary question and answer</a> last week.</p>
<h3 id="aligning-on-plans" tabindex="-1" class="govuk-heading-m"><a class="app-link--heading govuk-link" href="#aligning-on-plans"><span>Aligning on plans</span></a></h3>
<p class="govuk-body">Aligning two teams to work as one, with complementary methods, is tricky, especially as we’ve been moving at different speeds in the past. There’s a need to refine how <a href="https://ai.gov.uk/about/#:~:text=and%20government%20priorities.-,Incubating" class="govuk-link">incubating</a> and <a href="https://ai.gov.uk/about/#:~:text=wider%20public%20sector.-,Scaling" class="govuk-link">scaling</a> overlaps with the <a href="https://www.gov.uk/service-manual/agile-delivery/how-the-alpha-phase-works" class="govuk-link">alpha</a> and <a href="https://www.gov.uk/service-manual/agile-delivery/how-the-beta-phase-works" class="govuk-link">beta</a> phases, so that products and services can meet the <a href="https://www.gov.uk/service-manual/service-standard" class="govuk-link">Service Standard</a> and <a href="https://www.gov.uk/service-manual/agile-delivery/spend-controls-pipeline-process" class="govuk-link">spend controls</a>.</p>
<p class="govuk-body">This is a <a href="https://hr.mit.edu/learning-topics/teams/articles/stages-development" class="govuk-link">common stage of team development</a> though and will be beneficial to work through. And we’re all committed to the mission equally, so we’re moving in a common direction.</p>
<h2 id="invitations-to-join-research-and-testing" tabindex="-1" class="govuk-heading-l"><a class="app-link--heading govuk-link" href="#invitations-to-join-research-and-testing"><span>Invitations to join research and testing</span></a></h2>
<p class="govuk-body">Adding to all the progress already made, Friday ended with an achievement as the first invitation to join research and testing went out to a local planning authority! More will be going out early next week, recruiting even more people for research and testing.</p>
<p class="govuk-body">Onwards!</p>

      ]]>
    </content>
  </entry>
  <entry>
    <title>First research session booked</title>
    <link href="https://digital-land.github.io/extract-alpha/weeknotes/first-research-session-booked/"/>
    <updated>2025-11-13T00:00:00Z</updated>
    <id>https://digital-land.github.io/extract-alpha/weeknotes/first-research-session-booked/</id><author>
  <name>Steve Messer</name>
</author>
<content xml:lang="en" type="html">
      <![CDATA[
<p class="govuk-body">It’s the fourth week in the alpha and we’ve got the first research and testing session booked in the diary. It’s a win. If you want to catch up on the story so far, go and take a look at <a href="/extract-alpha/weeknotes/" class="govuk-link">previous weeknotes</a>, but here’s what happened this week.</p>
<h2 id="recruiting-research-participants" tabindex="-1" class="govuk-heading-l"><a class="app-link--heading govuk-link" href="#recruiting-research-participants"><span>Recruiting research participants</span></a></h2>
<p class="govuk-body">The week started off by selecting local planning authorities (LPAs) we’d like to do research with. We whittled a list of 31 LPAs down to 14 based on a few criteria:</p>
<ul class="govuk-list govuk-list--bullet">
<li>The LPA had published less than 5 datasets on planning.data.gov.uk</li>
<li>The LPA hadn’t yet published datasets for either <a href="https://www.planning.data.gov.uk/guidance/specifications/article-4-direction" class="govuk-link">Article 4 directions</a>, <a href="https://www.planning.data.gov.uk/guidance/specifications/conservation-area" class="govuk-link">conservation areas</a> or <a href="https://www.planning.data.gov.uk/guidance/specifications/tree-preservation-order" class="govuk-link">tree preservation orders</a></li>
<li>We research with a good mix of rural and urban authorities</li>
<li>We research with LPAs from regions across England</li>
</ul>
<p class="govuk-body">We’ve contacted LPAs in the following regions: Yorkshire and the Humber, South West, London, West Midlands, East Midlands, East of England, North West, and South East.</p>
<p class="govuk-body">Seven LPAs are urban (according to <a href="https://www.ons.gov.uk/methodology/geography/geographicalproducts/ruralurbanclassifications" class="govuk-link">rural/urban classifications from Office for National Statistics</a>), and seven are rural. In some cases, this can have an impact on the number of <a href="https://www.gov.uk/guidance/tree-preservation-orders-and-trees-in-conservation-areas" class="govuk-link">tree preservation orders</a> (TPOs) an LPA manages. (Adur &amp; Worthing councils recently shared a <a href="https://medium.com/awc-digital-design/open-digital-planning-month-5-382ee85b10b6" class="govuk-link">large, delicate TPO map</a>) in case you’ve not seen one before.)</p>
<h3 id="first-participants-recruited" tabindex="-1" class="govuk-heading-m"><a class="app-link--heading govuk-link" href="#first-participants-recruited"><span>First participants recruited!</span></a></h3>
<p class="govuk-body">And the big win of the week was recruiting the first participants! We’re really grateful to LPAs for giving up 1 or 2 hours a week to help us research and test Extract, it’s absolutely essential to ensuring we <a href="https://www.gov.uk/guidance/government-design-principles#start-with-user-needs" class="govuk-link">start with user needs</a>.</p>
<p class="govuk-body">Their Chief Technology Officer asked how Extract works, so they can think about how IT and information security policies might be affected. We’ve written <a href="/extract-alpha/a-note-on-the-technology/" class="govuk-link">a note on the technology</a> for senior leaders and IT teams who might have this question too.</p>
<h2 id="building-and-deploying-the-prototype" tabindex="-1" class="govuk-heading-l"><a class="app-link--heading govuk-link" href="#building-and-deploying-the-prototype"><span>Building and deploying the prototype</span></a></h2>
<p class="govuk-body">This week we also finalised the plans for the first prototype, figuring out the who, what, where, when and how. That includes</p>
<ul class="govuk-list govuk-list--bullet">
<li>what features will be in there</li>
<li>who’s building those</li>
<li>where we’re deploying the prototype (and how we’ll use pre-production environments in parallel)</li>
<li>when we’re having a code-freeze on model improvements</li>
<li>when we’re deploying the prototype (ahead of the first research session, for testing), and</li>
<li>how we’re working in phases, building version 0.1 while shaping up features in version 0.2.</li>
</ul>
<p class="govuk-body">We’re borrowing approaches from <a href="https://dovetail.com/product-development/what-is-dual-track-agile/" class="govuk-link">dual-track agile</a> which should increase our velocity, helping us make the most use of the limited time we have available. For example, while Hannah is building version 0.1, Fabia and Kevin are exploring accessible shape editing features in fullscreen mode (for version 0.2). (Re-using <a href="https://github.com/LandRegistry/search-local-land-charges-ui" class="govuk-link">HM Land Registry’s map component</a> has helped us move quickly here.)</p>
<figure><img src="editing-shapes-on-maps-video.gif" alt="An example of Kevin’s work on editing shapes from earlier in the week."></figure>
<p class="govuk-body">Working in these offset phases – or dual tracks – should help us make the most of each other’s strengths. We’ll experiment with the approach and iterate when we need to.</p>
<p class="govuk-body">Here’s a first look at version 0.1 taking shape.</p>
<figure><img src="first-look-at-extract.png" alt="The first part of the journey is uploading planning documents."></figure>
<h2 id="improving-accuracy" tabindex="-1" class="govuk-heading-l"><a class="app-link--heading govuk-link" href="#improving-accuracy"><span>Improving accuracy</span></a></h2>
<p class="govuk-body">Gavin, Jordy and Rich are constantly looking at ways to improve the technical accuracy of Extract, based on some metrics set when the incubation phase was started. This continuous improvement mindset has already delivered impressive results, and it’ll keep quality high as we encounter new variations in document styles.</p>
<p class="govuk-body">A couple weeks ago we spoke about ideas for using user behaviour to improve the accuracy of extracted shapes, and Gavin demo’d some experiments that had been happening. Whenever a user adjusts the position of a shape on the map, or adjust the lines and points that make up the shape, these corrections by users can be fed back to the model to improve outputs going forwards.</p>
<figure><img src="accuracy-modifier-engine.png" alt="A blue shape and a red shape on a map. The blue shape shows the original extraction, and the red shape shows edits the user has made to the shape."></figure>
<p class="govuk-body">This ‘accuracy modifier engine’ can also help us see how well the AI’s actions satisfy the user’s intent. Incorporating this kind of human feedback is crucial, but we’ll also be asking for qualitative feedback from users too.</p>
<p class="govuk-body">More details on this work in future.</p>
<h2 id="using-ai-as-a-new-material" tabindex="-1" class="govuk-heading-l"><a class="app-link--heading govuk-link" href="#using-ai-as-a-new-material"><span>Using AI as a new material</span></a></h2>
<p class="govuk-body">Fabia and Steve had a think about an emerging design principle for the work, making more use of AI as a new material.</p>
<p class="govuk-body">We don’t want to build another GIS tool. Users have those. Using AI as a new material means speaking to the agent to shape things, the user’s intent becoming the agent’s actions. (And if this is too tricky, we can fall back to established tools and patterns.)</p>
<p class="govuk-body">In some cases this could be quicker than editing point-by-point, though we’re aware that’s a behaviour users will be familiar with. It’s early thinking we’ll look to play with down the line.</p>
<h2 id="next-steps" tabindex="-1" class="govuk-heading-l"><a class="app-link--heading govuk-link" href="#next-steps"><span>Next steps</span></a></h2>
<p class="govuk-body">Next week is all about getting the first research session completed and more research booked into the calendar. The sooner we can get to insights, the better, and we’re really chugging along now.</p>
<p class="govuk-body">Onwards!</p>

      ]]>
    </content>
  </entry>
  <entry>
    <title>Benchmarking begins</title>
    <link href="https://digital-land.github.io/extract-alpha/weeknotes/benchmarking-begins/"/>
    <updated>2025-11-21T00:00:00Z</updated>
    <id>https://digital-land.github.io/extract-alpha/weeknotes/benchmarking-begins/</id><author>
  <name>Jenny Colebourn</name>
</author>
<content xml:lang="en" type="html">
      <![CDATA[
<p class="govuk-body">It’s the fifth week in the alpha and we’ve reached an important point: Extract 0.1 MVP is in the hands of users, with our first cohort of LPAs. A small step for Extract but a massive leap for product validation. If you want the backstory, the previous weeknotes cover it. Here’s what happened this week.</p>
<h2 id="mvp-0-1-release-and-testing" tabindex="-1" class="govuk-heading-l"><a class="app-link--heading govuk-link" href="#mvp-0-1-release-and-testing"><span>MVP 0.1 release and testing</span></a></h2>
<p class="govuk-body">Finally, after weeks of anticipation and preparation, we’ve put Extract into the hands of our users with the release of Extract 0.1 MVP. Earlier in the week, we began our first research sessions. We’re looking to understand how LPAs are digitising planning documents at the moment, without Extract, to get benchmark timings for the current process. We’re also carrying out research sessions with stripped back features so we can validate our core feature set and prioritise future features.</p>
<h2 id="3-s-the-magic-number" tabindex="-1" class="govuk-heading-l"><a class="app-link--heading govuk-link" href="#3-s-the-magic-number"><span>3’s the magic number</span></a></h2>
<p class="govuk-body">The team also began looking into the newly released <strong>Gemini 3</strong> and <strong>Segment Anything 3</strong> models. We think there may be meaningful improvements throughout the extraction process.</p>
<p class="govuk-body">We’re currently experimenting with how to integrate these into Extract as soon as possible and get their maximum performance. From initial experiments it should lead to a significant boost as well as speeding up the process.</p>
<p class="govuk-body">Below are examples of how precise the tools have been. The current version of Extract struggles with precision of results from these harder maps, so these images show how the new models can really boost our accuracy even further.</p>
<p class="govuk-body"><img src="image1.jpg" alt="Example 1"><br>
<img src="image2.jpg" alt="Example 2"><br>
<img src="image3.jpg" alt="Example 3"></p>
<h2 id="collaboration-and-new-ways-of-working" tabindex="-1" class="govuk-heading-l"><a class="app-link--heading govuk-link" href="#collaboration-and-new-ways-of-working"><span>Collaboration and new ways of working</span></a></h2>
<p class="govuk-body">As part of our alpha ways of working, the MHCLG and i.AI teams spent time refining how we plan and deliver together. We’re shaping a process that lets both teams test and research independently while staying aligned on priorities. This includes shared release planning, clearer and optimised agile ceremonies, and a coordinated research, delivery and release plan.</p>
<h2 id="next-steps" tabindex="-1" class="govuk-heading-l"><a class="app-link--heading govuk-link" href="#next-steps"><span>Next steps</span></a></h2>
<p class="govuk-body">We’re still in alpha, and much of our work continues to focus on the benefits model and understanding the problem space. Testing the 0.1 MVP is already giving us insight into what LPAs need, and will help us prioritise the backlog and identify the core functionality for a 1.0 release next year.</p>
<p class="govuk-body">Our focus next week is preparing the 0.2 MVP release and deepening our experiments with the new models. We’ll also continue research sessions with LPAs to validate what’s working in the 0.1 MVP, what isn’t, and what this means for our priorities as we build towards 1.0.</p>

      ]]>
    </content>
  </entry>
  <entry>
    <title>Early insights from users</title>
    <link href="https://digital-land.github.io/extract-alpha/weeknotes/early-insights-from-users/"/>
    <updated>2025-11-28T00:00:00Z</updated>
    <id>https://digital-land.github.io/extract-alpha/weeknotes/early-insights-from-users/</id><author>
  <name>Steve Messer</name>
</author>
<content xml:lang="en" type="html">
      <![CDATA[
<p class="govuk-body">Through Monday’s toil to Friday’s gleaming, here’s the sixth weeknote from the team.</p>
<h2 id="what-we-ve-heard-from-users-so-far" tabindex="-1" class="govuk-heading-l"><a class="app-link--heading govuk-link" href="#what-we-ve-heard-from-users-so-far"><span>What we’ve heard from users so far</span></a></h2>
<p class="govuk-body">You’ll remember from <a href="/extract-alpha/weeknotes/benchmarking-begins/" class="govuk-link">last week</a> that we started the first benchmarking session, and early this week it finished.</p>
<p class="govuk-body">In this first round of user research, we tested an interactive prototype of Extract with five users across four LPAs. This ‘<a href="https://www.svpg.com/minimum-viable-product/#:~:text=the%20point%20of%20these%20MVP%20Tests%2C%20which%20is%20to%20get%20to%20this%20notion%20of%20Product%20Market%20Fit.%20%C2%A0That%E2%80%99s%20when%20we%20actually%20have%20a%20viable%20product." class="govuk-link">MVP Test</a>’ allowed us to observe real users interacting with the tool in their actual context of use.</p>
<p class="govuk-body">Early analysis indicates that the core concept is well understood and shows strong potential to deliver effective, highly valued solutions.</p>
<p class="govuk-body">One promising insight is that users are already thinking about what the prototype might enable after an extraction has been reviewed. Even when an extraction was less successful, users were interested in being able to easily transfer and reuse parts of the data within their own systems and tools.</p>
<p class="govuk-body">As one participant explained:</p>
<blockquote data-govspeak="blockquote">
<p class="govuk-body">This would definitely speed up the process, but the question is: how transferable is the data is to back-office systems?</p>
</blockquote>
<p class="govuk-body">Seamless integration and reuse of extracted data within users’ existing systems in an important part of the proposition, so we’ll need to dedicate some research to help us achieve it.</p>
<p class="govuk-body">Insights from this study are helping us:</p>
<ul class="govuk-list govuk-list--bullet">
<li>deepen our understanding of the problem space</li>
<li>evaluate the technology from a user perspective, and</li>
<li>inform user-centred design and engineering decisions.</li>
</ul>
<p class="govuk-body">As we iterate the prototypes, they will continue to serve as research tools to probe broader, less-defined factors that influence implementation, governance, and trust, ensuring the solution aligns with <a href="https://paulgraham.com/users.html#:~:text=how%20well%20you%20understand%20them%2C%20and%20even%20how%20much%20they%20need%20what%20you%27re%20making" class="govuk-link">users’ expectations</a> and integrates seamlessly within their wider ecosystems.</p>
<p class="govuk-body">Early next week we’ll firm up the plan for the second round of research.</p>
<h2 id="merging-ideas" tabindex="-1" class="govuk-heading-l"><a class="app-link--heading govuk-link" href="#merging-ideas"><span>Merging ideas</span></a></h2>
<p class="govuk-body">Fabia, Hannah, Kev and Aydin spent time interrogating and critiquing multiple user flows, looking at the many options we have for features and the whole user journey. This was to help with merging the diverse ideas from both teams into one cohesive flow.</p>
<p class="govuk-body">The designers and frontend developers also spoke about the constraints that shape what we design and build. For example, there is not always a one-to-one relationship between an area and the planning documents that describe it. A user may need to upload multiple files (a map, a design statement, a legal document) to extract all the information about one place. How might we help users combine documents or combine extractions for one entity?</p>
<p class="govuk-body">The conversation also moved on to how we’ll merge multiple strands of work in the codebase. Working from one consistent codebase, using and iterating the shared components, will help us move faster – essential for rapid prototyping, and helpful for the future too.</p>
<h2 id="lunch-together" tabindex="-1" class="govuk-heading-l"><a class="app-link--heading govuk-link" href="#lunch-together"><span>Lunch together</span></a></h2>
<p class="govuk-body">One highlight of working together in Marsham Street on Wednesdays is lunchtime. For one hour, we step away from the work and get a chance to chat, to catch up – and, most importantly, devour a plate of pie, chips and gravy!</p>
<p class="govuk-body">Making these social bonds isn’t just fun, it also helps the work <em>work</em>. So that we don’t only have a shared commitment to the goal but also to each other.</p>
<p class="govuk-body">After lunch, Gordon kindly shared a gift he’d bought for each member of the team – an Extract <a href="https://www.instagram.com/p/C06VTlKOXyY/?utm_source=ig_web_copy_link&amp;igsh=MzRlODBiNWFlZA==" class="govuk-link">mission patch</a>!</p>
<figure><img src="extract-sticker-on-laptop.jpg" alt="A laptop is covered in vibrant stickers – including the shiny Extract mission patch in pink."></figure>
<h2 id="planning-for-version-0-2" tabindex="-1" class="govuk-heading-l"><a class="app-link--heading govuk-link" href="#planning-for-version-0-2"><span>Planning for version 0.2</span></a></h2>
<p class="govuk-body">Off the back of the research sessions, we started planning version 0.2 – what would be in it, and how it would work. Before we make a start on it though, we need to consolidate the codebase and plan who’s working on what. That’ll happen early next week.</p>
<p class="govuk-body">We’re going to trial something new, pairing a developer with a designer so they can work in unison. The goal is to eliminate the hand-off between design and development, instead figuring out the constraints and possible options together. We want to encourage creativity too, and coming up with lots of options. As Linus Pauling said:</p>
<blockquote data-govspeak="blockquote">
<p class="govuk-body">If you want to have good ideas you must have many ideas. Most of them will be wrong, and what you have to learn is which ones to throw away.</p>
</blockquote>
<p class="govuk-body"><a href="https://interconnected.org/home/2023/11/24/digestion" class="govuk-link">Get the obvious ideas out of the way and together we’ll come up with the good ones</a>.</p>
<h2 id="continued-experimentation-with-the-latest-ai-models" tabindex="-1" class="govuk-heading-l"><a class="app-link--heading govuk-link" href="#continued-experimentation-with-the-latest-ai-models"><span>Continued experimentation with the latest AI models</span></a></h2>
<p class="govuk-body">Gavin, Jordy and Rich have carried on experimenting with the latest versions of Gemini and Segment Anything. The new technology is performing well on the hardest maps we’ve tested.</p>
<figure><img src="better-output-from-gemini-3.png" alt="A 1965 map of Baston, Lincolnshire, showing a tree preservation order with marked trees highlighted in red on a detailed survey plan."></figure>

      ]]>
    </content>
  </entry>
  <entry>
    <title>Making improvements for users</title>
    <link href="https://digital-land.github.io/extract-alpha/weeknotes/making-improvements-for-users/"/>
    <updated>2025-12-05T00:00:00Z</updated>
    <id>https://digital-land.github.io/extract-alpha/weeknotes/making-improvements-for-users/</id><author>
  <name>Steve Messer</name>
</author>
<content xml:lang="en" type="html">
      <![CDATA[
<p class="govuk-body">It’s the seventh week of the alpha phase of Extract. We’ve been working on two timelines this week: what we’re doing over the next two weeks, and what we may be doing in the next couple of months. Acting on the now, and thinking ahead.</p>
<h2 id="making-improvements" tabindex="-1" class="govuk-heading-l"><a class="app-link--heading govuk-link" href="#making-improvements"><span>Making improvements</span></a></h2>
<p class="govuk-body">The first research session generated a series of insights which we’ve put into 3 categories:</p>
<ul class="govuk-list govuk-list--bullet">
<li>Concept: the overarching idea &amp; value proposition of the solution.</li>
<li>Context: the environment and workflows in which the solution operates.</li>
<li>Usability: the practical experience of interacting with the solution.</li>
</ul>
<p class="govuk-body">The insights we’ve categorised as conceptual and contextual can impact the success of Extract as a product as much as the usability insights, but we’re better able to act on and improve the usability issues. So we’ve been working on making those improvements before addressing the conceptual and contextual insights that were raised.</p>
<h3 id="human-computer-dynamics" tabindex="-1" class="govuk-heading-m"><a class="app-link--heading govuk-link" href="#human-computer-dynamics"><span>Human-computer dynamics</span></a></h3>
<p class="govuk-body">Ethan Mollick shared the concept of ‘centaurs’ in his book <em>Co-Intelligence</em>. Working like a centaur means giving AI tasks you hate but can easily check, seeing whether it speeds things up or allows you to focus on other, more meaningful tasks.</p>
<p class="govuk-body">Compared to humans, AI is not yet good at many things but does produce good outputs in a lot of other areas. Ethan Mollick called this ‘the jagged frontier’, and it’s why taking the centaur approach is useful. If AI cannot produce perfect outputs but can create something good enough – something you can improve and work with – the outputs from AI can still be useful. Indeed, that’s what users indicated to us in <a href="https://digital-land.github.io/extract-alpha/weeknotes/early-insights-from-users/#:~:text=Even%20when%20an%20extraction%20was%20less%20successful%2C%20users%20were%20interested%20in%20being%20able%20to%20easily%20transfer%20and%20reuse%20parts%20of%20the%20data%20within%20their%20own%20systems%20and%20tools." class="govuk-link">the first research session</a>.</p>
<p class="govuk-body">It’s clear that humans being able to smooth out the rough edges where AI isn’t good enough may be a factor in real-world AI adoption for some time. Helen Toner, the interim executive director at Georgetown’s Center for Security and Emerging Technology, <a href="https://helentoner.substack.com/p/taking-jaggedness-seriously" class="govuk-link">wrote about that recently</a>, saying:</p>
<blockquote data-govspeak="blockquote">
<p class="govuk-body">We should be thinking more about human-computer interaction dynamics, human factors, user experience, user interface. What does the trust between those parties look like? How do you maximize the benefit that you’re able to get from those AI systems by designing that human-AI system well? As opposed to just skipping to the end and assuming, well, at some point the AI is going to automate all of it, so who cares about the human-computer part.</p>
</blockquote>
<p class="govuk-body">There’s lots we can learn in this space – human-computer interaction – that can be useful. Addressing the usability issues first can also help us draw out more of the conceptual and contextual issues (which require more evidence to prove) in further studies, so it feels sensible to look at these first.</p>
<h3 id="usability-issues" tabindex="-1" class="govuk-heading-m"><a class="app-link--heading govuk-link" href="#usability-issues"><span>Usability issues</span></a></h3>
<p class="govuk-body">Our first prototype was stripped back, a basic flow and set of tools allowing users to upload a document, extract the data, and approve or reject the extraction. We didn’t include any editing tools as we wanted to understand whether it was good enough as it was. We expected to be proven wrong, and that ended up being true.</p>
<p class="govuk-body">Because the prototype was so minimal, a few issues came up. However, except the lack of editing tools, these issues weren’t major. On the whole, users understood what to do, completed the task in under 10 minutes, and a couple of participants even downloaded the data to refine further elsewhere. Not bad.</p>
<p class="govuk-body">Some of the issues included:</p>
<ul class="govuk-list govuk-list--bullet">
<li>extraction time duration and signposting</li>
<li>only being able to upload PDFs</li>
<li>not realising you could drag &amp; drop files (but being used to doing that)</li>
<li>how we designed the review stage, where users compare the original document to the extracted data</li>
<li>matching map data with related text data</li>
</ul>
<p class="govuk-body">We spent the majority of the week addressing these through interface changes, a complete re-design of the user journey, and better content design (clearer labelling, improved instructions). We believe these should make it easier for users to use Extract, and the addition of editing tools may influence more users to approve and export the outputted data.</p>
<p class="govuk-body">We’re running another round of research and testing next week, and we can’t wait!</p>
<h3 id="the-editing-tool" tabindex="-1" class="govuk-heading-m"><a class="app-link--heading govuk-link" href="#the-editing-tool"><span>The editing tool</span></a></h3>
<p class="govuk-body">In a previous weeknote, we shared how we’d <a href="https://digital-land.github.io/extract-alpha/weeknotes/first-research-session-booked/#:~:text=Re%2Dusing%20HM%20Land%20Registry%E2%80%99s%20map%20component" class="govuk-link">re-used HM Land Registry’s map component</a> as the basis for our own. We’ve been building on top of that, adding more features for the upcoming testing.</p>
<p class="govuk-body">Firstly we’ve developed new shape creation tools, allowing users to snap lines and points to the map for greater accuracy. Some users mentioned that they draw lines on the side of the road, some mentioned they draw lines in the middle of the road, and our tools should support both use cases.</p>
<p class="govuk-body">We’ve also added the option to toggle satellite view, which is especially helpful in viewing whether a tree is placed correctly.</p>
<figure><img src="new-map-tools.gif" alt="A map component with tools to toggle satellite view, plus change, move, rotate, cut, add and delete shapes."></figure>
<p class="govuk-body">We’ve also been working on the accessibility features, making it possible to navigate the map, move shapes and edit shapes using a keyboard. When you get feedback from people with disabilities early in the design and build of a product or service, problems become easier to fix, and you come up with solutions that benefit everyone. We wanted to get this in now rather than later, when it’s more expensive to address.</p>
<figure><img src="map-accessibility.gif" alt="A map component with keyboard navigation tools for accessibility."></figure>
<h2 id="gemini-3" tabindex="-1" class="govuk-heading-l"><a class="app-link--heading govuk-link" href="#gemini-3"><span>Gemini 3</span></a></h2>
<p class="govuk-body">Google’s newest release, Gemini 3 Pro, is now integrated into Extract. Gemini 3 is better at deciphering and pulling information from legacy records, is more reliable, and excels at vision tasks.</p>
<p class="govuk-body">We tested a single-shot prompt from Gemini 3 Pro against our current multi-stage tree preservation order (TPO) extractor (Gemini 2.5 Flash) and noticed significant improvements.</p>
<ul class="govuk-list govuk-list--bullet">
<li><strong>Speed</strong>: Single-shot Gemini 3 Pro is comparable in time to our previous method.</li>
<li><strong>Reliability</strong>: It is highly reliable, returning successfully on every one of the hundreds of examples tested.</li>
<li><strong>Accuracy</strong>: Extraction of trees from TPO documents is much more accurate. The F1 score (a measure of the overall quality of a predictor) is consistently higher. Crucially, Gemini 3 is far more likely to get the tree’s location exactly right (within one marker’s width).</li>
</ul>
<h2 id="extending-the-alpha" tabindex="-1" class="govuk-heading-l"><a class="app-link--heading govuk-link" href="#extending-the-alpha"><span>Extending the alpha</span></a></h2>
<p class="govuk-body">We’ve decided to extend the alpha period into next year. Originally we set ourselves a tight timebox, trying to learn as much as possible inside 8 weeks. Though we held one research session and have another coming up next week, the insights we’re getting back indicate that we’ll need time to learn more.</p>
<p class="govuk-body">It’s not surprising. Artificial intelligence is a new technology, there aren’t well established patterns for interacting with it yet, and the planning domain requires precision. Quality and trust are major factors in doing this well.</p>
<p class="govuk-body">Spending more time with users, collecting more insights, will massively improve our ability to mitigate risks and <a href="https://www.atlassian.com/blog/jira-product-discovery/how-to-build-the-right-thing-and-prevent-building-the-wrong-ones" class="govuk-link">build the right thing</a>. We’re extending the alpha phase by another 4 weeks, covering January 2026.</p>
<h2 id="thinking-ahead-to-the-beta" tabindex="-1" class="govuk-heading-l"><a class="app-link--heading govuk-link" href="#thinking-ahead-to-the-beta"><span>Thinking ahead to the beta</span></a></h2>
<p class="govuk-body">We’ve also been thinking ahead to the <a href="https://www.gov.uk/service-manual/agile-delivery/how-the-beta-phase-works" class="govuk-link">beta phase</a>, where we take our best idea from alpha and start building it for real.</p>
<p class="govuk-body">We want to make sure that local planning authorities are available for research and testing, to be part of the private beta, hence why we’re thinking about this now. We can’t build and improve Extract without their input, and they rightfully need time to plan how they’d get involved.</p>
<p class="govuk-body">We’ll share firmer plans in the next couple of weeks, but if you’re a local planning authority with thousands of planning documents you need to convert into data (tree preservation orders, Article 4 directions and conservation areas), <a href="https://www.planning.data.gov.uk/about/contact" class="govuk-link">get in touch</a>.</p>
<h2 id="and-finally" tabindex="-1" class="govuk-heading-l"><a class="app-link--heading govuk-link" href="#and-finally"><span>And finally…</span></a></h2>
<p class="govuk-body">Goodbye to Jenny, one of our product managers, who’s leaving us today. Jenny’s done a lot on Extract over this year, and we’re really grateful for all her input.</p>
<p class="govuk-body">Bye Jenny!</p>

      ]]>
    </content>
  </entry>
  <entry>
    <title>Second round of research</title>
    <link href="https://digital-land.github.io/extract-alpha/weeknotes/second-round-of-research/"/>
    <updated>2025-12-12T00:00:00Z</updated>
    <id>https://digital-land.github.io/extract-alpha/weeknotes/second-round-of-research/</id><author>
  <name>Steve Messer</name>
</author>
<content xml:lang="en" type="html">
      <![CDATA[
<p class="govuk-body">This is going to be a short weeknote, so that we’ve got more time for analysing the evidence that came out of the research, turning them into robust insights.</p>
<h2 id="design-rationale" tabindex="-1" class="govuk-heading-l"><a class="app-link--heading govuk-link" href="#design-rationale"><span>Design rationale</span></a></h2>
<p class="govuk-body">Earlier in the week, Fabia added more notes on the design of our first two prototypes to <a href="/extract-alpha/design-history/Prototyping%20alpha/" class="govuk-link">our design history</a>. It explains why we stripped away features we figured we’d need, talks about the Frankenstein phase of combining two frontends using collaborative ‘mix-and-match’ prototyping, and talks about how what we learned from users fed into later iterations of the design.</p>
<p class="govuk-body"><a href="/extract-alpha/design-history/Prototyping%20alpha/" class="govuk-link">Check it out</a>!</p>
<h2 id="rapid-pre-release-testing" tabindex="-1" class="govuk-heading-l"><a class="app-link--heading govuk-link" href="#rapid-pre-release-testing"><span>Rapid pre-release testing</span></a></h2>
<p class="govuk-body">Before we get into the research that happened this week, it’s worth calling out the excellent effort the team put in for testing the second prototype internally, before we released it to production for testing with users.</p>
<p class="govuk-body">After deploying the second prototype to staging, everyone started running through the prototype, looking out for inconsistencies, bugs and broken things.</p>
<p class="govuk-body">A couple of show-stoppers popped up. Instead of sorting these out in isolation, Hannah and Kev treated it like an incident and hopped on a call, debugging the problems and figuring out a way forward. Gavin joined later to help out too, and incredibly we got it all sorted in a couple of hours.</p>
<p class="govuk-body">The prototype needed to be ready on production by 3pm. It was online and sparkling clean by 2.45pm. Top work from all involved!</p>
<h2 id="second-round-of-research" tabindex="-1" class="govuk-heading-l"><a class="app-link--heading govuk-link" href="#second-round-of-research"><span>Second round of research</span></a></h2>
<p class="govuk-body">Rob ran the second round of research, speaking with five people from four local planning authorities. Most of the users were GIS officers, although one person worked in planning policy, which surfaced lots of interesting points.</p>
<p class="govuk-body">We’re not making any conclusions just yet nor sharing any early insights. We’ve got another session on Monday, meaning we’ll have tested with six people from five local planning authorities  come lunchtime next Monday.</p>
<p class="govuk-body">After that, we’ll be analysing the evidence, forming insights, and ready to end the year on a high. We’ll come back in the new year to continue the alpha.</p>
<h2 id="adapting-for-the-unexpected" tabindex="-1" class="govuk-heading-l"><a class="app-link--heading govuk-link" href="#adapting-for-the-unexpected"><span>Adapting for the unexpected</span></a></h2>
<p class="govuk-body">Researching, designing and developing products and services is not a linear process. Even though we describe it as a linear process with a start and an end, it’s <a href="https://www.producttalk.org/continuous-discovery-across-the-organization/#:~:text=While%20I%20described%20this%20process%20linearly%2C%20it%E2%80%99s%20not%20a%20linear%20process.%20Design%20is%20a%20messy%2C%20iterative%20process%20that%20often%20loops%20back%20on%20itself.%20When%20we%20learn%20that%20a%20particular%20solution%20doesn%E2%80%99t%20work%2C%20we%20also%20learn%20something%20new%20about%20the%20opportunity%20space." class="govuk-link">actually more like a loop</a>.</p>
<p class="govuk-body">Right now, we’re exploring solutions to a problem. Problems are seldom well-defined, coherent, and tractable. So when we’re testing a prototype, what we hear from users often tells us things about the problem we didn’t previously understand or know about. We’re continually discovering new insights we didn’t expect, testing and learning and testing and learning.</p>
<p class="govuk-body">Test and learn relies on a culture of learning, expecting the unexpected, and <a href="https://public.digital/pd-insights/blog/2024/12/just-what-is-test-and-learn#:~:text=a%20willingness%20to%20accept%20new%20information%20and%20change%20the%20plan%20in%20response" class="govuk-link">a willingness to accept new information and change the plan in response</a>. And new information is coming up suggesting we’ll need to adapt the plan a little bit.</p>
<p class="govuk-body">Early next year we’ll need to do some discovery-type research to better understand users’ needs, tasks, motivations and goals. There’s high variation in how they produce and maintain planning data (which doesn’t always include working with documents), and mapping this landscape a little more will teach us about where we need to focus.</p>
<p class="govuk-body">It may even suggest some other forms of prototype we hadn’t considered before, and it’ll definitely help with planning a private beta too.</p>

      ]]>
    </content>
  </entry>
  <entry>
    <title>Meetings and partings</title>
    <link href="https://digital-land.github.io/extract-alpha/weeknotes/meetings-and-partings/"/>
    <updated>2025-12-23T00:00:00Z</updated>
    <id>https://digital-land.github.io/extract-alpha/weeknotes/meetings-and-partings/</id><author>
  <name>Steve Messer</name>
</author>
<content xml:lang="en" type="html">
      <![CDATA[
<p class="govuk-body">Another short weeknote. This week was all about analysing the evidence that came out of research, looking at the findings, forming insights, tying up loose ends, and putting the project on ice for over the festive period. The team has earned a break to say the least.</p>
<p class="govuk-body">As previously mentioned, we’re going to continue the alpha phase in January to gather more evidence. We’ve tested two prototypes with eight users from seven local authorities but don’t feel like we’ve got conclusive evidence just yet. Collecting more will help us learn whether we can effectively meet users’ needs.</p>
<p class="govuk-body">So, what were the key findings and insights?</p>
<h2 id="what-we-learned" tabindex="-1" class="govuk-heading-l"><a class="app-link--heading govuk-link" href="#what-we-learned"><span>What we learned</span></a></h2>
<h3 id="georeferencing-is-the-foundation" tabindex="-1" class="govuk-heading-m"><a class="app-link--heading govuk-link" href="#georeferencing-is-the-foundation"><span>Georeferencing is the foundation</span></a></h3>
<p class="govuk-body">Overwhelmingly, users found Extract’s ability to look at old maps and locate the same place on a current map hugely helpful. This is known as georeferencing and is the most important part of creating geospatial data from documents. It’s also the hardest and most time-consuming job when working with historical maps. If you’re looking at an old map and a piece of land is developed or a track is turned into a junction, it can be hard to work out where it is on a newer map.</p>
<p class="govuk-body">Some quotes from users involved in testing show you how foundational a job it is for turning old maps into data.</p>
<blockquote data-govspeak="blockquote">
<p class="govuk-body">The first stage is to get it right with the georeferencing. That is the foundation. So if you get it wrong with that, there’s a 95% chances of you getting it completely wrong when it comes to the spatial position and accuracy of the feature itself.</p>
</blockquote>
<blockquote data-govspeak="blockquote">
<p class="govuk-body">Georeferencing and digitising is complex work, so this can help people.</p>
</blockquote>
<blockquote data-govspeak="blockquote">
<p class="govuk-body">Auto-georeferencing is good because that’s a cumbersome process.</p>
</blockquote>
<p class="govuk-body">Some users mentioned wanting to get this feature into their own GIS tools, so we’re looking into GeoTIFFs – a way of embedding geographic metadata into images. Our understanding is that this may let users draw shapes in their own tools, so it’s an alternative solution we’ll start prototyping in the new year.</p>
<h3 id="bringing-colleagues-along" tabindex="-1" class="govuk-heading-m"><a class="app-link--heading govuk-link" href="#bringing-colleagues-along"><span>Bringing colleagues along</span></a></h3>
<p class="govuk-body">Despite that, we also heard that from users that they’d like to invite others to work on digitising documents with them. Geospatial information system (GIS) officers are in high demand at local authorities and often work across many service areas. As a result, they’re stretched across many projects and may not always have the time to spend on lengthy projects – like digitising historical planning information, for example.</p>
<p class="govuk-body">Users suggested that Extract might help create efficiencies at local authorities by allowing GIS officers to pass some digitisation work to colleagues without GIS skills, allowing the GIS officer to ‘sign off’ anything produced by their colleagues.</p>
<h3 id="accuracy-of-shapes-is-very-important" tabindex="-1" class="govuk-heading-m"><a class="app-link--heading govuk-link" href="#accuracy-of-shapes-is-very-important"><span>Accuracy of shapes is very important</span></a></h3>
<p class="govuk-body">Users also said that the accuracy of the shapes outputted was paramount. Few users found the shapes drawn by the AI agentic workflow to be satisfactory enough to adopt, and many users redrew the shapes themselves. There’s an opportunity to look into whether we can encourage the model to produce more ‘regular’ shapes or with fewer points, to increase accuracy and user satisfaction.</p>
<p class="govuk-body">That’s just a couple of insights from a wealth of findings, but it’s enough for us to pause and consider the next steps carefully.</p>
<h2 id="challenging-preconceived-assumptions" tabindex="-1" class="govuk-heading-l"><a class="app-link--heading govuk-link" href="#challenging-preconceived-assumptions"><span>Challenging preconceived assumptions</span></a></h2>
<p class="govuk-body">Stepping back from it all, it’d be sensible to <a href="https://www.designreview.byu.edu/collections/design-thinking-part-4-framing-and-reframing-design-problems" class="govuk-link">reframe the problem</a> and truly understand its scale. From what we’ve heard from users, not all local planning authorities need to convert documents in order to prepare or provide data to national standards. Some times they already have data but it’s patchy, and some times it’s just hard to find the time to transform it from one schema to another.</p>
<p class="govuk-body">These details affect not only how we design and build Extract, but also how we launch it and onboard users. Ultimately it’s about making sure we <a href="https://www.gov.uk/service-manual/user-research/start-by-learning-user-needs" class="govuk-link">learn about users and their needs</a> so that we can build the right thing.</p>
<h2 id="that-s-a-wrap-for-now" tabindex="-1" class="govuk-heading-l"><a class="app-link--heading govuk-link" href="#that-s-a-wrap-for-now"><span>That’s a wrap…for now</span></a></h2>
<p class="govuk-body">It’s been a hugely exciting year though. This time last year, extracting data from documents using AI was just an idea, but we’ve developed a novel method to do it with multimodal large language models.</p>
<p class="govuk-body">As mentioned earlier, the team have properly earned their break over the next couple of weeks, and we’re really looking forward to starting up again in January.</p>
<p class="govuk-body">Anyway, now onto my speech, my Christmas speech: <a href="https://www.youtube.com/watch?v=paSUlf4rCpA" class="govuk-link">Thank you, all, and Merry Christmas</a>.</p>

      ]]>
    </content>
  </entry>
</feed>
