Skip to content

Distinguish facility IDs and building ones and start index both  #5

@Abbe98

Description

@Abbe98

Probably happening late December(required for Kyrkosok/web-client#23).

  • Figure out the exact value of the BBR ID change break point , see Template:BBR-länk for estimated values then do a loop with HTTP requests to get the exact one. Once done update the template too.

  • Run the kulturarvsdata-prefer-rdf.py bot.

  • Check for duplicate statements(should be none or very few), have seen something for this task over att Tool Labs.

  • Start indexing the WLM lists on sv.wikipedia.org to a CSV or SQLite file(index only WP articles and BBR URIs?)

  • check this list against existing data in Wikidata. Look for conflicts and data which exists only in Wikidata(which should not be the case).

  • fix any data that needs fixing

  • add Wikipedia articles for all the WLM BBR items missing one(if Geonames can be a source for bot created articles anything can be a source).

  • Index a new CSV or SQLite file from the WLM tables.

  • Import all the missing data to Wikidata.

  • start indexing both facility and building IDs(breaks the API). Use the "BBR ID change break point" if it's a fuzzy one create a buffer were all IDs gets verified using HTTP requests(the way all currently are validated).

  • Add all the Wikidata IDs to the WLM lists on sv.wikipedia.org and notify the folks over at Phabricator. Research on how to parse and process wikitext tabels <-- new to me

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions