Statistical Area and Regional Community Data Collection 2021

This dataset collection compiles related data tables sourced from the website of Tilastokeskus (Statistics Finland). The source describes the dataset as 'Statistics Finland's service interface (WFS)'. The tables hold geographic and statistical data on regional units in Finland for 2021 at a scale of 1:1 000 000. The dataset is licensed under CC BY 4.0 (Creative Commons Attribution 4.0, https://creativecommons.org/licenses/by/4.0/deed.fi).

Tables

  • Regions 2021 (Scale 1:1 000 000) - Table from 2021 Statistical Dataset Collection (TSV)

    This table, titled 'Regions 2021 (1:1 000 000)', is part of the dataset collection and is sourced from the Statistics Finland website. It holds geographic and statistical data on regions in Finland, including the names of the regions in several languages. Each row is uniquely identified by the combination of its extraction date and its row number. The geographic data is presented in the WGS 84 coordinate reference system, with the axis order being...

  • Version History of Regional Units 2021 (1:1 000 000) (TSV)

    The 'table__history' is a part of the 'tilastointialueet_seutukunta1000k_2021' dataset collection and serves as a history table, providing a version history of its base table rows. This table contains data on regional areas for the year 2021 at a scale of 1:1 000 000, sourced from the Statistics Finland (Tilastokeskus) website. The data content includes geographic information, unique identifiers, and names in different languages. The geographic information, indicated by columns starting with 'geom_', is presented using the WGS 84 coordinate reference system, with axis order longitude first, followed by latitude. These geographic details can be used in geospatial data analytics to study regional patterns or changes over time. The table also includes unique identifiers, such as row number...

Column Descriptions

Regions 2021 (Scale 1:1 000 000) - Table from 2021 Statistical Dataset Collection

Column Type Comment
_extract_date date This column contains the date when the data on the respective row was extracted from the data source, serving as a timestamp for the collected data.
_row_number long This column contains the row number for the data extracted from the source on the extract date specified by the column '_extract_date'. The combination of these two columns uniquely identifies each row.
geom_geojson string This column contains the geometric data of the regions in a GeoJSON format, a common format for encoding a variety of geographic data structures.
geom_geotext string This column contains the geometric data of the regions in a text format.
geom_type string This column specifies the type of the geometric data represented in the 'geom_geojson' and 'geom_geotext' columns. It can be a polygon or a multipolygon.
geom_centroid string This column contains the coordinates of the centroid of the geometric data, the geometric center of a two-dimensional shape.
geom_center_x double This column contains the X-coordinate of the center of the geometric data.
geom_center_y double This column contains the Y-coordinate of the center of the geometric data.
gml_id string This column contains a unique identifier for each row, which is a combination of the dataset name and a unique number.
name string This column contains English names of the regions represented in the rows.
namn string This column contains the Swedish names of the regions represented in the rows ('namn' is Swedish for 'name').
nimi string This column contains the Finnish names of the regions represented in the rows ('nimi' is Finnish for 'name').
seutukunta string This column contains a unique identifier for each region.
vuosi long This column indicates the year the data was recorded.
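
The column descriptions above can be checked against a downloaded file with a few lines of standard-library Python. The following is a minimal sketch, not the publisher's tooling: the sample rows and the helper names (`make_row`, `read_rows`, `duplicate_keys`, `first_vertex`) are invented for illustration, and it assumes the TSV export is tab-delimited without quoting. It verifies that '_extract_date' plus '_row_number' is unique and reads the first vertex of 'geom_geojson', relying on the GeoJSON convention (RFC 7946) that positions are ordered longitude, latitude.

```python
import csv
import io
import json

# Hypothetical sample in the table's column layout; all values are invented.
HEADER = "\t".join(
    ["_extract_date", "_row_number", "geom_geojson", "geom_type", "seutukunta", "vuosi"]
)
RING = [[25.0, 60.0], [25.5, 60.0], [25.5, 60.5], [25.0, 60.0]]


def make_row(extract_date, row_number, ring, code):
    """Build one tab-separated sample line with a Polygon geometry."""
    geometry = {"type": "Polygon", "coordinates": [ring]}
    fields = [extract_date, str(row_number), json.dumps(geometry), "Polygon", code, "2021"]
    return "\t".join(fields)


SAMPLE_TSV = "\n".join(
    [HEADER, make_row("2024-11-11", 1, RING, "000"), make_row("2024-11-11", 2, RING, "001")]
)


def read_rows(text):
    """Parse TSV text into dicts (assumption: fields are not quoted)."""
    reader = csv.DictReader(io.StringIO(text), delimiter="\t", quoting=csv.QUOTE_NONE)
    return list(reader)


def duplicate_keys(rows):
    """Return every (_extract_date, _row_number) key that occurs more than once."""
    seen = set()
    duplicates = []
    for row in rows:
        key = (row["_extract_date"], int(row["_row_number"]))
        if key in seen:
            duplicates.append(key)
        seen.add(key)
    return duplicates


def first_vertex(row):
    """Return (longitude, latitude) of the first vertex of the row's geometry."""
    geometry = json.loads(row["geom_geojson"])
    coords = geometry["coordinates"]
    # Polygon: coords[ring][vertex]; MultiPolygon: coords[polygon][ring][vertex].
    ring = coords[0] if geometry["type"] == "Polygon" else coords[0][0]
    lon, lat = ring[0][:2]
    return lon, lat


rows = read_rows(SAMPLE_TSV)
print(duplicate_keys(rows))   # an empty list means the composite key is unique
print(first_vertex(rows[0]))  # longitude first, then latitude
```

For a real download, the same functions could be applied to the file contents instead of `SAMPLE_TSV`; if the export turns out to quote its fields, the `quoting` argument would need to change.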

Version History of Regional Units 2021 (1:1 000 000)

Column Type Comment
_start_date date The date when the row was extracted from the data source. Together with the '_row_number' column, it uniquely identifies each row in this version history table.
_end_date date The date when a new version of the row was extracted from the data source. If this column has a null value, it indicates that the row is the most recent version.
_row_number long The number of the row in the raw data extracted from the data source. This column, in combination with the '_start_date' column, uniquely identifies each row in this version history table.
geom_geojson string Contains geographical data in GeoJSON format, which is a format for encoding a variety of geographic data structures.
geom_geotext string Contains geographical data, in the form of text, that corresponds to the GeoJSON data in the 'geom_geojson' column.
geom_type string Specifies the type of the geographical data contained in the 'geom_geojson' and 'geom_geotext' columns.
geom_centroid string Contains the central point of the geographical data represented in the 'geom_geojson' and 'geom_geotext' columns.
geom_center_x double Represents the X coordinate of the central point of the geographical data.
geom_center_y double Represents the Y coordinate of the central point of the geographical data.
gml_id string A unique identifier for each row in the table.
name string The name of the area in English.
namn string The name of the area in Swedish.
nimi string The name of the area in Finnish.
seutukunta string A code representing a specific region within Finland.
vuosi long The year associated with the data in each row.
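
The '_start_date' and '_end_date' columns described above allow two common queries: the most recent version of every row, and the versions that were valid on a given day. The sketch below illustrates both under stated assumptions: the sample records and the function names (`parse_date`, `current_rows`, `rows_as_of`) are hypothetical, a null '_end_date' is assumed to arrive as an empty string, and the validity interval is assumed to be half-open (a version stops being valid on the day its replacement was extracted).

```python
from datetime import date

# Hypothetical history records; all values are invented for illustration.
HISTORY = [
    {"_start_date": "2024-11-11", "_end_date": "2024-11-12", "_row_number": "1",
     "seutukunta": "000", "name": "Example A (old)"},
    {"_start_date": "2024-11-12", "_end_date": "", "_row_number": "1",
     "seutukunta": "000", "name": "Example A"},
    {"_start_date": "2024-11-11", "_end_date": "", "_row_number": "2",
     "seutukunta": "001", "name": "Example B"},
]


def parse_date(text):
    """Convert an ISO date string to a date; empty text is treated as null."""
    return date.fromisoformat(text) if text else None


def current_rows(rows):
    """Keep only the most recent versions, i.e. rows whose _end_date is null."""
    return [row for row in rows if parse_date(row["_end_date"]) is None]


def rows_as_of(rows, day):
    """Keep the versions valid on `day` (assumed interval: start <= day < end)."""
    valid = []
    for row in rows:
        start = parse_date(row["_start_date"])
        end = parse_date(row["_end_date"])
        if start <= day and (end is None or day < end):
            valid.append(row)
    return valid


print([row["name"] for row in current_rows(HISTORY)])
print([row["name"] for row in rows_as_of(HISTORY, date(2024, 11, 11))])
```

If the source treats the end date as inclusive, the comparison `day < end` would become `day <= end`; the column description does not settle this, so the choice here is an assumption.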

Additional Info

Last Updated November 12, 2024
Created November 11, 2024