Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Data Accuracy Issues #3

Open
joelkarunungan opened this issue Sep 1, 2018 · 2 comments
Open

Data Accuracy Issues #3

joelkarunungan opened this issue Sep 1, 2018 · 2 comments

Comments

@joelkarunungan
Copy link

For those downloading the data, who may think this is accurate since it comes from government sources, set your expectations lower and be aware of the following issues:

  1. The postal code information in buildings.json is full of spam locations, especially banks. ATM machines are not buildings, nor are their temporary sales sites. How did this information even end up there?
  2. Data values are 99% in all caps. This is problematic since the actual building spelling nuances are not respected. iSuite vs ISUITE, MacDonald vs MACDONALD, etc.
  3. A lot of typographical errors.
  4. A lot of incomplete building names. A lot of the official building names are not provided.

Overall, very low quality considering its supposed to be scrubbed and well-maintained government information.

Postal codes should be cross referenced with the actual building names from another cleaner source, maybe URA SPACE?

@xkjyeah thanks for this, it is very helpful. Just want others to be aware of the issues.

@xkjyeah
Copy link
Owner

xkjyeah commented Sep 1, 2018 via email

@joelkarunungan
Copy link
Author

joelkarunungan commented Sep 1, 2018

Will do.

Besides the coordinates, another useful part is the BLK_NO and ROAD_NAME parameters. It appears that the values in these can be trusted. Say, in autofill for address fields, when POSTAL is provided by the user, the underlying system can use the OneMap data to autofill the BLK_NO and ROAD_NAME parameters.

The BUILDING parameter definitely can't be trusted and will have to be filled-in by the user.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants