-
Notifications
You must be signed in to change notification settings - Fork 65
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[Spain] Add GTFS feeds from NAP #118
Comments
I am having a quick look at this now. The NAP page shows 114 data sources and the |
I think they might not have existed when the list was put together. If you can find a machine readable list from the NAP, it would be great to have a script to generate feeds/es.json, like we have for France and Austria. |
There is an API, which documentation is here: https://nap.transportes.gob.es/Account/InstruccionesAPI |
Ouigo ES is currently in es.json but seems not working 🤔 Should I open a separate issue? |
"Ouigo ES is currently in es.json but seems not working 🤔 Should I open a separate issue?" " it would be great to have a script to generate feeds/es.json, like we have for France and Austria." |
I don't know what is the exact issue, but no train between Madrid and Barcelona is shown... |
Is it possible that URLs change over time, for example we should have AECFA Slots, but nothing appears in Transitous : https://nap.transportes.gob.es/Files/Detail/920 ? |
It would probably be good to be able to generate the list from the NAP automatically to catch such cases, like we already do for France. |
The file url used by Transitous seems to be fine, and the data correct too. Any idea on why the flights do not show up on the map and routing? |
I can't check it on the phone right now, but the first step is too see whether the post-processed feed (https://routing.spline.de/gtfs/es_AECFA-slots.gtfs.zip) looks reasonable, and if not, what the import log on the CI says. |
The stop time syntax is invalid in the feed, which causes no data to actually be imported. |
The CI/gtfsclean didn't complain about it? Should we add a fix to accept/convert from HH:MM? |
Currently all the entries in es.json have "fix": true, because manually looking through them was too much work for that number of feeds. I think it would be fine to extend gtfsclean to handle this, but I'd try to contact the feed producer first, since its much easier on their side and every consumer of the feed will need this fix. |
The time parsing only reads HH:MM anyway in MOTIS, so there should be no problem with this syntax. But probably quoted time fields won't work (haven't tried it) - that's something we could improve if necessary. |
In this case it's failing earlier on our side (gtfsclean), which we still need to filter other invalid stuff (wrong coordinates, too fast trips etc.) |
I think it is failing even earlier in https://github.com/public-transport/gtfsparser : all stop are badly formatted in HH:MM and therefor directly added to |
|
Currently it still goes through gtfsclean anyway. We could add an option to skip that but then we loose filtering of fast trips for that feed which could break things. I think making the |
Did this now, the next problem is
This feed must have seen zero testing or validation, I think it might be a case for an external data cleanup project. |
Wow this feed is amazingly bad 🤣 It would need a dedicated cleanup script at this point :/ |
This error is new, I have a version from january having the correct |
So after some more fiddling around, it is salvageable, but only with custom search-and-replace action and gtfsparser hacks. In case someone wants to clean it up, I have good experiences with the free CI on gitlab.com. It allows to get a stable link to the latest artifact, so you can set up a cron job to do a number of sed commands there without much work, and add the output link here. Obviously anything else that allows to get a stable link works as well. |
Maybe before putting in the effort to write a custom script, it would make sense to at least try to get in touch with the feed authors with a wish list which bugs would be most important to fix to have a usable feed. Maybe they are just not aware that their feed is effectively useless in the state they publish it. |
I found it quite useful that the official validator produces a static report URL that I can put into the email with the note that this validator is the official one by the standardization organization itself, so it has some "authority". |
The Spanish NAP (National Access Point) is available here https://nap.mitma.es/. It has feeds from operator companies and regional governments mixed, so some preprocessing to avoid duplicates may be needed. The NAP only has the feeds from the companies and governments who want to release them, having therefore gaps in the coverage.
The text was updated successfully, but these errors were encountered: