road, is it ANY different from M1? Yes, it is.
A map can help to understand what is going on.
Works that run 24/7 need special treatment.
It seems that the coordinate system used in the file is what is called "projected coordinates".
How to group data? By date probably. What are the time bounds? What is the most broken road?
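Pandas could not read XML directly when this was written (pandas.read_xml arrived later, in version 1.3), but the Python standard library can parse it, and the result can be turned into dictionaries. The data seems to have fixed fields, so it can be represented as a table.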
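A minimal sketch of that conversion, using only the standard library. The tag names (`roadwork`, `reference`, `road`, `start_date`, `end_date`) and the sample values are made up for illustration; the real file's schema is not shown here and may differ.

```python
import xml.etree.ElementTree as ET

# Hypothetical sample: tag names and values are assumptions,
# not the real schema of the roadworks file.
SAMPLE_XML = """
<roadworks>
  <roadwork>
    <reference>1001</reference>
    <road>M1</road>
    <start_date>2016-03-01</start_date>
    <end_date>2016-03-03</end_date>
  </roadwork>
  <roadwork>
    <reference>1002</reference>
    <road>A14</road>
    <start_date>2016-03-02</start_date>
    <end_date>2016-03-02</end_date>
  </roadwork>
</roadworks>
"""


def xml_to_records(xml_text, record_tag="roadwork"):
    """Return one flat dict per record element, keyed by child tag name."""
    root = ET.fromstring(xml_text.strip())
    return [
        {child.tag: (child.text or "").strip() for child in record}
        for record in root.iter(record_tag)
    ]


records = xml_to_records(SAMPLE_XML)

# Check that every reference number is unique.
references = [r["reference"] for r in records]
all_unique = len(references) == len(set(references))
```

Because every record has the same fixed fields, a list of dictionaries like `records` can be passed straight to `pandas.DataFrame(records)` to get the table.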
Now I create a Pandas DataFrame. I use this tool because of its powerful data manipulation, sorting, and analysis features. I can always go back to standard Python data structures.
The data file "./he_roadworks_2016_02_29" contains 2196 data entries with unique reference numbers.
Now some refactoring.
I assume it is 2016 and around March, since the "newest" raw data file is dated March 2016.
Here I determine the day in 2016 with the highest number of roadwork situations. I assume this information may be interesting in this context.
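One way to do this count, sketched in plain Python rather than Pandas. It assumes each roadwork has a start and an end date and counts as active on every day in between, both ends included. The sample periods are invented.

```python
from collections import Counter
from datetime import date, timedelta

# Hypothetical (start, end) pairs, both inclusive.
works = [
    (date(2016, 3, 1), date(2016, 3, 3)),
    (date(2016, 3, 2), date(2016, 3, 2)),
    (date(2016, 3, 2), date(2016, 3, 5)),
]


def busiest_day(periods, year=2016):
    """Return (day, count) for the day with the most active roadworks."""
    counts = Counter()
    first, last = date(year, 1, 1), date(year, 12, 31)
    for start, end in periods:
        # Clip each period to the year of interest.
        day = max(start, first)
        stop = min(end, last)
        while day <= stop:
            counts[day] += 1
            day += timedelta(days=1)
    return counts.most_common(1)[0] if counts else None


day, n_active = busiest_day(works)
```

If several days tie for the maximum, `most_common` returns the one that was counted first, so a tie may deserve a separate check.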
Here I determine two things. First, the city in the UK with the largest number of roadworks in 2016. Second, the city with the highest number of roadworks on a single day. This information may be important, since drivers would do better to avoid such a place and may choose another means of transport, either throughout 2016 or on a specific date.
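A rough sketch of both rankings in plain Python. It assumes each entry already carries a city name together with inclusive start and end dates; the cities and dates below are invented and are not results from the data file.

```python
from collections import Counter
from datetime import date, timedelta

# Hypothetical (city, start, end) rows; not taken from the real data.
works = [
    ("Leeds", date(2016, 3, 1), date(2016, 3, 1)),
    ("Leeds", date(2016, 3, 5), date(2016, 3, 5)),
    ("Leeds", date(2016, 3, 10), date(2016, 3, 10)),
    ("Leeds", date(2016, 3, 15), date(2016, 3, 15)),
    ("Bristol", date(2016, 3, 2), date(2016, 3, 2)),
    ("Bristol", date(2016, 3, 2), date(2016, 3, 4)),
    ("Bristol", date(2016, 3, 1), date(2016, 3, 2)),
]


def city_rankings(rows):
    """Return the top city overall and the top (city, day) pair."""
    total = Counter()    # roadworks per city over the whole period
    per_day = Counter()  # active roadworks per (city, day)
    for city, start, end in rows:
        total[city] += 1
        day = start
        while day <= end:
            per_day[(city, day)] += 1
            day += timedelta(days=1)
    return total.most_common(1)[0], per_day.most_common(1)[0]


(top_city, top_total), ((peak_city, peak_day), peak_count) = city_rankings(works)
```

In this sample the two answers differ: Leeds has the most roadworks overall, while Bristol has the busiest single day, which is why it is worth computing both.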
Besides the datetime and string information, the data also contains coordinates. I can add these to make the location of the roadworks clear.