Login  Register

Re: Hansard data extraction

Posted by Cory Horner on Apr 01, 2009; 4:32am
URL: http://civicaccess.416.s1.nabble.com/Hansard-data-extraction-tp1590p1592.html

On 29-Mar-09, at 4:47 PM, Daniel Haran wrote:

> The Hansard (minutes for Canada's parliament) are available in 2  
> formats:
>
> http://www2.parl.gc.ca/HousePublications/Publication.aspx?Language=E&Mode=1&Parl=40&Ses=2&DocId=3781663
> http://www2.parl.gc.ca/HousePublications/Publication.aspx?Language=E&Mode=2&Parl=40&Ses=2&DocId=3781663
>
> In those two addresses, only "Mode=1" changes to "Mode=2"
>
> It's clear from the 2nd mode ("Text Mode") that the information is
> stored in a machine-readable format (XML).
>
> Has anyone asked for Hansard to be made available in that format?

Yes, from October 2006:

"At this time, the House of Commons doesn't provide the public with  
access to its XML files.  A working group is being organized to  
discuss this issue.  If there are any changes or new information, I  
will try to let you know."

I never heard from them again, and my follow-up messages went  
unanswered.

It is probably a good time for another attempt, although I have a half-
done API that could provide the same service.

Cheers,
Cory.