Hansard data extraction

classic Classic list List threaded Threaded
2 messages Options
Reply | Threaded
Open this post in threaded view
|

Hansard data extraction

Daniel Haran
Hi folks,

The Hansard (minutes for Canada's parliament) are available in 2 formats:

http://www2.parl.gc.ca/HousePublications/Publication.aspx?Language=E&Mode=1&Parl=40&Ses=2&DocId=3781663
http://www2.parl.gc.ca/HousePublications/Publication.aspx?Language=E&Mode=2&Parl=40&Ses=2&DocId=3781663

In those two addresses, only "Mode=1" changes to "Mode=2"

It's clear from the 2nd mode ("Text Mode") that the information is
stored in a machine-readable format (XML).

Has anyone asked for Hansard to be made available in that format?

Thanks,

Daniel.

Reply | Threaded
Open this post in threaded view
|

Re: Hansard data extraction

Cory Horner
On 29-Mar-09, at 4:47 PM, Daniel Haran wrote:

> The Hansard (minutes for Canada's parliament) are available in 2  
> formats:
>
> http://www2.parl.gc.ca/HousePublications/Publication.aspx?Language=E&Mode=1&Parl=40&Ses=2&DocId=3781663
> http://www2.parl.gc.ca/HousePublications/Publication.aspx?Language=E&Mode=2&Parl=40&Ses=2&DocId=3781663
>
> In those two addresses, only "Mode=1" changes to "Mode=2"
>
> It's clear from the 2nd mode ("Text Mode") that the information is
> stored in a machine-readable format (XML).
>
> Has anyone asked for Hansard to be made available in that format?

Yes, from October 2006:

"At this time, the House of Commons doesn't provide the public with  
access to its XML files.  A working group is being organized to  
discuss this issue.  If there are any changes or new information, I  
will try to let you know."

I never heard from them again, and my follow-up messages went  
unanswered.

It is probably a good time for another attempt, although I have a half-
done API that could provide the same service.

Cheers,
Cory.