Announcing the Where Does My Money Go? Assembly Kit

February 16, 2012 in Releases, Updates

Over the past few months, we’ve made a lot of progress on OpenSpending. The core of the application is now mostly stable and it is getting ever easier to load data into the platform through the web-based dataset editor. Yet, inevitably, this raises a simple question: I’ve imported my data, what next?

Thanks to our API, there can be an infinite number of answers. With the BubbleTree diagram, the Daily Bread application and the transactional spending browser, we have a few simple answers.

But as Michal Škop blogged recently, up to now it has been fairly difficult to use both these widgets and the OpenSpending API to create custom front-ends.

To make things easier, we’ve now created the Assembly Kit. The kit is in fact the source for a newly styled version of the Where Does My Money Go? site that has gone live yesterday. Contained in this is a clean set of templates that can anyone who knows basic HTML can easily use to make a lightweight, white-label budget visualization site, styled according to your own wishes.

Screen Shot 2012-02-16 at 2.35.48 PM

A set of widgets are included and can be adapted to another dataset with just a few edits. And since everything runs against the OpenSpending API, you don’t need to run your own database. Instead, you can load your data into OpenSpending.org and then customise the user facing side – for example, you can just use a generic blog or a set of static HTML files.

Our next step in March will be to make it easier for users – especially Journalists – to create custom configurations for the visualizations via a graphical interface, save specific views and share them through a simple embed code. We’ll also work to roll out the mapping support more widely and to create more custom apps on top of the API.

Our goal is to make OpenSpending the easiest way to publish and analyze a government finance dataset – with your help! So please provide us with feedback and contribute your own visualizations to the OpenSpending platform.

The Czech budget on-line: the half success story

February 15, 2012 in Working Group

This post is by Michal Škop, of KohoVolit.eu.

The half-success story of implementing OpenSpending.org and OtwarteDane.pl into BudovaniStatu.cz (‘Building of the State’, the name referes to Peroutka‘s book)

It all started almost 2 years ago. Our partner NGO NasiPolitici.cz started to think about putting the Czech public money data on the web and asked us at KohoVolit.eu if we were interested. And we said yes, we always wanted to do something ‘about money’ (we used to be a parliamentary watchdog only till then).

We found out that there is a huge amount of public financial data available on-line. Every single public organization has to fill several detailed accounting forms every year, the oldest data are from 1994 (not published, but they are there). And it is available even in xml. Can you ask for more?

Later on, we found that there were some serious catches. The Ministry of finance, which provided the data, severely limited the number of downloads from one IP. It would have taken us a couple of months just to download everything (some 60 GB of data). The Tor and mobile connection (changing IP) came in useful. The forms were in xml, but mixing raw basic data with sums with no clear distinction between them at all. Funny. They changed the system for 2010. Et cetera. We were progressing rather slowly, with no financial support at all.

Finally, help from Anticorruption Endowment came and we got funding for about two month (developer) to build a site connecting (just) the government budgets with the politicians. That was important, I could not just show the data in some nice way, I needed to do other things with the application – showing historical data, connecting to politicians.

I spent a month just fiddling with the data, trying to find a suitable a) data storage and b) application to build on.

I tried OpenSpending.org first, but I was not able to set up the data there. I tried to tweak our parliamentary API, but it was just too much work, I would not be able to finish it in time. After a few weeks, I still was not sure if I would get the results using OpenSpending.org. The guys behind OtwarteDane.pl were very helpful and so we decided to store the data with them.

I did not use OpenSpending.org’s API, but their bubbletree chart was good. I needed to catch a few bugs, but it took me just a few days to get it running more-or-less in a way I wanted (well yes, I still need to clean the code for ‘pull request’). And – importantly – it was possible to build our application(s) on it.

I think, we have hit the bubbletree’s limit on number of bubbles there. It runs rather well with data we limited it to later (about 3600 bubbles), but it takes javascript about 10 sec on my medium computer to process the full data, 24000 bubbles for 2010 year, Opera cannot handle it and IE had problems, too (try it on our development site).

And how about the ‘where does my taxes go‘ app? Well, it was rather easy from the developer’s view. I could copy the British idea, just program it in Javascript instead of the Flash. The hard part was the economics here. We could not use just the income tax as it accounts for about 10 % of all the taxes only (the VAT, the health tax, the social tax are more important). The taxes are messy. The general financial reporting is a mess, too. I have found about 15 % difference in ‘public taxes’ in different financial reports from Czech Statistical Office. So which one to use to calculate the overall taxes? But this is just one reason more why OpenSpending.org will be useful, to standardize this mess.

For the future, we will update the project once the 2011 data is available. We shall solve the problem with bubbles’ scaling. We will write analyses based on it mainly push others to do it. And I already have the Prague 2012 budget data ready to bubble…

Hakuna My Data: NBO Data Bootcamp

January 30, 2012 in Data Journalism, events

This post is by Friedrich Lindenberg, developer on OpenSpending.

“My Name is XXXX, I am a member of the Kenyan parliament for the constituency of XXXX in the 2007-2012 election cycle. During my time in parliament, I have positioned myself against taxes for MPs.

Of the Development Funds allocated to my constituency, I have spent 12mn KSH in 2010 and 8mn KSH in 2009. Since 2007, I’ve funded 201 projects, of which 72 (9mn KSH) related to Education, 56 (7.2mn KSH) related to Health and 20 (4.2mn KSH) to Infrastructure.

The largest projects I have funded include… “

Auto-generated, spending data-driven campaign speeches like this are just one of the many ideas of the Data Bootcamp that took place in Nairobi last week. Invited by the African Media Initiative and the World Bank Insititute, about 70 participants – both journalists and developers – met on Strathmore University’s campus to learn and practise both the skills and tools required for data-driven reporting.

The four-day programme combined tools training with practical work in small groups. Elena Egawhary (BBC NewsNight) gave a workshop on data analysis in Excel, Sreeram Balakrishnan (Google Fusion Tables) introduced both Refine and Fusion Tables. Team members from both the Kenya data portal and the World Bank finance site presented their respective offerings, while Gregor and myself from the OpenSpending team gave intros to web scraping and advanced map visualisation.

During group work, journalists and developers teamed up to try their newly learned skills in different domains ranging from sports (football player profiles) to education (missing toilets in schools, “The Shit Ordeal”) and the financial transparency story-telling mentioned above.

The workshop also served as a community-building event for Kenya’s young and impressive Open Data initiative. Future events, aimed at civil society organisations and polictical actors will help to further promote the re-use of government information released through the initiative.

All this is happening in a place where transparency is an essential tool to be developed: Not only is the access to information now guaranteed by the 2010 Kenyan constitution, there are also major political issues that deserve close attention from local and international watchdogs. These include not only the ongoing incursion of Kenyan troops into Somalia in an effort to fight Al-Shebab terrorist groups, but also the upcoming nationwide elections in December 2012. The elections will instate a new bicameral system of government, with many previously unknown candidates standing for office. In the previous 2007 vote, bad polling station data had quite literally led to widespread unrest and thousands of deaths across the nation.

In all, it was a fantastic to get in touch with the Kenyan participants of the workshop and to see how the organizers of the event – a brilliant team including Craig Hammer, Justin Arenstein and Jay Bhalla – are working to foster an open data community in this bustling developing nation.Given the great ideas generated during the team sessions, I’m sure this work will soon bear its first fruits.

Transparency and technology in Brazil: linking politicians to bad entrepreneurs

January 23, 2012 in Data Journalism, Spending Stories

This story by Fabiano Angélico, who formerly worked at Transparencia Brasil, is about how technology and the help of coders can be used to highlight links between politicians and corrupt entrepreneurs. It is followed by a brief “Behind the News” interview which shows some of the time costs of datawrangling and problems faced when getting the story out.

How can transparency and technology point out connections between politicians and bad entrepreneurs? Well, first of all you will need some information about the politicians and about the entrepreneurs.

In Brazil, in spite of the historical lack of transparency in governments (Brazil’s freedom of information law was sanctioned just late last year), the Electoral Court has been proactively providing information on political candidates since 2002. One piece of info is the financial donation to the candidates, containing info about who is donating to whom and how much. Although this database is released only after the elections — the info would surely be more powerful if it were released DURING the political campaigns –, one must admit this is a rich source of information.

January, 2010. Elections for President and for the Parliament, as well as for State Governors and State Parliaments, would happen in only 9 months time, in October. However, many people were already discussing them.

At that time, 2010 had just begun, I was at work, thinking of how to find rich and useful information on the candidates. Then I was reminded of the so-called “Dirty List” — this is a list regularly published by the Ministry of Labour which indicates the companies and farmers who are caught by government officials using workers in very lousy conditions, similar to slavery.

The list published in the Ministry’s website is in not-so-friendly PDF format, but it has a plus: there is not only the name of the companies or the entrepreneur/farmer, but also their registry numbers within the government. I remembered that in the Electoral Court one can also find the numbers. That was important because having the registry numbers would avoid ambiguities.

I had both lists: the donators to the previous elections (2008, 2006, 2004 and 2002) and the “Dirty” companies. But I had a problem; I did not know how to matchup the datasets. My tech knowledge allowed me to transform the PDFs into CSV, but I could no go further without help.

I then sent the datasets, in CSV format, to Transparencia Hacker, a Google Groups list which now gathers over 800 people interested in the connections between transparency and politics/public administration.

Within 2 days, the guys made the datasets talk, and we found that 16 politicians had been elected with the help of “Dirty” money in the 4 previous elections. Other 13 politicians had received donations from the “Dirty List” but had not succeeded in winning the elections.

A local newspaper told the story.

In October 2012, there are local elections in Brazil. Hope we can shed even more light in the candidates.

Behind the news:

Roughly how long did it take you to extract the data from the PDFs? Do you know how long the guys from Transparencia Hacker spent working on the data?

This was kind of easy. It took me just some minutes. The “Dirty List” is a 20-page PDF. I always use a website to convert it into xls or csv (I like Cometdocs for this work).

Here is the Dirty List, in PDF (last updated on the 8th of November, 2011; the list we used is in CSV but it it very outdated because it was due to January 2010) Here are the Electoral Court pages for the list of donators: 2002, 2004, 2006, 2008 and 2010.

What I asked the Transparencia Hacker community was to check whether the CNPJs (companies register number within the governments) in the CSV would match any item in the Electoral Court webpage. The guys worked on the data for 2 days.

Is sufficient data available to visualise the total amount lobbyists donated to political campaigns, and would it be useful to / no? If you were to visualise the info – what would the priorities be to show? Would any tools be useful to explore the data?

Yes, there is enough data. And YES, it would be very useful to visualize those links. I would prioritise the presidential and governor candidates as well as some Congressmen who hold top-positions in both Houses of Congress. Also, the donations to political parties (not to individual politicians) would be a plus.

A search form would be very useful. The search could have filters for position (Presidential candidate, governor candidate, political party etc), geography (Brazil, states) and donators (with no filters, just a blank for writing)

In your ideal world, in time for the impending elections – what would be done differently from last time? Any additional data you would like to see released?

I’d have to think more carefully to respond that, but concerning additional data: the number which identifies the market (the field) in which the companies work.

Interested in writing a “Behind the News” piece for the OpenSpending blog? Get in touch via our twitter account or email info [at] openspending.org.

Some useful links (mainly in Portuguese):

Updates from the OpenSpending Dev Team

January 23, 2012 in Releases, Spending Stories, Updates

What are we focusing on this week?

  • Working on implementing Collections <- draft notes, beware.
  • Prototyping the Compar-o-tron Mockup 1, Mockup 2.
  • Continuing work on Embeddable widgets for Spending Stories.

Open Bookkeeping: What role can accountants play in Open Spending & Budget Projects?

January 17, 2012 in Uncategorized

The next OpenSpending online community meeting will take place on Thursday, 19th Jan – 6pm GMT.

The topic

Accountants spend their entire working lives mapping the money:

  • How can their expertise be put to good use in Open Spending Data projects?
  • What interesting initiatives are going on around the world which could benefit from the input of accountants?

Via OpenClipArt

All are welcome! If you’d like to contribute to the discussion by joining the call, please just add your name and Skype ID to the pad

Please feel free to share with colleagues friends and other communities.

N.B. Over the next weeks, we’ll be trying to theme the discussions and proactively invite people along to join them. If you have a suggestion for a topic you think the group should discuss, please drop us a line via the OpenSpending mailing list.

Civil Society and Spending Data: Who is mapping the money?

January 12, 2012 in Contribute, OSF

This post is by Lucy Chambers, Community Coordinator on the OpenSpending project at the Open Knowledge Foundation. The post is cross-posted on the Open Knowledge Foundation blog.

We’re excited to announce that, thanks to the generous support of the Open Society Foundations, OKFN’s activities around financial transparency will expand to include a second pillar: next to the OpenSpending platform, we have just started a 6 month project to map the technology needs of Civil Society Organisations in relation to public spending and budget information.

We’re going to be working on…

  • Identifying CSOs around the world who are interested in working with spending data – building on the existing network of contacts from the OpenSpending.org project.

  • Connecting these CSOs with each other, with open data communities and with other key stakeholders to exchange knowledge, experiences and best practices in relation to spending data

  • Establishing how CSOs currently work with spending data, how they would like to use it, and what they would like to achieve – including:

  1. what existing tools are being used
  2. what current technical needs are unmet
  3. what would be required to meet these needs and how feasible is it to tackle them
  • Creating a registry of spending datasets, from official and unofficial sources in theDataHub.org
  • A Spending Data Manual – A wiki-like, community driven manual on acquiring, working with, publishing and archiving spending data, based on input and exchanges with CSOs we talk to.This will augment and reference existing publications from numerous organisations as well as channelling the results of our research into two areas:
    • A section to help CSO’s clarify their demands towards governments: e.g. guidance on open licensing and structured data formats, applicable for spending data.
    • A section focused on best practice for CSOs when using and reusing spending data: for example collaborative processes such as data-sharing.  
  • Running Spending Analysis Sessions with CSOs, both in person and virtually. We’re interested in learning from about what data people are trying to acquire / having difficulty in doing so, how they plan to use the data to further their mission and learning what barriers, legal, technical and otherwise could be removed to make their jobs easier.

  • Getting Spending Data from numerous countries loaded into OpenSpending.org – with the support of CSOs, OKFN developers, and volunteers from the open data community. We we’re interested in are using the OpenSpending.org tools, and collect input from them on how these could be improved to meet their needs.

Vision: Improved Spending Data Literacy, Sharing and Re-use amongst CSOs around the world

We are very keen to help more groups and individuals around the world to use and work with spending data more effectively to do the things they care about – whether this is investigative journalism, evidence based policy-making, political campaigning, budgeting or creating new useful applications and services.

In particular, we would like to document and spread best practices in the legal and technical aspects of reusing public information, and enabling re-use and better collaboration around this material.

Ultimately we would like to:

  • Build stronger, broader communities of groups and individuals who work together to acquire, use, and openly share spending data
  • Increase ‘literacy’ around spending data – enabling more CSOs to understand and work with large and complex spending datasets to help them to pursue their objectives
  • Encourage more CSOs to publish datasets which they acquire, use or create in machine readable formats, under open licenses, to avoid duplication of effort and enable CSOs to build on each others’ work, to harness external expertise more effectively and to facilitate stronger collaboration between different organisations who are interested in spending information

How can I get involved?

  • Join the Working Group on Spending Data. The working group will bring together data experts and CSOs who will help to weave a community of best practice around spending data, collect and provide feedback on material for the manual and help to develop the network of those collaborating around and sharing spending data. More details about the working group can be found on this wiki page.

  • Write for the Spending Data Blog – we’re interested in posts by and about CSOs who work with spending data, observations on the current status quo on releasing data in your area. Anything from short comment pieces to full proposals for what could be done, legal, technical or otherwise, to improve the situation in the sphere where you work. Contact details as above.

If you would like to get started, or know of organisations we should extend the invitation to: drop us an email via the mailing list or contact me directly via info [at] openspending.org.

Data = Seized, Sanitised and Sanity-checked. Open Data Day 2011

December 12, 2011 in events

This post is by Mark Brough, Research Officer at Publish What You Fund, Lucy Chambers, Community Coordinator for OpenSpending, and Irina Bolychevsky, Product Owner for CKAN. It is cross-posted on the OpenSpending Blog and the Open Knowledge Foundation Blog and Mark Brough’s contribution is also featured on aidinfolabs.org.

Saturday, December 3rd was Open Data Day, and London took the challenge to throw a hackday to help data be opened, cleaned and shown off to the world…

Fuelled only by enthusiasm, caffeine and 5 packets of ready-made popcorn, the CKAN, OpenSpending and IATI teams, along with some new faces, joined forces to liberate as much data as they could…

OpenSpending + IATI + CKAN

As part of the IATI Open Data Day challenges, Mark Brough did some work to get the existing IATI Data into OpenSpending. David Read, from the CKAN team, and a new face to the data wrangling crew, Johannes, scraped data on aid donations from France and Austria that were locked-up in web apps in order to help fill in the gaps in the global aid data jigsaw puzzle. You can see the results on OpenSpending.

The French (AFD) and Austrian (ADA) aid data appears to be incomplete: the AFD’s [2010 Annual Report]http://www.afd.fr/jahia/webdav/site/afd/shared/PUBLICATIONS/Colonne-droite/Rapport-annuel-AFD-VF.pdf suggests that South Africa is the biggest recipient country, receiving €403 million, but in the data, Morocco is the biggest recipient and there are no transactions in South Africa.

The Austrian Development Agency data was carefully cleaned by Johannes, with region and country codes being added for all entries to create a tidier dataset. However, the original data contained, for example, four different spellings of Bosnia and Herzegovina, suggesting that countries are being manually entered rather than selected from an existing list. [For 2010]http://openspending.org/ada/?_time=2010&_view=country, the second biggest recipient of the Austrian Development Agency’s aid (after aid not going to a specific country) appears to be Austria.

Nevertheless, despite the issues surrounding data quality, it was a useful exercise to show both the value of open data – that if you release your data, you can do pretty cool things with it – and the costs of keeping it locked away, namely that the data then has to be scraped from sites in quite a labour-intensive way.

These, along with many other datasets discovered on the day via tweets and emails have been added to the Open Data Day Group on theDataHub.org.

On the same day, we worked to get the data released as part of the International Aid Transparency Initiative into OpenSpending. You can see the results of the IATI wrangling process on OpenSpending.org/iati. This following section is written by Mark.

1. Getting the data

Downloading the existing IATI data has already become quite a big task; with 19 publishers so far, the data currently amounts to over 750MB with 1169 packages. Fortunately this is made easier by the IATI Registry, which provides an API to access all existing datasets, and a simple script (links at end) can retrieve all of the data.

2. Extracting the data

Extracting the data from the XML files is more complicated. Although IATI data uses a standard schema, there are a few cases where publishers have either used the markup incorrectly, or else interpreted the definitions slightly differently. This can be simple problems such as stating that an organisation is “implementing” rather than “Implementing”, or placing the date within the text of the tag and not the “iso-date” attribute of that tag, or more significant problems such as placing implementing organisations in the “accountable” organisation field.

However, these problems are still fairly limited and follow fairly regular patterns, so they are not too hard to overcome. There are more significant problems when some donors have for example used three-letter (ISO-3) country codes, rather than two-letter (ISO-2) country codes. (This is considered below in “next steps”.)

3. Wrangling the data

OpenSpending is designed to show spending data, and has a powerful aggregation system to show large collections of transactions in a meaningful way. However, IATI data is organised by activities, with transactions nested within activities (projects), and – reflecting the business models of funders – activities sit within other activities (e.g., projects within programs), although they are not nested in the actual XML. Furthermore, one of the significant advantages of IATI compared to other aid data formats is that it permits multiple sectoral classifications, allowing you to assign a proportion of the value of an activity to each sector. So, you might have an activity that is 50% related to health and 50% to education.

To prepare the data for OpenSpending, each transaction inherits the properties of its activity (and, if that activity has a parent, that parent activity’s title and description). Then, the transaction is broken out into mini transactions, with the proportion of the activity assigned to each sector used to assign a proportion of the value of the transaction to each sector. So, from transactions, you get mini “sector-transactions”.

This takes about 40 minutes to compile, and then one final step remains: to convert the currencies to a single currency. Currently, USD, EUR and GBP amounts are used in the IATI data. All data is converted to USD using the average for 2010 from the OECD’s Financial Indicators (MEI) dataset. (This is also considered below in “next steps”.)

4. Loading the data

OpenSpending’s new web-based loading interface makes it relatively easy to load data in, although you currently also have to write a model and views (links at end).

Results

The results can be viewed in the OpenSpending IATI dataset. You can explore the data by recipient country, sectors, funding organisation, and drill down through the data to see the data for an individual country.

Problems with the data

So far I’ve noticed the following problems:

  • “Unknown” recipient location is incorrectly marked as “South Sudan”
  • Recipient countries are listed twice, as Spain has used ISO3 rather than ISO2 country codes.
  • Sweden is listed as “Ministry of Foreign Affairs” (this is how they have listed themselves as the Funding Organisation in the data)
  • Sweden’s implementing organisations have been lost as they placed them in the accountable organisation field.

Please let me know if you see anything else problematic, if you have and criticisms of feedback of the way the data has been presented, or if you think there are other ways you’d like to be able to explore the data, based on the available dimensions.

Next steps

As mentioned above, there are some problems with the data which should properly be dealt with at the level of the donor agency. But there are others that will probably have to be dealt with by users of the data:

  • Mapping between different sector vocabularies, so that you can see all “Health” projects, and not only the health projects according to a single vocabulary
  • Mapping between countries and regions, so that every project in a country has a related region
  • Correctly converting currencies using the “value-date” column to get a more precise (at least month-specific) conversion.

What else have you noticed with the data? Is there anything else that should be changed? Anything interesting?

You can contact Mark about this data via the OpenSpending mailing list

Useful Links

How Spending Stories Spots Errors in Public Spending

December 5, 2011 in Data Journalism, Spending Stories

This article was originally published on MediaShift Idea Lab and was co-written by Martin Keegan, project lead for Spending Stories and Lucy Chambers, Community Coordinator for OpenSpending.

How public funds should be spent is often controversial. Information about how that money has already been spent should not be ambiguous at all. People arguing about the future will care about the present, and if data about past or present public spending is available, many will certainly look at it. When they do, occasionally they will find errors, or believe themselves to have found errors.

OpenSpending, which aims to track every (public) government and corporate financial transaction across the world, encourages users to:

  • augment the existing spending database with additional sources of data
  • use that data — e.g., to write evidence-based articles and formulate informed decisions about how their society is financed.

Spending Stories is our effort to make OpenSpending a natural way to do data journalism about public spending.

openspending.jpg

The Problem

FACT 1: Errors occur in data, no matter how official the source.

FACT 2: Data wrangling (manipulating or restructuring datasets to correct inaccuracies, remix with other datasets to augment the data, or perform calculations on the data), generally improves data quality, for example, through reconciling entities and flagging amounts that are obviously incorrect.

FACT 3: Data wrangling can also introduce errors if not tackled correctly.

Crucial to ensuring the use of this data in articles or ensuring re-use by concerned citizens is the ability to show that the data is valid. In addition, maintaining a good relationship with public bodies who are confident that they are not being misrepresented in the data is vital to ensuring the data continues to be released in the first place. In practice, this means that the provenance of the data has to be clear including:

  • where the data originally came from (preferably a URL)
  • whether anyone (e.g., government, community data wrangler, or OpenSpending) has worked on the data since it was published, and what steps they took to change the data (i.e., these steps should be reproducible to produce the same result)

The OpenSpending team has gone to lengths to retain enough information to say who was responsible for both of the above.

OpenSpending is a system, somewhat like a wiki, which allows you to track back through the data wrangling process and work out what changes were made to the data, when and by whom.

Error reporting in practice

OpenSpending recently received a pointed inquiry from the U.K. Treasury disputing the claims we were making about the payment of British public money to a private company. Believing that an error had been introduced, we attempted to retrace our steps and find out where this had occurred, and who was responsible.

As we discovered, the payment had actually taken place, but the the OpenSpending descriptions used to label the transaction were not sufficiently detailed to accurately reflect the item in question.

With Spending Stories, we were able to retrace our steps because we had preserved a copy of the software tools we used for collecting the data (the data is published by about 50 public bodies, and must be downloaded, stitched together, and firmly molded into shape). These tools had been also made available to the public, so the Treasury and other concerned citizens could have checked our work themselves; the availability of this kind of check keeps all participants in the fiscal debate honest.

What had gone wrong was a problem of terminology: The transactions existed, but ambiguous language had been used to describe them, glossing over the distinction between the government department reporting what money had been spent and the government agency which actually spent the money. The bodies in question were the Department of Health and a regional health care trust; this distinction is certainly one which a concerned citizen would expect to be made clearly — so we should make sure our system makes it easy to know which question is being asked.

Checkpoints in OpenSpending

In the short term, we are mitigating the problem of data errors as follows:

  • Data provenance – is the source identifiable and the process reproducible? OpenSpending encourages people to add modified datasets to a “package” in the Data Hub. This allows other users to see the original document alongside any modified documents and track the chain of changes made to see clearly which points errors could have been introduced.
  • Crowdsourcing feedback on spending data.
  • Permitting re-use of the structured data we present, so that it can inform decisions in other fact-checking systems.

Ultimately, we will build our part of the ecosystem to provide feedback to the political process, by improving democratic discourse about the public finances.

Lucy Chambers is a community coordinator at the Open Knowledge Foundation. She works on the OKF’s OpenSpending project and coordinates the data-driven-journalism activities of the foundation, including running training sessions and helping to streamline the production of a collaboratively written handbook for data journalists.

Martin Keegan is a software engineer and linguist, currently leading the Open Knowledge Foundation’s OpenSpending project. He is also on the Open Knowledge Foundation’s board, and has worked for SRI, Citrix, University of Cambridge and co-founded and worked for various civil society organizations.

OpenSpending visualisations featured in the Guardian

November 28, 2011 in Coverage

This post is by Lucy Chambers, Community Coordinator on OpenSpending.

On Friday, the Guardian Poverty Matters blog published a piece on the Uganda visualisation that the OpenSpending team had been working on with Publish What You Fund.

From the article

“The Publish What You Fund campaign group and the Open Knowledge Foundation have now produced a visualisation of Uganda’s aid and budget data for 2003-2006, billed as the first time both sets of data have been displayed together in a way that is easy to explore. A quick look shows just how big a piece of the puzzle aid spending is – more than 50% of overall resources available in Uganda for 2005-2006. The vast majority of this $1.1bn in aid was spent directly by donors on various projects, with only a third given to the government to spend along with its domestic resources. Interestingly, aid money made up only a small proportion of resources for education, while accounting for the majority of resources for health, agriculture, water and the environment.”

Busan Aid effectiveness meeting

The release of the visualisation comes ahead of the Busan aid effectiveness meeting and highlights some of the key benefits of opening up spending data, both to the donor organisations and the governments of the recipient countries themselves:

“Four years ago, researchers at the London-based Overseas Development Institute took up the enormous task of trying to figure out how dozens of donors were spending aid in Uganda, and how that compared with where the government was allocating its own resources. The results were striking: it turned out the Ugandan government was only aware of half the aid being spent in the country, despite routinely requesting this information from donors.”

It is hoped that visualisations such as these will make it easier to digest complex datasets of this type, where a government receives support from multiple sources. It is also hoped that discussions around the topic will result in the more timely and regular release of data to help highlight practices that will lead to aid money being most effectively spent.

Read the full Article in the Guardian Poverty Matters blog.

Have data similar to this you would like to create a similar visualisation for? Drop us an email via the OpenSpending mailing list.