Tag: deepbills

Transparency Is Breaking Out All Over!

On Monday, Cato is hosting a briefing on Capitol Hill about congressional Wikipedia editing. Over a recent 90-day period, there were over 400,000 hits on Wikipedia articles about bills pending in Congress. If congressional staff were to contribute more to those articles, the amount of information available to interested members of the public would soar. Data that we produce at Cato go into the “infoboxes” on dozens and dozens of Wikipedia articles about bills in Congress.

A popular Twitter ‘bot called @congressedits recently created a spike in interest about congressional Wikipedia editing. It puts a slight negative spin on the practice because it tracks anonymous edits coming from Hill IP addresses, which are more likely to be inappropriate. But Congress can do a lot of good in this area, so Cato intern Zach Williams built a Twitter ‘bot that shows all edits to articles about pending federal legislation. This should draw attention to the beneficial practice of informing the public before bills become law. Meet @Wikibills!

Also, as of this week, Cato data are helping to inform some 26 million visitors per year to Cornell Law’s Legal Information Institute about what Congress is doing. Thanks to Tom Bruce and Sara Frug for adding some great content to the LII site.

Let’s say you’re interested in 18 U.S. Code § 2516, the part of the U.S. code that authorizes interception of wire, oral, or electronic communications. Searching for it online, you’ll probably reach the Cornell page for that section of the code. In the right column, a box displays “Related bills now in Congress,” linking to relevant bills in Congress.

Those hyperlinks are democratic links, letting people know what Congress is doing, so people can look into it and have their say. Does liberty automatically break out thanks to those developments? No. But public demands of all types—including for liberty and limited government—are frustrated now by the utter obscurity in which Congress acts. We’re lifting the curtain, providing the data that translates into a better informed public, a public better equipped to get what it wants.

The path to liberty goes through transparency, and transparency is breaking out all over!

Government Data Flows Visualized

Today, I’m at the House Administration Committee’s Legislative Data and Transparency Conference. It’s become the annual confab for learning what the House is doing to improve transparency, for learning what the Senate is not doing to improve transparency, and to mix and mingle with others working on opening Congress’s deliberations to digital access.

In our 2012 study, Grading the Government’s Data Publication Practices, we issued letter grades reflecting the quality of data the government makes available about its own deliberations, managment, and results, covering legislative process and budgeting, appropriating, and spending. The grading was based on criteria set out in an earlier study, Publication Practices for Transparent Government.

Grades are a way of showing the public, opinion leaders, and legislators what’s going on. For most areas, the grading study showed that access to data is relatively poor.

There is no question that people are working hard on things, and the House has consistently put in the most effort over the last few years. (The recently passed DATA Act now requires the administration to make an effort. Oversight and badgering will help ensure that it does.)

My contribution this year is a brief talk in which I’ll present what’s happening with data another way: by presenting a visualization of what’s happening with data flows—pictures!

Water is a good metaphor for data. Ideally, data would emerge at the source, like a spring, drinkable and ready for use. But very often, key information about government is not available as data at all. People have to pump it out of the ground, turning paper or PDF documents into usable data. Sometimes data isn’t in a format that’s truly useful. It’s undrinkable or “polluted.”

A lot of people in a lot of places are working to take data that is not ready for use and make it available. Our own contribution at Cato is the Deepbills project, which adds data to bills that allows computers to more readily access their meaning. Like a little water treatment plant. It’s not the only one.

It’s a big file (5.6 MB), but if you want, you can look through the PowerPoint. (Ignore the “Soup to Nuts” page—that’s a funny, funny joke, in my opinion, aimed at those who attended last year.)

Let’s See What DATA Can Do

The New York Times reported at the top of page one yesterday on the $4.1 million in payments that a single physical therapist in Brooklyn got from Medicare in 2012. It’s a shocking sum, and Medicare fraud is common in both physical therapy and the Brooklyn area. The therapist who received the money says that the billings are for his large, multi-office practice.

The point is broader: Reporters, medical trade association figures, investigators and researchers are poring over newly released data about Medicare spending. They’re strengthening public oversight and the public’s capacity to question this government program. It’s data that the American Medical Association and other industry groups fought against releasing. There is risk that the numbers will lead some to unfair conclusions, perhaps even in the case of this Brooklyn physical therapist, but the public oversight it brings to the Medicare program and the circumspection it brings to fraudsters and others will be more than worth it. Data is a powerful oversight tool.

That’s why I think it’s good news that the House of Representatives passed the DATA Act yesterday. The Digital Accountability and Transparency Act, introduced by Mark Warner (D-VA) in the Senate and Darrell Issa (R-CA) in the House, requires the federal government to adopt data standards for all federal spending and publish all of it online. This will permit the public to gather insights like the ones in that New York Times story across the vastness of the federal spending enterprise. It will make the diffuse cost of government a little more acute in the minds of many, positioning Americans to say specifically which spending should stop.

Change will not come instantly, and the legislation is not self-executing, but groups like the Data Transparency Coalition, a prime mover behind the legislation, appear poised to insist on full execution of the law. Implementation should not have the cost that the Congressional Budget Office estimated for it, and if it does, the billions saved thanks to availability of information to the public should justify the costs. If another “cost” of transparency is improvement of federal programs that should be eliminated, I think that beats the today’s status quo of having them on the books and failing.

The DATA Act is not a direct response to a 2008 Cato event asking the Obama administration to “Just Give Us the Data.” Indeed, the administration has been conspicuously unsupportive of transparency in this area, though transparency was a key campaign theme in President Obama’s first election. Cato studies in this area since then include “Publication Practices for Transparent Government” and “Grading the Government’s Data Publication Practices.” We’ll be repeating the grading study during the summer, though it’s doubtful the administration’s grades will improve by that time. We will use the data structures that the DATA Act requires in our Deepbills project, which shines light on Congress’s proposals, including its plans for spending.

Transparency and Liberty

John McGinnis has some kind words for work I oversee here at Cato in a recent blog post of his entitled: “The Internet–A Technology for Encompassing Interests and Liberty.”

As he points out, the information environment helps determine outcomes in political systems because it controls who is in a position to exercise power.

The history of liberty has been in no small measure the struggle between diffuse and encompassing interests, on the one hand, and special interests, on the other.  Through their concentrated power, special interests seek to use the state to their benefit, while diffuse interests concern the ordinary citizen or taxpayer, or in William Graham Sumner’s arresting phrase, The Forgotten Man. When the printing press was invented, the most important special interests were  primarily the rulers themselves and the aristocrats who supported them. The printing press allowed the middle class to discover and organize around their common interests to sustain a democratic system that limited the exactions of the oligarchs.

But the struggle between diffuse and special interests does not disappear with the rise of democracy. Trade associations, farmers’ associations and unions have leverage with politicians to obtain benefits that the rest of us pay for. As a successor to the printing press, however, the internet advances liberty by continuing to reduce the cost of acquiring information. Such advances help diffuse groups more than special interests.

The Internet is the new printing press, and we’re generating data here at Cato that should allow it to have its natural, salutary effects for liberty.

My favorite current example is the “Appropriate Appropriations?” page published by the Washington Examiner. It allows you to easily see what representatives have introduced bills proposing to spend taxpayer money, information that—believe it or not—was hard to come by until now.

In John McGinnis, we have a legal scholar who recognizes the potential ramifications for governance of our entry into the information age. Read his whole post and, for more in this area, his book, Accelerating Democracy: Transforming Governance Through Technology.

Hacking for Liberty

You’ve probably heard the old parable about the man looking for his car keys under a street lamp because the light is better there.

I’ve regularly worried aloud about the government transparency project following the same path. Most recently, I pointed out that the president’s executive order was about open data, not transparent government.

“Open data” is pretty much any data the government makes available in useful formats – Agriculture Department data about the gender of farm operators, for example. But don’t look there for government transparency. The Ag Department’s check register is still in the dark.

Transparent government is going to result from data that reflects the deliberations, management, and results of all the government’s agencies and organs. It’s fine to release interesting data, and it’s fine for people to build things with it, but the government transparency project doesn’t advance without data about what government entities are thinking and doing, and how well they’re doing it.

That’s why I’m happy to have offered the legislative data we produce to a hack-a-thon happening this week in San Francisco. Lincoln Labs’ Liberty Hackathon offers $5,000 in prizes to the top producers of technologies that advance civic values like individual privacy and economic liberty. “Top ideas and teams will be considered for future investment.” Sounds good.

My hope is that someone will build something that makes it easier to automatically track what’s happening in Congress, like, oh, spending for example. Our data can automatically reveal every bill that proposes spending, the amount, and the purpose. Wouldn’t it be nice to have that information at your fingertips? You might be inspired to contact your senators and member of Congress and tell them what you think. Maybe an app will tell you how your representatives voted on each and every spending bill that becomes law.

“Data excavation” is how Seamus Kraft at the OpenGov Foundation has characterized the work we do in our Deepbills project, and I’ve been very complimented by his recognition of the work. Transparency will not be a gift from government. We’ll have to dig out the data about the government’s deliberations, management, and results. Maybe this weekend some of the projects produced at the Liberty Hackathon will show how excavated government data energize democracy and protect liberty.

Cato’s “Deepbills” Project Advances Government Transparency

It’s not the culmination–that will come soon–but a major step in our work to improve government transparency has been achieved. I’ll be announcing and extolling it Wednesday at the House Administration Committee’s Legislative Data and Transparency conference. Here’s a quick survey of what we’ve been doing and the results we see on the near horizon.

After president Obama’s election in 2008, we recognized transparency as a bipartisan and pan-ideological goal at an event entitled: “Just Give Us the Data.” Widespread agreement and cooperation on transparency has held. But by the mid-point of the president’s first term, the deep-running change most people expected was not materializing, and it still has not. So I began working more assiduously on what transparency is and what delivers it.

In “Publication Practices for Transparent Government” (Sept. 2011), I articulated ways the government should deliver information so that it can be absorbed by the public through the intermediary of web sites, apps, information services, and so on. We graded the quality of government data publication in the aptly named November 2012 paper: “Grading the Government’s Data Publication Practices.”

But there’s no sense in sitting around waiting for things to improve. Given the incentives, transparency is something that we will have to force on government. We won’t receive it like a gift.

So with software we acquired and modified for the purpose, we’ve been adding data to the bills in Congress, making it possible to learn automatically more of what they do. The bills published by the Government Printing Office have data about who introduced them and the committees to which they were referred. We are adding data that reflects:

- What agencies and bureaus the bills in Congress affect;

- What laws the bills in Congress effect: by popular name, U.S. Code section, Statutes at Large citation, and more;

- What budget authorities bills include, the amount of this proposed spending, its purpose, and the fiscal year(s).

We are capturing proposed new bureaus and programs, proposed new sections of existing law, and other subtleties in legislation. Our “Deepbills” project is documented at cato.org/resources/data.

This data can tell a more complete story of what is happening in Congress. Given the right Web site, app, or information service, you will be able to tell who proposed to spend your taxpayer dollars and in what amounts. You’ll be able to tell how your member of Congress and senators voted on each one. You might even find out about votes you care about before they happen!

Having introduced ourselves to the community in March, we’re beginning to help disseminate legislative information and data on Wikipedia.

The uses of the data are limited only by the imagination of the people building things with it. The data will make it easier to draw links between campaign contributions and legislative activity, for example. People will be able to automatically monitor ALL the bills that affect laws or agencies they are interested in. The behavior of legislators will be more clear to more people. Knowing what happens in Washington will be less the province of an exclusive club of lobbyists and congressional staff.

In no sense will this work make the government entirely transparent, but by adding data sets to what’s available about government deliberations, management and results, we’re multiplying the stories that the data can tell and beginning to lift the fog that allows Washington, D.C. to work the way it does–or, more accurately, to fail the way it does.

At this point, data curator Molly Bohmer and Cato interns Michelle Newby and Ryan Mosely have marked up 75% of the bills introduced in Congress so far. As we fine-tune our processes, we expect essentially to stay current with Congress, making timely public oversight of government easier.

This is not the culmination of the work. We now require people to build things with the data–the Web sites, apps, and information services that can deliver transparency to your door. I’ll be promoting our work at Wednesday’s conference and in various forums over the coming weeks and months. Watch for government transparency to improve when coders get a hold of the data and build the tools and toys that deliver this information to the public in accessible ways.