Tag: deepbills

A Transparency Milestone

This week, I reported at the Daily Caller (and got a very nice write-up) about a minor milestone in the advance of government transparency: We recently finished adding computer-readable code to every version of every bill in the 113th Congress.

That’s an achievement. More than 10,000 bills were introduced in Congress’s last-completed two-year meeting (2013-14). We marked up every one of them with additional information.

We’ve been calling the project “Deepbills” because it allows computers to see more deeply into the content of federal legislation. We added XML-format codes to the texts of bills, revealing each reference to federal agencies and bureaus, and to existing laws no matter how Congress cited them. Our markup also automatically reveals budget authorities, i.e., spending.

Want to see every bill that would have amended a particular title or section of the U.S. code? Deepbills data allows that.

Want to see all the bills that referred to the Administration on Aging at HHS? Now that can be done.

Want to see every member of Congress who proposed a new spending program and how much they wanted to spend? Combining Deepbills data with other data allows you to easily collect that imporant information.

Congress’s Archaic Information Practices

There have been more than 2,700 bills introduced so far in the current Congress. That’s more than 30 bills per day, every day this year, weekends included. Ordinary Americans have a hard time keeping up, of course. Congress does, too.

The controversy around the anti-sex-trafficking bill in the Senate last week illustrates this well. Debate around the formerly non-controversial bill fell into disarray when Democrats discovered language in the bill that would apply the Hyde Amendment to fines collected and disbursed by the government. (The Hyde Amendment bars government spending on abortion. Democrats argue that it has only applied in the past to appropriated funds, not disbursement of fines.)

How is it that it took until late March for Democrats to discover controversial language in a bill that was introduced in January?

Well, Congress is awash in archaic practices. For one, bills are written in “cut and bite” style—change this line, change that word, change another—rather than in a form that lays out what the law would look like if the bill were passed. That makes bills unreadable—a situation Rep. Justin Amash (R-MI) has sought to remedy.

Computer-Aided Reporting: Looking Where the Light Is Good

Upshot (New York Times) writer Derek Willis tweeted this morning, “We need to stop doing stories (and maps) with meaningless data.” At the link, a story on Vox charts the poorest members of Congress. It’s based on a Roll Call story published in September.

His main point, I think, is the failure of the data to reliably reflect what it’s supposed to. The disclosures on which these stories rely don’t include the value of homes members own, for example, and information is reported in broad bands, so it’s probably not very accurate and may be wildly inaccurate.

The data is meaningless in another, more important way. Neither story suggests any correlation between wealth (or its absence) and legislators’ behavior or fitness for office. It’s just a look at who has money and who doesn’t—uninformative infotainment. Maybe some readers stack up inferences to draw conclusions about Congress or its members, but this is probably an exercise in confirming one’s biases.

This illustrates a real problem for computer-aided journalism. When the only data available depicts a certain slice of the world, that will skew editorial judgments toward that slice of the world, overweighting its importance in news reporting and commentary.

In my opinion, reporting on public policy suffers just such a skew. There is relatively good data about campaign financing and campaign spending, which makes it easy to report about. The relatively high level of reporting on this area makes it appear more important while the actual behavior of public officials in office—the bills they sponsor, the contents of bills, amendments, votes, and the results for society—goes relatively unreported.

It won’t be the fix for all that ails reporting on public policy, but our Deepbills project makes essential content of legislation available as data. It vastly expands the territory around U.S. federal public policy that computer-aided jounalists can cover. Deepbills data has been picked up various places, but we need more adoption before it will provide all the value it can to a better-informed public.

Update: On Wednesday, the House Government Reform and Oversight Committee will have a hearing on implementation of the DATA Act, which could yet further expand the data available to journalists, and all of us.

Public Oversight of Congress, One Click at a Time

In mid-August, using Cato Deepbills data, the Legal Information Institute at Cornell University started alerting visitors to its U.S. Code pages that the laws these visitors care about may be amended by Congress.

The most visited bills are an interesting smattering of issues.

Getting top clicks is H.R. 570, the American Heroes COLA Act. Would it surprise you to learn that beneficiaries of Social Security’s Old Age, Survivors and Disability Insurance program are looking to see if veterans’ disability compensation will get the same cost-of-living increases? The relevant section of the Social Security Act on the Cornell site points to the bill that would grow veterans’ benefits in tandem with Social Security recipients’.

S. 1859, the Tax Extenders Act of 2013, is the second bill with the most referrals from Cornell. People looking into federal regulation of health insurance—or myriad other statutes—are finding their way to this complex piece of legislation. We know visitors to the Cornell site are legally sophisticated. They just might be able to follow what S. 1859 does.

Immigration is a hot-button issue, and Deepbills links at Cornell such as the code section dealing with reimbursement for detaining aliens are sending people to S. 744, the Border Security, Economic Opportunity, and Immigration Modernization Act.

Another hot-button issue and top source of clicks from Cornell’s site: federal gun control. People looking at gun control law are following links to Senator Dianne Feinstein’s (D-CA) bill to ban assault weapons.

As of Thursday morning, 674 people had clicked 855 times on links to the bills in Congress that affect the laws they’re interested in. Those numbers aren’t going to instantaneously revive public oversight of the government. But usage of these links is rising, and Tom Bruce at Cornell says he plans changes that may increase clicks by 3 to 5 times. He guesses that people see Cato’s sponsorship of the data they can access 20,000 times a day. (“I should have asked you for a penny per impression ),” he says. Funny guy.)

A lot more people are aware of work Cato is doing to increase government transparency, but, more importantly, a small but growing cadre of people are being made aware of what Congress is doing. This positions them to do something about it. Public oversight of Congress is increasing one click at a time.

Transparency Is Breaking Out All Over!

On Monday, Cato is hosting a briefing on Capitol Hill about congressional Wikipedia editing. Over a recent 90-day period, there were over 400,000 hits on Wikipedia articles about bills pending in Congress. If congressional staff were to contribute more to those articles, the amount of information available to interested members of the public would soar. Data that we produce at Cato go into the “infoboxes” on dozens and dozens of Wikipedia articles about bills in Congress.

A popular Twitter ‘bot called @congressedits recently created a spike in interest about congressional Wikipedia editing. It puts a slight negative spin on the practice because it tracks anonymous edits coming from Hill IP addresses, which are more likely to be inappropriate. But Congress can do a lot of good in this area, so Cato intern Zach Williams built a Twitter ‘bot that shows all edits to articles about pending federal legislation. This should draw attention to the beneficial practice of informing the public before bills become law. Meet @Wikibills!

Also, as of this week, Cato data are helping to inform some 26 million visitors per year to Cornell Law’s Legal Information Institute about what Congress is doing. Thanks to Tom Bruce and Sara Frug for adding some great content to the LII site.

Let’s say you’re interested in 18 U.S. Code § 2516, the part of the U.S. code that authorizes interception of wire, oral, or electronic communications. Searching for it online, you’ll probably reach the Cornell page for that section of the code. In the right column, a box displays “Related bills now in Congress,” linking to relevant bills in Congress.

Those hyperlinks are democratic links, letting people know what Congress is doing, so people can look into it and have their say. Does liberty automatically break out thanks to those developments? No. But public demands of all types—including for liberty and limited government—are frustrated now by the utter obscurity in which Congress acts. We’re lifting the curtain, providing the data that translates into a better informed public, a public better equipped to get what it wants.

The path to liberty goes through transparency, and transparency is breaking out all over!

Government Data Flows Visualized

Today, I’m at the House Administration Committee’s Legislative Data and Transparency Conference. It’s become the annual confab for learning what the House is doing to improve transparency, for learning what the Senate is not doing to improve transparency, and to mix and mingle with others working on opening Congress’s deliberations to digital access.

In our 2012 study, Grading the Government’s Data Publication Practices, we issued letter grades reflecting the quality of data the government makes available about its own deliberations, managment, and results, covering legislative process and budgeting, appropriating, and spending. The grading was based on criteria set out in an earlier study, Publication Practices for Transparent Government.

Grades are a way of showing the public, opinion leaders, and legislators what’s going on. For most areas, the grading study showed that access to data is relatively poor.

There is no question that people are working hard on things, and the House has consistently put in the most effort over the last few years. (The recently passed DATA Act now requires the administration to make an effort. Oversight and badgering will help ensure that it does.)

My contribution this year is a brief talk in which I’ll present what’s happening with data another way: by presenting a visualization of what’s happening with data flows—pictures!

Water is a good metaphor for data. Ideally, data would emerge at the source, like a spring, drinkable and ready for use. But very often, key information about government is not available as data at all. People have to pump it out of the ground, turning paper or PDF documents into usable data. Sometimes data isn’t in a format that’s truly useful. It’s undrinkable or “polluted.”

A lot of people in a lot of places are working to take data that is not ready for use and make it available. Our own contribution at Cato is the Deepbills project, which adds data to bills that allows computers to more readily access their meaning. Like a little water treatment plant. It’s not the only one.

It’s a big file (5.6 MB), but if you want, you can look through the PowerPoint. (Ignore the “Soup to Nuts” page—that’s a funny, funny joke, in my opinion, aimed at those who attended last year.)

Let’s See What DATA Can Do

The New York Times reported at the top of page one yesterday on the $4.1 million in payments that a single physical therapist in Brooklyn got from Medicare in 2012. It’s a shocking sum, and Medicare fraud is common in both physical therapy and the Brooklyn area. The therapist who received the money says that the billings are for his large, multi-office practice.

The point is broader: Reporters, medical trade association figures, investigators and researchers are poring over newly released data about Medicare spending. They’re strengthening public oversight and the public’s capacity to question this government program. It’s data that the American Medical Association and other industry groups fought against releasing. There is risk that the numbers will lead some to unfair conclusions, perhaps even in the case of this Brooklyn physical therapist, but the public oversight it brings to the Medicare program and the circumspection it brings to fraudsters and others will be more than worth it. Data is a powerful oversight tool.

That’s why I think it’s good news that the House of Representatives passed the DATA Act yesterday. The Digital Accountability and Transparency Act, introduced by Mark Warner (D-VA) in the Senate and Darrell Issa (R-CA) in the House, requires the federal government to adopt data standards for all federal spending and publish all of it online. This will permit the public to gather insights like the ones in that New York Times story across the vastness of the federal spending enterprise. It will make the diffuse cost of government a little more acute in the minds of many, positioning Americans to say specifically which spending should stop.

Change will not come instantly, and the legislation is not self-executing, but groups like the Data Transparency Coalition, a prime mover behind the legislation, appear poised to insist on full execution of the law. Implementation should not have the cost that the Congressional Budget Office estimated for it, and if it does, the billions saved thanks to availability of information to the public should justify the costs. If another “cost” of transparency is improvement of federal programs that should be eliminated, I think that beats the today’s status quo of having them on the books and failing.

The DATA Act is not a direct response to a 2008 Cato event asking the Obama administration to “Just Give Us the Data.” Indeed, the administration has been conspicuously unsupportive of transparency in this area, though transparency was a key campaign theme in President Obama’s first election. Cato studies in this area since then include “Publication Practices for Transparent Government” and “Grading the Government’s Data Publication Practices.” We’ll be repeating the grading study during the summer, though it’s doubtful the administration’s grades will improve by that time. We will use the data structures that the DATA Act requires in our Deepbills project, which shines light on Congress’s proposals, including its plans for spending.