When a judicial opinion cites something from the web, researchers should be able to check it later and find exactly what it looked like when it was cited.1
Unfortunately, legal publication hasn't fully moved into the age of the web, so these web citations are prone to "rot." That is, what a researcher may find (if anything) when they check that URL may not be the same as it was when it was first consulted by the judge or clerk.
SCOTUS WebCites captures a snapshot of any webpage cited by the US Supreme Court as soon as possible after the court opinion is issued.
Courts cite the web, and the web is always changing. In the time between the moment when a website was cited and the time an opinion is published, information on that website can disappear or change subtly... or substantially.
This obscures and dilutes the value of citations to web resources in court documents. Over time, these unarchived resources are exposed ever more to the vagaries of the web—site reorganizations, domain-name lapses, etc. The citations become useless or increasingly suspect.
SCOTUS has begun to address this matter, but at present offers only PDFs made from printouts of webpages, and does so only weeks or months after the citing opinion was published.
A Stopgap Measure
Our system determines in near real-time when the Court publishes an opinion and promptly takes snapshots of the web pages it cites.
Though it's possible the web page has changed in the time between citation and publication of the opinion, this process catches the resource at the freshest possible moment.
A Long-Term Solution
As legal publishing evolves over time, we expect judges, clerks and others to use services like Harvard's perma.cc to create an archive of any web resource they cite and provide links to it as well.
How does this work?
- our application watches the Supreme Court website for new opinions
- when it sees one, it downloads it and converts it from a PDF to text
- it looks for web citations in the text, then passes any it finds to a human on our team for evaluation
- if the resource is findable online when the human validates the URL, the application archives it using perma.cc's API
- users can browse opinions, justices or the master list of citations found in the opinions
-
notifications go out to subscribers whenever citations are harvested from new
opinions
- open an issue on the github repo to request subscriber access.
This site was developed by Phil Ardery and is hosted by UC Berkeley Law Library.