For the previous decade, ICIJ’s Datashare has been the gateway for a whole lot of reporters exploring the greater than 100 million leaked recordsdata on the coronary heart of investigations just like the Panama Papers, Pandora Papers, FinCEN Recordsdata and extra.
Now, after 10 years of steady enhancements, the analysis platform has acquired a significant improve, with a collection of recent and up to date options geared toward making the device simpler, sooner and extra accessible for a greater variety of customers.
The 2-year redesign venture drew on suggestions from dozens of journalists, culminating in a brand new interface that enhances performance and a design system that enables future platform upgrades to be built-in simply and seamlessly. See what’s new.
Sustaining a mountain of knowledge
Datashare allowed ICIJ to centralize all of the extremely delicate paperwork from the largest leaks in historical past in a safe place. Investigations just like the FinCEN Recordsdata and Pandora Papers relied on Datashare to assist journalists unearth secrets and techniques buried in advanced units of paperwork utilizing highly effective search performance and AI-fueled options.
However with every new investigation over the previous 10 years, our builders have made iterative updates to the free, open-source platform, usually responding to requests from our customers or to the distinctive wants of several types of knowledge on the coronary heart of our varied investigations.
Over time, these enhancements made the person expertise advanced. For journalists who spend hours buried in paperwork inside Datashare, they want simplicity and effectivity. Even essentially the most tech-savvy reporters had been lacking as a lot as 20% of Datashare’s highly effective options that weren’t seen sufficient within the person interface.
On the identical time, we’ve heard curiosity from different person teams who may benefit from a device like Datashare — customers corresponding to researchers, civil servants and personal companies.
After 10 years, it was time for a complete makeover.
Datashare for all
To revamp Datashare, we started with our customers: what did they like, what did they want, and the way may we make the device extra accessible and helpful for extra folks?
The design section included on-line person interviews with reporters from all all over the world, focus teams with journalists to check design choices, session with an accessibility specialist to verify the brand new interface complies with accessibility tips and on-line high-fidelity prototype testing of the ultimate variations.

Throughout this course of, we discovered that even when journalists actually love Datashare, when pushed they’d change into like pals choosing faults together with your ex after a breakup: some mentioned Datashare was too sluggish, others weren’t conscious of some key options, explicit bugs had been actually annoying, document-opening animations had been inefficient (journalists undergo a whole lot of paperwork per day). One person even confessed “you understand, Datashare is a frightening place.” We knew we had work to do.
Good design permits customers to simply and intuitively navigate software program — both as a result of features are clear or, as Jakob’s legislation suggests, as a result of buttons are the place customers count on them to be, primarily based on their expertise with different generally used web sites.
A part of the problem on this course of got here from certainly one of Datashare’s most vital safety features — we don’t gather any details about our customers. We don’t know what gadgets they’re utilizing, which buttons they principally click on on or some other contextual info. With out this knowledge, we had to make sure we had been using greatest design practices to make Datashare work for all customers no matter who they had been or how they had been utilizing to the platform.
We introduced in an accessibility advisor to assist: we elevated buttons’ sizes, font sizes and contrasts, on prime of engaged on a greater keyboard navigation and labeling elements in order that display readers higher work on Datashare.
Every web page was designed with info hierarchy and prioritization in thoughts: as elements are greater for accessibility causes, there may be much less room on every web page. As a designer, I labored, with customers’ enter, on deciding which options are extra vital or which may very well be grouped to scale back cognitive load. This was an enormous problem to deal with on an utility with a number of choices and a great deal of info to show.
The result’s a streamlined interface that’s accessible and intuitive for extra customers, whether or not journalists, researchers or anybody else who would possibly profit from the device.
One other problem was responsiveness: most journalists spend hours on computer systems to discover 1000’s of pages from 1000’s of paperwork. However typically they entry Datashare from smaller screens, and even their smartphones. This made it all of the extra vital to handle the hierarchy of our elements and the knowledge we had been displaying.
For consistency, we created a whole design system to deal with all of the elements of the Datashare person interface: buttons, search bar, playing cards, menu, tables, and extra. This method retains us constant throughout pages and makes the platform extra comfy for customers, who at all times see the identical elements within the utility. It additionally helps future-proof the design for brand spanking new options to be added within the coming months and years.
So… what’s new?
Let’s dive into Datashare’s new interface:
Navigation: the menu has extra entries to immediately attain all pages, saving a number of clicks every time.
Doc’s full display view: you may develop the view of a doc for a extra comfy studying expertise and use a carousel with previews to navigate between paperwork
Settings: on all pages, you may customise what knowledge is displayed (metadata for paperwork’ playing cards, knowledge columns for tables, and so on). Every investigation is completely different and typically, you need to see paperwork’ creation date whereas different instances, the language or kind of the doc is extra vital
Person knowledge panels: the variety of tags and suggestions are actually grouped and summed up on prime of a doc and you may open respective panels to see extra particulars on the left of the doc
Filters: as a result of the checklist of obtainable filters has grown over time, it was time to group them into 3 collapsable classes: paperwork information, person knowledge and entities
Batch search outcomes: a brand new web page now lets you immediately see the variety of outcomes per question, whereas earlier than, you wanted to make use of the filters one after the other
Darkish mode: now you can change between gentle and darkish modes
You’ll be able to try this new model on Datashare’s demo web site the place you may discover the paperwork from the LuxLeaks investigation.
What’s subsequent: speech-to-text, structured content material and rather more
The brand new model of Datashare is now out there. Despite the fact that the prototype was examined on dozens of beta customers, we count on suggestions from customers who’re going to work on their very own paperwork with this model. As we’ve skilled for the previous 10 years, every investigation is exclusive and requires particular changes: whether or not it’s translations, file varieties, entities or the way in which the paperwork had been initially scanned, every set of paperwork brings its personal problem. Datashare goals to assist customers get one of the best from their recordsdata in as many use circumstances as attainable.
As an open supply software program, we share our backlog on Github and we welcome suggestions and ideas. Upcoming options within the pipeline embrace:
- Superior search type: we need to present a fast, user-friendly type, as a substitute of requiring customers to make the most of operators like AND, OR, NOT or others within the search bar
- Cut up view: for PDFs, we’d like customers to see each the extracted textual content and the unique doc on the identical display to allow them to shortly entry the unique web page the place their search time period has been discovered
- Shared saved searches: customers ought to be capable to share the searches that they discovered attention-grabbing with colleagues, as they do for batch searches
- Speech-to-text: customers will be capable to ask Datashare to extract textual content from audio and video recordsdata
- Structured content material: we need to extract content material from the paperwork in a greater manner by preserving the information’s authentic format and construction. We created a person survey on structured content material – please fill it in in case you have time
- Re-extract textual content from paperwork: if textual content recognition is imperfect on scanned paperwork, customers ought to be capable to flag these recordsdata for re-processing
- Alerts: customers will be capable to arrange alerts and obtain notifications when new paperwork added to Datashare match specified search phrases
- Highlights: we’ve plans to let customers resolve the size and the variety of highlights they see after they seek for a reputation and shortly see it in context within the paperwork
Anybody can obtain Datashare at no cost and work with their very own paperwork on their pc with the native model of the software program. A server model may also be arrange by superior technologists to permit groups to collaboratively work on the identical paperwork on-line. The older variations of Datashare are nonetheless out there on datashare.icij.org. If in case you have any suggestions or ideas on this new model, please share them with us by writing at datashare@icij.org.