Any data that is stored in an electronic form may be subject to production under common eDiscovery rules. This type of data has historically included email and office documents (spreadsheets, presentations, documents, PDFs, etc.) but can also include photos, video, instant messaging, collaboration tools, text (SMS), messaging apps, social media, ephemeral messaging,
Internet of things (smart devices like
smart watches,
virtual assistants, and
smart home hubs), databases, and other file types. Also included in ediscovery is "
raw data", which forensic investigators can review for hidden evidence. The original file format is known as the
"native" format. Litigators may review material from ediscovery in one of several formats: printed paper, "native file", or a petrified, paper-like format, such as PDF files or TIFF images. Modern document review platforms accommodate the use of native files and allow for them to be converted to TIFF and
Bates-stamped for use in court.
Electronic messages In 2006, the
U.S. Supreme Court's amendments to the
Federal Rules of Civil Procedure created a category for electronic records that, for the first time, explicitly named emails and instant message chats as likely records to be archived and produced when relevant. One type of preservation problem arose during the
Zubulake v. UBS Warburg lawsuit. Throughout the case, the plaintiff claimed that the evidence needed to prove the case existed in emails stored on UBS' own computer systems. Because the emails requested were either never found or destroyed, the court found that they were more likely to exist than not. The court found that while the corporation's counsel directed that all potential discovery evidence, including emails, be preserved, the staff that the directive applied to did not follow through. This resulted in significant
sanctions against UBS. To establish authenticity, some archiving systems apply a unique code to each archived message or chat. The systems prevent alterations to original messages, messages cannot be deleted, and unauthorized persons cannot access the messages. The formalized changes to the Federal Rules of Civil Procedure in December 2006 and 2007 effectively forced civil litigants into a compliance mode with respect to their proper retention and management of
electronically stored information (ESI). Improper management of ESI can result in a finding of spoliation of evidence and the imposition of one or more sanctions, including adverse inference jury instructions,
summary judgment, monetary fines, and other sanctions. In some cases, such as
Qualcomm v. Broadcom, attorneys can be brought before the bar. In
United States v. Heppner (2026), a federal court held that chats with a public generative AI tool, Claude, were not privileged under the circumstances presented, highlighting AI chatbot conversations as a potential category of discoverable electronically stored information.
Databases and other structured data Structured data typically resides in databases or datasets. It is organized in tables with columns, rows, and defined data types. The most common are Relational Database Management Systems (
RDBMS) that are capable of handling large volumes of data such as
Oracle,
IBM Db2,
Microsoft SQL Server,
Sybase, and
Teradata. The structured data domain also includes spreadsheets (not all spreadsheets contain structured data, but those that have data organized in database-like tables), desktop databases like
FileMaker Pro and
Microsoft Access, structured
flat files,
XML files,
data marts,
data warehouses, etc.
Audio Voicemail is often discoverable under electronic discovery rules. Employers may have a duty to retain voicemail if there is an anticipation of litigation involving that employee. Data from voice assistants like
Amazon Alexa and Siri have been used in criminal cases. ==Reporting formats==