Linear density '
(BPI) is the metric for the density at which data is stored on magnetic media. The term BPI can refer to ', but more often refers to
bytes per inch. The term BPI can mean
bytes per inch when the tracks of a particular format are byte-organized, as in nine-track tapes.
Tape width The width of the media is the primary classification criterion for tape technologies. has historically been the most common width of tape for high-capacity data storage. Many other sizes exist and most were developed to either have smaller packaging or higher capacity.
Recording method Recording method is also an important way to classify tape technologies, generally falling into two categories: linear and scanning.
Linear The linear method arranges data in long parallel tracks that span the length of the tape. Multiple tape heads simultaneously write parallel tape tracks on a single medium. This method was used in early tape drives. It is the simplest recording method, but also has the lowest data density. A variation on linear technology is linear serpentine recording, which uses more tracks than tape heads. Each head still writes one track at a time. After making a pass over the whole length of the tape, all heads shift slightly and make another pass in the reverse direction, writing another set of tracks. This procedure is repeated until all tracks have been read or written. By using the linear serpentine method, the tape medium can have many more tracks than read/write heads. Compared to simple linear recording, using the same tape length and the same number of heads, data storage capacity is substantially higher.
Scanning Scanning recording methods write short dense tracks across the width of the tape medium, not along the length. Tape heads are placed on a drum or disk which rapidly rotates while the relatively slow-moving tape passes it. An early method used to get a higher data rate than the prevailing linear method was
transverse scan. In this method, a spinning disk with the tape heads embedded in the outer edge is placed perpendicular to the path of the tape. This method is used in
Ampex's DCRsi instrumentation data recorders and the old Ampex
quadruplex videotape system. Another early method was
arcuate scan. In this method, the heads are on the face of a spinning disk which is laid flat against the tape. The path of the tape heads forms an arc.
Helical scan recording writes short dense tracks in a
diagonal manner. This method is used by virtually all current
videotape systems and several data tape formats.
Block layout and speed matching In a typical format, data is written to tape in blocks with inter-block gaps between them, and each block is written in a single operation with the tape running continuously during the write. However, since the rate at which data is written or read to the tape drive varies as a tape drive usually has to cope with a difference between the rate at which data goes on and off the tape and the rate at which data is supplied or demanded by its host. Various methods have been used alone and in combination to cope with this difference. If the host cannot keep up with the tape drive transfer rate, the tape drive can be stopped, backed up, and restarted (known as
shoe-shining). A large memory buffer can be used to queue the data. In the past, the host block size affected the data density on tape, but on modern drives, data is typically organized into fixed-sized blocks which may or may not be compressed or encrypted, and host block size no longer affects data density on tape. Modern tape drives offer a speed matching feature, where the drive can dynamically decrease the physical tape speed as needed to avoid shoe-shining. In the past, the size of the inter-block gap was constant, while the size of the data block was based on host block size, affecting tape capacity – for example, on
count key data storage. On most modern drives, this is no longer the case.
Linear Tape-Open type drives use a fixed-size block for tape (a
fixed-block architecture), independent of the host block size, and the inter-block gap is variable to assist with speed matching during writes. On drives with compression, the compressibility of the data will affect the capacity.
Sequential access to data Tape is characterized by
sequential access to data, similar to
punched tape or
microfilm. The physical nature of a tape on a spool means that to load a specific file, the tape must be moved into position under the read head. By contrast,
drum or
disk storage allows for much more rapid positioning of read heads (
random access). Serpentine tape drives (e.g.,
QIC,) offer improved access time by switching to the appropriate track; tape partitions are used for directory information. For some systems,
metadata such as file name or modification time may not stored. Modern
file archiver and
backup tools allow for both compression and retention of metadata on tape backups. The
Linear Tape File System is a method of storing file metadata on a separate part of the tape. This makes it possible to
copy and paste files or directories to a tape as if it were a disk, but does not change the fundamental sequential access nature of tape.
Access time Tape has a long random access time since the deck must wind an average of one-third the tape length to move from one arbitrary position to another. Tape systems attempt to alleviate the intrinsic long latency, either using indexing, where a separate lookup table (
tape directory) is maintained which gives the physical tape location for a given data block number (a must for serpentine drives), or by marking blocks with a
tape mark that can be detected while winding the tape at high speed.
Data compression Most tape drives now include some kind of
lossless data compression. There are several algorithms that provide similar results:
LZW (widely supported), IDRC (Exabyte), ALDC (IBM, QIC) and DLZ1 (DLT). Embedded in tape drive hardware, these compress a relatively small buffer of data at a time, so cannot achieve extremely high compression even of highly redundant data. A ratio of 2:1 is typical, with some vendors claiming 2.6:1 or 3:1. The ratio actually obtained depends on the nature of the data so the
compression ratio cannot be relied upon when specifying the capacity of equipment, e.g., a drive claiming a compressed capacity of 500 GB may not be adequate to back up 500 GB of real data. Data that is already stored efficiently may not allow
any significant compression and a sparse database may offer much larger factors. Software compression can achieve much better results with sparse data, but uses the host computer's processor, and can slow the backup if the host computer is unable to compress as fast as the data is written. The compression algorithms used in low-end products are not optimally effective, and better results may be obtained by turning off hardware compression and using software compression (and encryption if desired) instead. Plain text, raw images, and database files (
TXT,
ASCII,
BMP,
DBF, etc.) typically compress much better than other types of data stored on computer systems. By contrast, encrypted data and pre-compressed data (
PGP,
ZIP,
JPEG,
MPEG,
MP3, etc.) normally
increase in size if data compression is applied. In some cases, this data expansion can be as much as 15%.
Encryption Standards exist to
encrypt tapes. Encryption is used so that even if a tape is stolen, the thieves cannot use the data on the tape. Key management is crucial to maintain security. Compression is more efficient if done before encryption, as encrypted data cannot be compressed effectively due to the entropy it introduces. Some enterprise tape drives include hardware that can quickly encrypt data.
Cartridge memory and self-identification Some tape cartridges, notably
LTO cartridges, have small associated data storage chips built in to record metadata about the tape, such as the type of encoding, the size of the storage, dates and other information. It is also common for tape cartridges to have bar codes on their labels in order to assist an automated tape library. ==Viability==