AAC | An audio format designed by Apple as a successor to MP3 and used in its iPod and iTunes products |
AACR2R | Anglo-American Cataloging Rules (second edition, revised), published by the American Library Association |
ASCII | American Standard Code for Information Interchange, a 1968 standard 7-bit code for representing the Roman alphabet plus numerals and special symbols |
Authority control | The practice of using a list of preferred terms or proper names for metadata values within a library catalog |
BibTeX | Scheme for managing bibliographic data and references within documents in the TeX format |
Boolean query | Query to an information-retrieval system that may contain AND, OR, NOT |
Browsing | Accessing a collection by scanning an organized list of metadata values associated with the documents (such as author, title, date, and keywords) |
Building | Process of creating the indexing and browsing structures that are used to access a collection |
Case-folding | Making uppercase and lowercase words look the same, for searching purposes |
CD-ROM | Compact disk read-only memory, a 650-MB disk that uses the same technology as audio CDs |
CGI | Common Gateway Interface, a scheme that allows users to activate programs on the host computer by clicking on Web pages |
CGI script | Code associated with a button, menu, or link on a Web page that specifies what the host computer is to do when clicked |
Collection | Set of documents that are brought together under a uniform searching and browsing interface |
Collection configuration file | File that specifies how a Greenstone collection is to be imported and built, what indexes and language interfaces are to be provided, and so on |
Cookie | A short message sent by a Web server to a Web browser and then returned unchanged by the browser each time it accesses that server, the purpose being to retain a unique identifier for the user between invocations |
CSS | Cascading Style Sheets, a way of controlling the presentation of HTML and XML documents |
DDL | Description Definition Language, a form of XMLSchema used in MPEG-7 to allow users to create their own metadata format |
Digital library | Collection of digital objects (text, audio, video), along with methods for access and retrieval, as well as methods for selection, organization, and maintenance |
Document | Basic unit from which digital library collections are constructed, which may include text, graphics, sound, and video |
DOI | Digital Object Identifier, a scheme for permanent URLs that uses the Handle system and is maintained by the International DOI Foundation |
DSpace | Open source institutional repository software, designed by the MIT Library in conjunction with Hewlett-Packard (Document Space) |
DTD | Document Type Definition, a specification used in XML (and also SGML) to express the structure of a particular set of documents |
Dublin Core | An intentionally minimalist standard for describing metadata, designed to be applied to resources on the Web |
DVD | Digital Versatile Disk, a disk format that can hold from 5 to 20 GB of data |
Dynamic HTML | Combination of HTML with CSS, Web-page scripting, and the document object model that provides functionality comparable to XSL |
EBCDIC | Extended Binary Coded Decimal Interchange Code, an alternative code to ASCII |
Encapsulated PostScript | Variant of PostScript designed for expressing graphics of a single page or less that are to be included in other documents |
Endnote | Commercial program for managing bibliographic data and references within documents |
Faceted classification | Browsing scheme that allows users to navigate the document space using different metadata elements or “facets” |
FAO | United Nations Food and Agricultural Organization |
Fedora | A repository architecture founded upon a powerful digital object model, which is extremely flexible and configurable (Flexible Extensible Digital Object Repository Architecture) |
FLAC | Lossless audio coding scheme (Free Lossless Audio Codec) |
FO | Formatting Objects, the component of the XML specification that deals with page layout and related formatting issues |
FTP | File transfer protocol |
GB-encoding | Standard way of encoding the Chinese language |
GIF | Graphics Image Format, a widely used compression scheme for lossless images |
GNU Public License | Software license that permits users to copy and distribute computer programs freely, and to modify them, so long as all modifications are made publicly available |
Greenstone | The name of the digital library software used as an example in the book (www.greenstone.org) |
HTML | HyperText Markup Language, the language in which Web documents are written |
HTML Tidy | Software utility that converts older HTML formats to XHTML |
HTTP | Hypertext transfer protocol |
Hyperlink | Link to another document or to another place within this document |
Importing | Process of bringing collections of documents into a digital library system |
Index | Information structure that is used for searching or browsing a collection |
Institutional repository | A specialized kind of digital library designed to collect, preserve, and disseminate an institution's intellectual output |
ISBN | International Standard Book Number |
ISCII | Indian Script Code for Information Interchange, an ASCII extension that incorporates Brahmi-based Indic scripts |
JPEG | Standard for (mainly) lossy image compression, named after the Joint Photographic Experts Group |
JPEG 2000 | Later version of the JPEG image compression standard |
LCSH | Library of Congress Subject Headings, a controlled vocabulary for assigning subject descriptors |
LZW | Lempel-Ziv-Welch compression scheme, used in GIF, TIFF, PDF, and PostScript Level 2 and above |
MADS | Metadata Authority Description Schema, an XML-based format for authority control information |
MARC | Machine-readable cataloging format, a metadata scheme designed in the late 1960s for use by professional library catalogers |
MARCXML | A standard way of representing MARC data in the XML language |
Metadata | Structured information, such as author, title, date, keywords, and so on, that is associated with a document (or document collection) |
METS | Metadata Encoding and Transmission Standard for the representation, maintenance, and exchange of digital objects |
MIDI | Musical Instrument Digital Interface, a representation of music used by music synthesizers |
MIME | Multipurpose Internet Mail Extensions, a standard for including different types of file—text, images, audio, video, or application-specific data—in e-mail messages |
Mirror | The process of copying a Web site, or part of a Web site, to another location and making it available there |
MP3 | A lossy digital audio encoding format (MPEG-1 Audio Layer 3) |
MODS | Metadata Object Description Schema, an XML-based format for metadata from the U.S. Library of Congress |
MPEG | Set of standards for representing multimedia material, named after the Motion Picture Experts Group |
MXF | Material Exchange Format, a container format for video data |
New Zealand Digital Library (NZDL) Project | Research project in the Computer Science Department at the University of Waikato, New Zealand, that created the Greenstone software (www.nzdl.org) |
OAI-ORE | Standard designed expressly for representing aggregations of digital objects (Open Archives Initiative Object Reuse and Exchange) |
OAI-PMH | Protocol designed for the efficient dissemination of digital library metadata (Open Archives Initiative Protocol for Metadata Harvesting; often abbreviated to simply OAI) |
OCR | Optical Character Recognition, the process of producing a digital representation of the textual content of a document image |
ODF | Open Document Format, an XML-based set of specifications for office documents |
Ogg Vorbis | Lossy digital audio encoding format specifically designed to be free of all patent restrictions |
Ogg Theora | Lossy digital video encoding format specifically designed to be free of all patent restrictions |
OID | Object Identifier, a unique identification code associated with a document or other digital object |
OMR | Optical Music Recognition, the musical analog of OCR |
OOXML | Office Open XML, an XML-based format for representing documents in Microsoft Office products |
Phrase query | Query to a full-text retrieval system that comprises several consecutive words |
PDF | Portable Document Format, a page description language designed for interactive use as a successor to PostScript |
PNG | Portable Network Graphics, an open standard for lossless images |
PostScript | The first widely used page description language |
PREMIS | PREservation Metadata Implementation Strategies, a standard for preservation and administrative metadata |
Protocol | Set of conventions according to which two systems communicate |
PURL | Persistent URL, a particular style of URN |
Ranked query | Natural-language query to an information-retrieval system, for which the documents that match the query are sorted in order of relevance |
RDF | Resource Description Framework, a scheme designed to facilitate the interoperability of metadata |
RSLP collection description | A particular application of RDF by the Research Support Libraries Program based at the University of Bath, United Kingdom |
RTF | Rich Text Format, a standard format for interchange of text documents |
Searching | Accessing a collection through a full-text search of its contents (or parts of contents, such as section titles) |
SGML | Standard Generalized Markup Language, a metalanguage for describing markup formats that was standardized in 1986 and forms a precursor to XML |
SQL | Structured Query Language, an industry-standard database-query language |
SRU | Search/Retrieval via URL, a Web service version of Z39.50 |
Stemming | Stripping endings off a term to make it more general |
SVG | XML-based language for describing two-dimensional graphics |
TEI | Text Encoding Initiative, a project founded in 1987 that developed SGML DTDs for representing scholarly texts in the humanities and social sciences |
TIFF | Tagged Image File Format, a file format for raster images that incorporates facilities for descriptive metadata |
UCS | Unicode Character Set, the set of characters supported by Unicode |
UNESCO | United Nations Educational, Scientific and Cultural Organization |
Unicode | Standard scheme for representing the character sets used in the world's languages |
URI | Universal Resource Identifier, a generic name for URLs and URNs |
URL | Universal Resource Locator, a standard way of addressing objects on the Web (but this term is supposed to be superseded by URI) |
URN | Universal Resource Name, a way of naming resources instead of specifying their locations |
UTF | UCS Transformation Format, a scheme for representing Unicode characters with three variants: UTF-32, UTF-16, and UTF-8 |
VRML | Virtual Reality Modeling Language, used for presenting virtual reality experiences on the Web |
Web server | Standard program that computers use to make information accessible over the World Wide Web |
Word | Microsoft Word, a widely used word processing program |
XHTML | Modern version of HTML that incorporates the stricter syntactic rules of XML |
XLink | XML linking language that provides a more powerful method for connecting resources than HTML hyperlinks |
XML | Extensible Markup Language, a metalanguage for describing markup formats for structured documents and data on the Web |
XML Schema | Way of specifying the structure of a particular set of documents that provides more expressive facilities for structures and data types than DTDs |
XMP | Extensible Metadata Platform, a way of embedding metadata within files like images and PDF documents |
XPath | XML specification that allows the selection of parts of a document |
XPointer | Development of XPath that provides a finer degree of control over the parts of the document being selected |
XQuery | Format used for expressing queries in the style of XML |
XSL | Extensible Stylesheet Language, a way of controlling the presentation of XML documents (and XML-compliant HTML ones) that is more expressive than CSS |
XSLT | XSL Transformations, a component of the XML specification that allows you to manipulate parts of the document tree |
Z39.50 | International standard communications protocol developed for use by library catalog systems |