Getty Vocabularies Download Center

The AAT, ULAN, and TGN are currently available for licensing in three formats: XML, relational tables, and MARC. The MARC format will be discontinued in 2010. Sample data may be downloaded from this page free of charge.

Licensing: The Getty vocabulary data is copyrighted by the J. Paul Getty Trust. All rights reserved. Your institution must sign a license and pay the required fees in order to use the data. The data is compiled from various contributors using published sources, which must be cited along with the J. Paul Getty Trust when the data is displayed.

The data files are released annually, in July or slightly earlier each year (barring unforeseen technical difficulties). If you wish to license multiple vocabularies, you must obtain a separate license for each one. Licensing agreements are for a fixed five-year term; annual data updates are now being offered free of charge during that five-year term. The license fee must be paid and the license agreement signed prior to the transfer of data. At the end of five years, you may renew your license for an additional five-year term upon payment of a fee. Thousands of records are updated or added each year; users are advised to update their data annually. Contact us at vocab@getty.edu, subject line: Licensing, to learn the amount of the fees and terms of licenses. Please include an explanation of how you intend to use the data and whether your institution is for-profit or not-for-profit.

Formats of future releases: Beginning with the 2010 release, we plan to release the data in XML and relational tables formats only, discontinuing the MARC format. If you have comments regarding this proposed change, please contact us at vocab@getty.edu. With this change, we will also adopt Unicode in our data, replacing our $xx diacritic codes, so the relational tables and XML files will be released in UTF-8 only.

Upcoming changes to the data structure: The structure of all three vocabularies has been changed to allow for the seamless integration of multilingual terms. One significant change is that the Qualifier field is now linked to language (which has multiple occurrences per Term), to better accommodate qualifiers in multilingual data. Originally, there was only one qualifier per term; however, the same term may be appropriate for multiple languages, while the qualifier should differ by language (e.g., English: gouache (paint); Spanish: gouache (pintura); French: gouache (peinture)). In other words, language has always been a repeating field for each Term; with the revised data structure, the Qualifier also repeats for each Term. In addition, the Term Type was moved to the language link (because, for example, the same term could be a Descriptor in one language but a Used For term in another). A Language Status flag was added, which will eventually be used to flag loan terms. The Scope Note (SN) was made multilingual, so it may have multiple occurrences, each linked to a language. Another change is that the Hierarchical Relationship Type is now linked to the hierarchical relationship as a separate field, whereas it had previously been stored in the relationship historical flag field. The Hierarchical Relationship Type records the thesaural standards codes for the type of relationship between child and parent: BTP (part/whole or partitive), BTG (genus/species or generic), or BTI (instance of). The new data dictionary will be posted soon.
When will these changes be implemented? The new data structure will be included in the June 2010 licensed data file releases. It will appear in the Web services data, online browser displays, Web contribution forms, and import XML schema in early 2010. For those of you who copy and paste from the Web browsers, the descriptor will still continue to display with the qualifier in parentheses, so you will be able to highlight and copy term-plus-qualifier as you are now accustomed to doing.
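
For implementers, the revised structure described above might be modeled roughly as follows. This is only an illustrative sketch in Python, not the schema of the licensed files: the class and field names (TermLanguage, Term, HierarchicalRelationship, language_status, and so on) are assumptions made for the example, and the Term Type values assigned to "gouache" are invented. The data dictionaries remain the authority for actual field names.

```python
from dataclasses import dataclass, field
from enum import Enum
from typing import Optional


class HierRelType(Enum):
    """Thesaural codes for the relationship between child and parent."""
    BTP = "part/whole (partitive)"
    BTG = "genus/species (generic)"
    BTI = "instance of"


@dataclass
class TermLanguage:
    """One language link for a term (hypothetical name); repeats per term."""
    language: str
    term_type: str                          # e.g. "Descriptor" or "Used For"
    qualifier: Optional[str] = None         # now stored per language
    language_status: Optional[str] = None   # reserved for flagging loan terms


@dataclass
class Term:
    text: str
    languages: list[TermLanguage] = field(default_factory=list)

    def display(self, language: str) -> str:
        """Return the term with its qualifier in parentheses, if it has one."""
        for link in self.languages:
            if link.language == language:
                if link.qualifier:
                    return f"{self.text} ({link.qualifier})"
                return self.text
        raise KeyError(language)


@dataclass
class HierarchicalRelationship:
    child_id: int
    parent_id: int
    rel_type: HierRelType   # a separate field, no longer in the historical flag


# The term types here are assumed for illustration only.
gouache = Term("gouache", [
    TermLanguage("English", "Descriptor", "paint"),
    TermLanguage("Spanish", "Descriptor", "pintura"),
])
print(gouache.display("English"))
print(gouache.display("Spanish"))
```

Keeping the qualifier and term type on the language link, rather than on the term itself, is what allows the same term string to carry a different qualifier or a different role in each language.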

Data dictionaries: Data dictionaries for the 2009 release are available by clicking the links below. Note that new data dictionaries, with changes to the fields noted above, will be posted for the 2010 release. The documentation below does not give step-by-step instructions on how to construct a database or interface based on the data files; analysis and a competent programmer will be required to implement the vocabulary data files. The Getty does not provide technical support. For details regarding data content and editorial rules, see the Editorial Guidelines.

Persistent IDs: We are pleased to announce a significant improvement in our data releases. We have implemented new functionalities in our editorial system that will result in more persistent IDs for our vocabulary records. Previously, although each record had a unique numeric ID, the ID would change when new records were "merged" with existing records, and in other rare situations. While licensees of Getty vocabulary data received annual mappings of old IDs to new ones, our user community was anxious to have a more persistent ID for the Getty vocabulary records over time. The new merge process, implemented in January 2008, results in the ID of the original vocabulary record being maintained when a new record is merged into it. Other editorial situations may occasionally require the generation of new IDs (e.g., when one existing record is divided into two records); for these rare cases, a mapping of the old IDs to the new ones will continue to be published with the annual releases.
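
Licensees who store Getty record IDs locally may want to apply the published old-to-new mappings when loading an annual release. The following is a minimal sketch of one way to do that, assuming the mapping has already been loaded into a Python dictionary of old ID to new ID. The function name resolve_id and the sample IDs are hypothetical, and the sketch assumes each old ID maps to a single new ID; a record divided into two records would need a one-to-many structure instead.

```python
def resolve_id(record_id: int, id_map: dict[int, int]) -> int:
    """Follow old-to-new mappings until an ID with no successor is reached."""
    seen = set()
    current = record_id
    while current in id_map:
        if current in seen:
            # Guard against a malformed mapping that loops back on itself.
            raise ValueError(f"cycle in ID mapping at {current}")
        seen.add(current)
        current = id_map[current]
    return current


# Hypothetical IDs, for illustration only.
id_map = {1001: 2001, 2001: 3001}
print(resolve_id(1001, id_map))  # follows 1001 -> 2001 -> 3001
print(resolve_id(42, id_map))    # unmapped IDs are returned unchanged
```

Following the chain to its end means that mappings accumulated over several annual releases can be merged into one dictionary and still resolve to the current ID.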

Web services: The Getty Vocabulary Program, together with Getty Information Technology Services (ITS), is developing a set of Web services APIs (application programming interfaces) to enable access to the most up-to-date version of the Getty vocabulary data in real time. The APIs are being Beta tested during 2009. Please read details about the project in Overview of the Vocabulary APIs (pdf) and the Web Services User's Instructions (pdf). To volunteer your institution as a Beta tester, please write to vocab@getty.edu.

Data currently available: The data currently available for licensing was cut in May 2009; the next release is scheduled to occur in July 2010, or slightly earlier. Diacritic codes in the data may be translated into Unicode using the master code table.
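
As a rough illustration of translating diacritic codes into Unicode, the Python sketch below replaces each $xx code with a combining character and then normalizes the result. It rests on several assumptions that should be checked against the master code table: the two entries in CODE_TABLE are placeholders rather than actual Getty codes, the codes are assumed to be two digits, and each code is assumed to precede the letter it modifies.

```python
import re
import unicodedata

# Placeholder entries only: the real code-to-character pairs must be
# taken from the master code table.
CODE_TABLE = {
    "00": "\u0301",  # assumed: combining acute accent
    "01": "\u0300",  # assumed: combining grave accent
}

# A "$", two digits, then the character the code applies to (assumed order).
_CODE = re.compile(r"\$(\d{2})(.)", re.DOTALL)


def to_unicode(text: str, table: dict[str, str] = CODE_TABLE) -> str:
    """Replace $xx codes with combining marks, then compose to NFC."""
    def repl(match: re.Match) -> str:
        mark = table.get(match.group(1))
        if mark is None:
            return match.group(0)          # unknown code: leave untouched
        return match.group(2) + mark       # base letter followed by its mark
    return unicodedata.normalize("NFC", _CODE.sub(repl, text))


print(to_unicode("caf$00e"))
```

Leaving unknown codes untouched makes gaps in the table easy to spot in the output instead of silently dropping characters.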

Download sample records from the Art & Architecture Thesaurus (AAT). The full AAT is a hierarchical vocabulary of around 34,000 records, including 131,000 terms, descriptions, bibliographic citations, and other information relating to fine art, architecture, decorative arts, archival materials, archaeology, and other material culture. The full XML data file for the AAT is 17,323 kilobytes in size. The sample available here is a small subset of the AAT. For further information, see About the AAT.

AAT Sample Data

Format options                    Data Dictionary for the AAT Data Release
XML (1.12 MB)                     XML format PDF
XML UTF-8 (1.12 MB)               XML format PDF
Relational Table (2.58 MB)        Relational Table format PDF
Relational Table UTF-8 (2.59 MB)  Relational Table format PDF
MARC (1.24 MB)                    MARC format PDF

Download sample records from the Union List of Artist Names (ULAN). The full ULAN is a vocabulary of around 120,000 records, including 293,000 names and biographical and bibliographic information for artists, architects, firms, shops, and art repositories, including a wealth of variant names, pseudonyms, and language variants. The full XML data file for the ULAN is 65,062 kilobytes in size. The sample available here is a small subset of the ULAN. For further information, see About the ULAN.

ULAN Sample Data

Format options                    Data Dictionary for the ULAN Data Release
XML (10.2 MB)                     XML format PDF
XML UTF-8 (10.3 MB)               XML format PDF
Relational Table (49.2 MB)        Relational Table format PDF
Relational Table UTF-8 (49.3 MB)  Relational Table format PDF
MARC (12.1 MB)                    MARC format PDF

Download sample records from the Getty Thesaurus of Geographic Names (TGN). The full TGN is a hierarchical vocabulary of around 912,000 records, including 1.1 million names, place types, coordinates, and descriptive notes, focusing on places important for the study of art and architecture. TGN is not a GIS; coordinates are included for many records, but they are for finding purposes only (i.e., to find the place on a map). The full XML data file for the TGN is 117,066 kilobytes in size. The sample available here is a small subset of the TGN. For further information, see About the TGN.

TGN Sample Data

Format options                    Data Dictionary for the TGN Data Release
XML (1.65 MB)                     XML format PDF
XML UTF-8 (1.65 MB)               XML format PDF
Relational Table (1.85 MB)        Relational Table format PDF
Relational Table UTF-8 (1.86 MB)  Relational Table format PDF

Data is copyrighted: The Art & Architecture Thesaurus® (AAT), the Union List of Artist Names® (ULAN), and the Getty Thesaurus of Geographic Names® (TGN) are copyrighted by the J. Paul Getty Trust. Companies and institutions interested in regular or extensive use of the vocabularies should explore licensing options by sending an email to vocab@getty.edu.

No warranties by Getty: The databases are provided "as is." Getty disclaims all other warranties, either express or implied, including, but not limited to, implied warranties of merchantability and fitness for a particular purpose, with respect to the databases.



