TOOLS: MetaMap

Additional DataSets

These optional datasets are configured for the 2012 and later versions of MetaMap, so they will not work with releases of MetaMap prior to 2012 unless otherwise specified. See the documentation on "Using Additional Datasets with Public MetaMap" for information on installing optional datasets.

A description of the contents of the Default (USAbase) and Optional Datasets is available on the "Description of MetaMap Data Versions" page. Other DataSets including 2006, 1999, and non-UMLS data sets are listed on the Optional DataSets Page.

Covid-19/SARS-CoV-2 Strings in the 2020AA and Later DataSets

A note about the inclusion of Covid-19/SARS-Cov-2 strings in the 2020AA and later data sets

2023 Specialist Lexicon

The 2023 DB Lexicon is needed for the following 2023 Linux and Mac OS/X datasets.

2023AB UMLS Base Datasets

This dataset includes all and only sources of Restriction Category 0. See also, MetaMap FAQ: MetaMap's Base, USAbase, and NLM Data Versions.
Dataset Files

2023AB UMLS USAbase Datasets

This dataset includes the Base vocabularies (those with Restriction Category 0), plus the five Category-4 sources and the four Category-9 sources (including SNOMEDCT). See also, MetaMap FAQ: MetaMap's Base, USAbase, and NLM Data Versions.
Dataset Files

2023AB UMLS NLM Datasets

The NLM data version includes the full Metathesaurus. Dataset versions prior to 2015AA of the NLM model excluded the CPT, CPTSP, HCPT, and MTHCH vocabularies from the CPT family, and the HCDT, HCPCS, and MTHHH vocabularies from the HCPCS family. See also, MetaMap FAQ: MetaMap's Base, USAbase, and NLM Data Versions.
Dataset Files

2023AA UMLS Base Datasets

This dataset includes all and only sources of Restriction Category 0. See also, MetaMap FAQ: MetaMap's Base, USAbase, and NLM Data Versions.
Dataset Files

2023AA UMLS USAbase Datasets

This dataset includes the Base vocabularies (those with Restriction Category 0), plus the five Category-4 sources and the four Category-9 sources (including SNOMEDCT). See also, MetaMap FAQ: MetaMap's Base, USAbase, and NLM Data Versions.
Dataset Files

2023AA UMLS NLM Datasets

The NLM data version includes the full Metathesaurus. Dataset versions prior to 2015AA of the NLM model excluded the CPT, CPTSP, HCPT, and MTHCH vocabularies from the CPT family, and the HCDT, HCPCS, and MTHHH vocabularies from the HCPCS family. See also, MetaMap FAQ: MetaMap's Base, USAbase, and NLM Data Versions.
Dataset Files

2022 Specialist Lexicon

The 2022 DB Lexicon is needed for the following 2022 Linux and Mac OS/X datasets.

2022AB UMLS Base Datasets

This dataset includes all and only sources of Restriction Category 0. See also, MetaMap FAQ: MetaMap's Base, USAbase, and NLM Data Versions.
Dataset Files

2022AB UMLS USAbase Datasets

This dataset includes the Base vocabularies (those with Restriction Category 0), plus the five Category-4 sources and the four Category-9 sources (including SNOMEDCT). See also, MetaMap FAQ: MetaMap's Base, USAbase, and NLM Data Versions.
Dataset Files

2022AB UMLS NLM Datasets

The NLM data version includes the full Metathesaurus. Dataset versions prior to 2015AA of the NLM model excluded the CPT, CPTSP, HCPT, and MTHCH vocabularies from the CPT family, and the HCDT, HCPCS, and MTHHH vocabularies from the HCPCS family. See also, MetaMap FAQ: MetaMap's Base, USAbase, and NLM Data Versions.
Dataset Files

2022AA UMLS Base Datasets

This dataset includes all and only sources of Restriction Category 0. See also, MetaMap FAQ: MetaMap's Base, USAbase, and NLM Data Versions.
Dataset Files

2022AA UMLS USAbase Datasets

This dataset includes the Base vocabularies (those with Restriction Category 0), plus the five Category-4 sources and the four Category-9 sources (including SNOMEDCT). See also, MetaMap FAQ: MetaMap's Base, USAbase, and NLM Data Versions.
Dataset Files

2022AA UMLS NLM Datasets

The NLM data version includes the full Metathesaurus. Dataset versions prior to 2015AA of the NLM model excluded the CPT, CPTSP, HCPT, and MTHCH vocabularies from the CPT family, and the HCDT, HCPCS, and MTHHH vocabularies from the HCPCS family. See also, MetaMap FAQ: MetaMap's Base, USAbase, and NLM Data Versions.
Dataset Files

2021 Specialist Lexicon

The 2021 DB Lexicon is needed for the following 2021 Linux and Mac OS/X datasets.

2021AB UMLS Base Datasets

This dataset includes all and only sources of Restriction Category 0. See also, MetaMap FAQ: MetaMap's Base, USAbase, and NLM Data Versions.
Dataset Files

2021AB UMLS USAbase Datasets

This dataset includes the Base vocabularies (those with Restriction Category 0), plus the five Category-4 sources and the four Category-9 sources (including SNOMEDCT). See also, MetaMap FAQ: MetaMap's Base, USAbase, and NLM Data Versions.
Dataset Files

2021AB UMLS NLM Datasets

The NLM data version includes the full Metathesaurus. Dataset versions prior to 2015AA of the NLM model excluded the CPT, CPTSP, HCPT, and MTHCH vocabularies from the CPT family, and the HCDT, HCPCS, and MTHHH vocabularies from the HCPCS family. See also, MetaMap FAQ: MetaMap's Base, USAbase, and NLM Data Versions.
Dataset Files

2021AA UMLS Base Datasets

This dataset includes all and only sources of Restriction Category 0. See also, MetaMap FAQ: MetaMap's Base, USAbase, and NLM Data Versions.
Linux and Mac OS/X Datasets

2021AA UMLS USAbase Datasets

This dataset includes the Base vocabularies (those with Restriction Category 0), plus the five Category-4 sources and the four Category-9 sources (including SNOMEDCT). See also, MetaMap FAQ: MetaMap's Base, USAbase, and NLM Data Versions.

Note: The the 2021AA USAbase Strict and Base Data Sets are supplied in the MetaMap 2017 Main Distribution.

Linux and Mac OS/X Datasets

2021AA UMLS NLM Datasets

The NLM data version includes the full Metathesaurus. Dataset versions prior to 2015AA of the NLM model excluded the CPT, CPTSP, HCPT, and MTHCH vocabularies from the CPT family, and the HCDT, HCPCS, and MTHHH vocabularies from the HCPCS family. See also, MetaMap FAQ: MetaMap's Base, USAbase, and NLM Data Versions.
Linux and Mac OS/X Datasets

2020 Specialist Lexicon

The 2020 DB Lexicon is needed for the following 2020 Linux and Mac OS/X datasets.

2020AB UMLS Base Datasets

This dataset includes all and only sources of Restriction Category 0. See also, MetaMap FAQ: MetaMap's Base, USAbase, and NLM Data Versions.
Linux and Mac OS/X Datasets

2020AB UMLS USAbase Datasets

This dataset includes the Base vocabularies (those with Restriction Category 0), plus the five Category-4 sources and the four Category-9 sources (including SNOMEDCT). See also, MetaMap FAQ: MetaMap's Base, USAbase, and NLM Data Versions.

Note: The the 2020AB USAbase Strict and Base Data Sets are supplied in the MetaMap 2017 Main Distribution.

Linux and Mac OS/X Datasets

2020AB UMLS NLM Datasets

The NLM data version includes the full Metathesaurus. Dataset versions prior to 2015AA of the NLM model excluded the CPT, CPTSP, HCPT, and MTHCH vocabularies from the CPT family, and the HCDT, HCPCS, and MTHHH vocabularies from the HCPCS family. See also, MetaMap FAQ: MetaMap's Base, USAbase, and NLM Data Versions.
Linux and Mac OS/X Datasets

2020AA UMLS Base Datasets

This dataset includes all and only sources of Restriction Category 0. See also, MetaMap FAQ: MetaMap's Base, USAbase, and NLM Data Versions.
Linux and Mac OS/X Datasets

2020AA UMLS USAbase Datasets

This dataset includes the Base vocabularies (those with Restriction Category 0), plus the five Category-4 sources and the four Category-9 sources (including SNOMEDCT). See also, MetaMap FAQ: MetaMap's Base, USAbase, and NLM Data Versions.

Note: The the 2020AA USAbase Strict and Base Data Sets are supplied in the MetaMap 2017 Main Distribution.

Linux and Mac OS/X Datasets

2020AA UMLS NLM Datasets

The NLM data version includes the full Metathesaurus. Dataset versions prior to 2015AA of the NLM model excluded the CPT, CPTSP, HCPT, and MTHCH vocabularies from the CPT family, and the HCDT, HCPCS, and MTHHH vocabularies from the HCPCS family. See also, MetaMap FAQ: MetaMap's Base, USAbase, and NLM Data Versions.
Linux and Mac OS/X Datasets

2019 Specialist Lexicon

The 2019 DB Lexicon is needed for the following 2019 Linux and Mac OS/X datasets.

2019AB UMLS Base Datasets

This dataset includes all and only sources of Restriction Category 0. See also, MetaMap FAQ: MetaMap's Base, USAbase, and NLM Data Versions.
Linux and Mac OS/X Datasets

2019AB UMLS USAbase Datasets

This dataset includes the Base vocabularies (those with Restriction Category 0), plus the five Category-4 sources and the four Category-9 sources (including SNOMEDCT). See also, MetaMap FAQ: MetaMap's Base, USAbase, and NLM Data Versions.

Note: The the 2019AB USAbase Strict and Base Data Sets are supplied in the MetaMap 2017 Main Distribution.

Linux and Mac OS/X Datasets

2019AB UMLS NLM Datasets

The NLM data version includes the full Metathesaurus. Dataset versions prior to 2015AA of the NLM model excluded the CPT, CPTSP, HCPT, and MTHCH vocabularies from the CPT family, and the HCDT, HCPCS, and MTHHH vocabularies from the HCPCS family. See also, MetaMap FAQ: MetaMap's Base, USAbase, and NLM Data Versions.
Linux and Mac OS/X Datasets

2019AA UMLS Base Datasets

This dataset includes all and only sources of Restriction Category 0. See also, MetaMap FAQ: MetaMap's Base, USAbase, and NLM Data Versions.
Linux and Mac OS/X Datasets

2019AA UMLS USAbase Datasets

This dataset includes the Base vocabularies (those with Restriction Category 0), plus the five Category-4 sources and the four Category-9 sources (including SNOMEDCT). See also, MetaMap FAQ: MetaMap's Base, USAbase, and NLM Data Versions.

Note: The the 2019AA USAbase Strict and Base Data Sets are supplied in the MetaMap 2017 Main Distribution.

Linux and Mac OS/X Datasets

2019AA UMLS NLM Datasets

The NLM data version includes the full Metathesaurus. Dataset versions prior to 2015AA of the NLM model excluded the CPT, CPTSP, HCPT, and MTHCH vocabularies from the CPT family, and the HCDT, HCPCS, and MTHHH vocabularies from the HCPCS family. See also, MetaMap FAQ: MetaMap's Base, USAbase, and NLM Data Versions.
Linux and Mac OS/X Datasets

2018 Specialist Lexicon

The 2018 DB Lexicon is needed for the following 2018 Linux and Mac OS/X datasets.

2018AB UMLS Base Datasets

This dataset includes all and only sources of Restriction Category 0. See also, MetaMap FAQ: MetaMap's Base, USAbase, and NLM Data Versions.
Linux and Mac OS/X Datasets

2018AB UMLS USAbase Datasets

This dataset includes the Base vocabularies (those with Restriction Category 0), plus the five Category-4 sources and the four Category-9 sources (including SNOMEDCT). See also, MetaMap FAQ: MetaMap's Base, USAbase, and NLM Data Versions.

Note: The the 2018AB USAbase Strict and Base Data Sets are supplied in the MetaMap 2017 Main Distribution.

Linux and Mac OS/X Datasets

2018AB UMLS NLM Datasets

The NLM data version includes the full Metathesaurus. Dataset versions prior to 2015AA of the NLM model excluded the CPT, CPTSP, HCPT, and MTHCH vocabularies from the CPT family, and the HCDT, HCPCS, and MTHHH vocabularies from the HCPCS family. See also, MetaMap FAQ: MetaMap's Base, USAbase, and NLM Data Versions.
Linux and Mac OS/X Datasets

2018AA UMLS Base Datasets

This dataset includes all and only sources of Restriction Category 0. See also, MetaMap FAQ: MetaMap's Base, USAbase, and NLM Data Versions.
Linux and Mac OS/X Datasets

2018AA UMLS USAbase Datasets

This dataset includes the Base vocabularies (those with Restriction Category 0), plus the five Category-4 sources and the four Category-9 sources (including SNOMEDCT). See also, MetaMap FAQ: MetaMap's Base, USAbase, and NLM Data Versions.

Note: The the 2018AA USAbase Strict and Base Data Sets are supplied in the MetaMap 2017 Main Distribution.

Linux and Mac OS/X Datasets

2018AA UMLS NLM Datasets

The NLM data version includes the full Metathesaurus. Dataset versions prior to 2015AA of the NLM model excluded the CPT, CPTSP, HCPT, and MTHCH vocabularies from the CPT family, and the HCDT, HCPCS, and MTHHH vocabularies from the HCPCS family. See also, MetaMap FAQ: MetaMap's Base, USAbase, and NLM Data Versions.
Linux and Mac OS/X Datasets

Additional DataSets are available; these are listed on /nfsvol/cgsb_share2/ind/ind1/ind_repository the Optional DataSets Page.