• Help
    Discussion forum
    Search tips
  • About
    CERN Open Data
    ALICE
    ATLAS
    CMS
    DELPHI
    JADE
    LHCb
    OPERA
    TOTEM
    Glossary

Important notice: opendata-qa.cern.ch is a quality-assurance server. Please use it for testing purposes only. The content may be erased from time to time. Please use opendata.cern.ch for production.

CERN Open Data search tips

2024-01-31 by CERN Open Data team

Documentation Help


The search engine of the CERN Open Data Portal allows different types of queries. The most common filters can be applied with the selection boxes on the left: experiment, type of record, file format, etc...

The search bar offers more flexibility, and it allows to search by any terms. By default, it will do an AND of all the terms that have been introduced. For instance, a search for heavy ion will return only documents that contain both words. The default behaviour can be modified using the reserved word AND, OR and the parenthesis (, ). Note that this is case sensitive. Some examples of queries are:

  • heavy ion: returns documents that contain both words.
  • heavy ion electron: returns documents that contain the three words.
  • heavy ion AND electron: identical to the previous case, it will return documents that contain the three words.
  • heavy ion OR electron: returns documents that contain the word heavy, and either ion or electron.
  • (heavy ion) OR electron: returns documents that contain either heavy ion or electron.

It is possible to filter out results using the -. A couple more examples:

  • heavy ion -electron: returns documents that contain heavy ion and do not contain electron.
  • -electron: returns all the documents that do not contain the word `electron.
  • -"heavy ion" electron: returns documents that contain electron and not heavy ion.

On top of that, it is also possible to search on specific fields. For instance, take a look at the following queries:

  • authors.orcid:0000-0002-4485-2972: returns all the documents from that author.
  • 10.7483/OPENDATA.CMS.W26R.J96R: returns all documents with those terms in any field.
  • doi:10.7483/OPENDATA.CMS.W26R.J96R: returns the document with that Digital Object Identifier (doi)
  • doi:10.7483*: returns all the documents with a doi starting with those values.

Searches based on the title are a bit trickier, since the title is stored in two formats (one for searching and a different one for sorting). Here you have a couple of examples searching in the title:

  • title.tokens:muon: returns all the documents where the title contains that word. Note that words are delimited by spaces or special symbols (like '-'). Therefore, di-moun will also match. dimoun will not.
  • title.tokens:*muon: similar to the previous one. It will return the documents where the title contains a word finishing with muon (di-muon, dimuon and muon will match).
  • title.tokens:*muon*: documents where any word in the title contains the sequence muon will be returned (DoubleMuon, MuonEG, ...).

The syntax for these queries is defined by the OpenSearch query_string, which is based in the Apache Lucene syntax

ALICE experiment
ATLAS experiment
CMS experiment
DELPHI experiment
JADE experiment
LHCb experiment
OPERA experiment
PHENIX experiment
TOTEM experiment
© CERN, 2014–2025 ·
Terms of Use ·
Privacy Policy ·
Help ·
GitHub ·
Twitter ·
Email
Powered by Invenio
Open Data Portal v0.3.0
CERN