EDRM Search Project Print
  • RSS
  • Twitter
  • Add to favorites
  • LinkedIn
  • Facebook
  • Google Bookmarks

Search Guide Glossary

This entry is part of a series, EDRM Search Guide»

« Previous: “Appendix 2: Application of Sampling to E-Discovery Search Result Evaluation”
» Next: “Search Guide Versions”

Boolean Search

A search technique that utilizes Boolean Logic, using terms such as AND, OR, and NOT. See alsoBoolean search“.

Concept Search

A search technique that provides words which are similar in concept to a query word. A concept search will return documents that relate to the same concept as the query word, regardless of whether the query word exists in the search results documents. Concept searches can be implemented as a simple thesaurus match, or by using sophisticated statistical analysis methods. See alsoConcept search“.

ESI

Electronically Stored Information.

Fuzzy Search

A search technique that identifies ESI based on terms close to another term, with closeness defined as a typographical difference and/or change. For example, snitch, switch, and swanky can all match swatch, depending on how many incorrect letters are allowed within the search threshold. See alsoFuzzy search“.

Inverted Index

An index that maps a keyword to the list of documents that contain the keyword. See alsoInverted index“.

Keyword Index

A technique that examines the ESI and builds a searchable electronic index. This index typically maps from a keyword to all the documents that contain the keyword. See alsoKeyword index“.

Keyword Search

A very common search technique that uses query words (“keywords”) and looks for them in ESI, using an index. See alsoKeyword search“.

Privileged Documents

A set of documents that a Producing Party is not required to provide, since they fall into Privilege such as Attorney-Client Privilege. The existence of such documents should be recorded in the Privilege Log. See alsoPrivileged documents“.

Privilege Log

A set of documents that a Producing Party did not produce on account of Privilege such as Attorney-Client Privilege. See alsoPrivilege log“.

Phrase Search

A search consisting of multiple keywords separated by spaces to form a single phrase. For a document to match this search, the entire phrase as entered must be contained within the document. See alsoPhrase search“.

Producing Party

A party that owns the complete collection of ESI, and is responsible for producing a portion of the ESI that is deemed to be relevant for a legal case or legal enquiry. See alsoProducing party“.

Proximity Search

A Proximity Search searches for multiple keywords. The matching documents must contain all the keywords, with the keywords occurring within a specified number of words from each other. See alsoProximity search“.

RDBMS

Relational Database Management System. This is a technical term for the class of software programs that manage data using a relational schema, such as Microsoft SQL Server or Oracle. See alsoRDBMS“.

Regular Expressions

A pattern that describes what the search should return based on special characters added to the keyword. For example, car* uses the character * as a wildcard, and the resulting documents should contain words that begin with the characters “car”, such as car, cartoon, or cartography. See alsoRegular expressions“.

Relevancy Rank

A measurement of relevancy of a document, so that the Search Hits within a Search Results can be ordered. Relevancy measurements often involve counting the number of occurrences of a keyword within a document, as well as number of documents a keyword is found in. See alsoRelevancy rank“.

Requesting Party

A party that does not own the ESI and is requesting that the Producing Party which owns the ESI to provide some subset of the ESI based on a Search Request. See alsoRequesting party“.

Responsive Documents

A subset of ESI that matches the desired set of documents for the case. See alsoResponsive file“.

Search Engine

A search component that implements the actual process of interpreting a search request and identifying subsets of documents. For example, a database management system such as Microsoft SQL Server contains a component that manages searches of the data stored in its databases. See alsoSearch engine“.

Search Hit

A document in the ESI that is considered to match the requested Search Query. See alsoSearch hit“.

Search Query

A well-formulated Search request that an automated search engine can interpret in order to produce matching results. See alsoSearch query“.

Search Results

A collection of Search Hits that match the intended documents of a Search Request. See alsoSearch results“.

Synonym Search

A synonym search returns documents that contain terms similar in meaning to the query words, usually using a thesaurus to determine which terms would match the query words. See alsoSynonym search“.

Stemming

A search option that returns matches for all variations of the root word of the initial query word. For example, if the query word was sing, then if a search used stemming the search results would match singing, sang, sung, song, and songs as well as sing. See alsoStemming“.

Tokenization

An operation that examines a document or block of text and breaks the text into words. Typically, a space is used to separate words, but special characters such as a hyphen, period, or quotation mark can also be used. See alsoTokenization“.

Truncation

A Search Specification that indicates that matching documents must contain words that begin with the letters entered, but that the matching words can end with any combination of letters. See alsoTruncation“.

Wildcards

Symbols such as * or ? included within a Keyword to indicate that the location where the symbols are used may match a single letter or multiple letters. See alsoWildcard search“.

« Previous: “Appendix 2: Application of Sampling to E-Discovery Search Result Evaluation”
» Next: “Search Guide Versions”

Leave a comment

Go to top

 

 

 

You can use these HTML tags

<a href="" title=""> <abbr title=""> <acronym title=""> <b> <blockquote cite=""> <cite> <code> <del datetime=""> <em> <i> <q cite=""> <strike> <strong>