# Databases vs. Information Retrieval

**Information Retrieval** is concerned with the representation, storage, organization of, and access to information items.

* Focus on automatic processing (indexing, clustering, search) of **unstructured** data (text, images, audio, ...)
* Some applications:
  * searching in a library catalog
  * categorizing a collection of articles by area
  * web search engines etc

The main difference between databases and IR is that databases focus on structured data while IR focuses mainly on unstructured data ("documents") such as web pages, emails, images etc.

Also, databases are concerned with *data retrieval*, not *information retrieval*.

Additionally, while IR is focused on the human user, databases are designed to allow applications to be built on top of them which can then be used by human users.


---

# Agent Instructions: Querying This Documentation

If you need additional information that is not directly available in this page, you can query the documentation dynamically by asking a question.

Perform an HTTP GET request on the current page URL with the `ask` query parameter:

```
GET https://vikram-bajaj.gitbook.io/cs-gy-6083-principles-of-database-systems/databases-vs-information-retrieval.md?ask=<question>
```

The question should be specific, self-contained, and written in natural language.
The response will contain a direct answer to the question and relevant excerpts and sources from the documentation.

Use this mechanism when the answer is not explicitly present in the current page, you need clarification or additional context, or you want to retrieve related documentation sections.
