1.Business use data to derive information that is critical to their day-to-day operations.
2.Storage is a repository that enables uses to store & retrieve this digital data.
1.It is a collection of raw facts from which conclusions may be drown.
2.Data in the form of 0's & 1's is called digital data & is accessible by the user only after it is processed by a computer.
3.While the advancement of Computer & Communication technologies, the rate of data generation & sharing has increased exponentially.
Types of data:-
Data can be classified as structured or unstructured based on how it is stored & managed.
1.Data created by individuals or business must be stored so that it is easily accessible for futher processing.
2.In a computing environment, devices designed for storing data are termed storage devices or simply storage.
3.The types of storage used varies based on the type of data & the rate at which it is created & used.
Information Storage Systems
They are built by taking into considerations the basis capability of a storage device, such as HDD & adding hardware & software to obtain high performing, reliable & easily managed system.
Information Retrieval Systems
1.Informative retrieval system is a system with a user interface that provides the facility for the user to create, search & modify the data stored in a storage network.
2.This is typically a peer-to-peer network which is operated & maintained by private organisations, however access rights are provided to the public.
3.The access can be performed via Internet from outside the organisation & via Intranet within the organisations.
4.Information Storage & IR are addressed as two sides of the same cover.
5.If a person is able to search the required information, then that information must have already been stored in same format. The format in which the information is often represented to the people can be either techs, images, audios or videos, which makes it different to obtain clear & precise answer to multiple questions which the users may require. Searching a document involves a collection of information which may be either easy or complicated depending on how the collection is organised.
6.Almost all of the IR systems fielded today are either Boolean IR systems for major document collections or text pattern search systems for handling small document collections (for ex: personal collections of files).
7.Text patterns search queries are strings or regular expressions. The grep family of tools, in the UNIX environment is a well-known example of text pattern searchers.
8.In Boolean IR System, documents are represented by sets of keywords, usually stored in an inverted file. In inverted file is a list of keywords & identifiers of the documents in which they occur. Boolean queries are keyword connected with Boolean logically operations (AND,OR,NOT).
9.Conceptual models facet focuses on the performance enhancements of IR systems neither the information associated with statistical distribution of terms.
10.The statistical models such as vector space, probabilistic or clustering models do the statistical distribution of terms where every documents in retrieved collection is allocated with probability of relevance.