Data science is an interdisciplinary field that uses scientific methods, processes, algorithms, and systems to extract knowledge and insights from structured and unstructured data. It is closely related to data mining and big data. As you can see, data science involves working with data, whether structured or unstructured.

So, let me ask you a question...

What is data?

Data are characteristics or information, usually numerical, that are collected through observation. In a more technical sense, data is a set of values of qualitative or quantitative variables about one or more persons or objects, while a datum is a single value of a single variable. In layman's terms, data are facts and statistics collected together for reference or analysis.
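
To make these definitions concrete, here is a minimal sketch in Python using only the standard library. The records, names, and values are made up purely for illustration; they are not real observations.

```python
from collections import Counter
from statistics import mean

# Hypothetical observations: each dict is one person, each key is a variable.
records = [
    {"name": "Ada", "age": 36, "city": "London"},
    {"name": "Grace", "age": 45, "city": "New York"},
    {"name": "Alan", "age": 41, "city": "London"},
]

# A datum is a single value of a single variable.
datum = records[0]["age"]

# Quantitative variable: numeric values we can do arithmetic on.
ages = [r["age"] for r in records]
average_age = mean(ages)

# Qualitative variable: categories we can count but not average.
city_counts = Counter(r["city"] for r in records)

print(datum)
print(average_age)
print(city_counts)
```

Here "age" plays the role of a quantitative variable and "city" a qualitative one, while the whole list of records is the data set.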

Techniques used in data science

  1. Clustering
  2. Dimensionality reduction
  3. Machine learning
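
To give a feel for the first technique, clustering, here is a rough sketch of a naive k-means on plain numbers, written in pure Python. The function name `k_means_1d`, the sample points, and the starting centers are all assumptions chosen for illustration. In practice you would more likely reach for a library implementation than write this by hand.

```python
def k_means_1d(points, centers, iterations=10):
    """Naive k-means on a list of numbers, starting from the given centers."""
    centers = list(centers)
    groups = []
    for _ in range(iterations):
        # Assignment step: put each point in the group of its nearest center.
        groups = [[] for _ in centers]
        for p in points:
            nearest = min(range(len(centers)), key=lambda i: abs(p - centers[i]))
            groups[nearest].append(p)
        # Update step: move each center to the mean of its group.
        # A center with no points assigned stays where it is.
        centers = [
            sum(g) / len(g) if g else c
            for g, c in zip(groups, centers)
        ]
    return centers, groups


# Made-up sample data with two visible clumps of values.
points = [1.0, 1.5, 2.0, 9.0, 10.0, 11.0]
centers, groups = k_means_1d(points, centers=[0.0, 5.0])

print(centers)
print(groups)
```

The idea is that points close to each other end up in the same group without anyone labelling them beforehand, which is what makes clustering useful for exploring unfamiliar data.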

Just as data science has its techniques, it also has its technologies. By technologies, I mean the programming languages and tools used for it:

  1. Python - is a programming language with simple syntax that is commonly used for data science. A number of Python libraries are used in data science, including NumPy, pandas, and SciPy.
  2. R - is a programming language that was designed for statistical computing and data mining.
  3. TensorFlow - is a framework for creating machine learning models, developed by Google.
  4. PyTorch - is another framework for machine learning, developed by Facebook (now Meta).

