Cos'è la Data Science e a cosa serve

Blog IT Impresa - Data Science, la scienza alla base dei Big Data Analytics

Di : Alessandro Achilli 1 Settembre 2020

Il termine Data Science non è recente, risale al 1974 quando l’informatico Peter Naur lo usò nel suo libro “Concise Survey of Computer Methods” spiegando il cambiamento e l’evoluzione delle discipline legate all’informatica che, secondo lui, si sarebbero sempre più avvicinate a quella che prima identificò come “datalogy”, disciplina che poi rivisitò in scienza dei dati, Data Science appunto.

Tuttavia, negli anni recenti, diciamo più o meno nell’ultimo ventennio, il termine è tornato alla ribalta grazie ai fari puntati su Big Data e, più nello specifico, sui Big Data Analytics, ed alla necessità per le aziende di affidarsi, per una parte dell’analisi dei dati, a nuove competenze, quelle dei Data Scientist (nonché al fatto che dal 2001 la Data Science è diventata disciplina a sé stante, staccandosi definitivamente dall’alveo delle discipline informatiche e matematico-statistiche).

Indice dei contenuti

Cos’è e cosa significa esattamente Data Science?

Secondo quanto riportato dalla libera enciclopedia globale Wikipedia, la Data Science è un insieme di principi metodologici (basati sul metodo scientifico) e di tecniche multidisciplinari fondamentali per interpretare, analizzare ed estrarre conoscenza dai dati.

I principi metodologici della scienza dei dati sono spesso associati al cosiddetto Data Mining e sfruttano, come accennato, tecniche multidisciplinari coniugando saperi da più fonti quali matematica, statistica, scienza dell’informazione, informatica e persino scienze sociali.

Come anticipato, nonostante il termine fece la sua comparsa nei primi anni ’70 del secolo scorso, si è dovuto attendere il nuovo millennio per offrire alla Data Science un posto tutto suo nelle discipline scientifiche; nel 2001 uscì dalla branca dell’informatica e della statistica e William Cleveland ne delineò i campi di competenza, elencando sei diverse aree: ricerca multidisciplinare, modelli, elaborazione dati, pedagogia, valutazione degli strumenti e teoria.

Da allora, in meno di vent’anni, la disciplina è in realtà molto evoluta, soprattutto con l’avvento dei Big Data e l’attenzione si è sempre più focalizzata sul “valore dei dati” anziché sulla sua mera gestione. La Data Science è così diventata una scienza olistica che comprende ancora ambiti quali l’informatica, la statistica e la matematica, come nell’accezione originale, ma cui si sono aggiunte competenze di tipo più ampio, manageriali e di business, legate alla più recente necessità di sapere leggere, interpretare e capitalizzare i dati per prendere decisioni più efficaci (da qui la sua strettissima correlazione con gli Analytics).

Chi è e cosa fa il Data Scientist

Via via che la Data Science ha preso piede come scienza multidisciplinare, anche le competenze ad essa collegate si sono evolute facendo “nascere” ed evolvere nuove figure professionali, come quella del Data Scientist, definita qualche anno fa dall’Harvard Business Review come la professione più sexy del ventunesimo secolo (per le enormi opportunità lavorative legate a questo mestiere).

Data Scientist concept — Data Scientist, una figura con competenze diversificate sia tecniche sia manageriali e di business

Come per la scienza dei dati, anche una delle sue figure professionali di riferimento non è “nuova” sull’asse storico ma l’evoluzione degli Analytics e l’esplosione di Big Data hanno messo in luce la necessità di avere “scienziati dei dati” con capacità e competenze più evolute rispetto agli analisti dei dati tradizionali.

Se da un lato la Data Science ha spostato il focus dalla gestione del dato al suo valore per il business, dall’altro lato anche le competenze delle figure professionali legate al Data Management e Data Analytics si sono dovute evolvere ampliando le capacità di analisi tradizionali verso abilità non solo tecniche (statistica, matematica, informatica) ma anche di business (comprensione delle esigenze e degli obiettivi di business, capacità di problem solving, gestione del rischio, ecc.).

Secondo l’Osservatorio Big Data Analytics & BI del Politecnico di Milano, il Data Scientist è la figura professionale che comunemente si associa alla capacità di gestire i Big Data e trarne informazioni rilevanti. Da un punto di vista più tecnico, viene “inquadrato” come figura altamente specializzata che conosce in maniera approfondita le tecniche matematico-statistiche, che sa sviluppare e implementare algoritmi di Machine Learning, conosce diversi linguaggi informatici di programmazione (come R o Python tra i più recenti legati a Machine Learning ed Intelligenza Artificiale) e gestisce gli Analytics.

Tra le altre competenze, al Data Scientist è richiesta anche una certa capacità comunicativa per poter presentare, comunicare e chiarificare i risultati delle sofisticate analisi dei dati agli utenti di business che non hanno competenze tecniche ma necessitano dei Data Analytics per prendere decisioni più efficaci.

Ecco dunque che per riassumere chi è e cosa fa un Data Scientist, può venire in aiuto l’identikit che ne ha fatto l’Osservatorio del Politecnico: una persona con una conoscenza approfondita di modelli matematico-statistici e algoritmi, tecniche di programmazione necessarie per implementarli e capacità di raccontare le evidenze in modo sintetico e semplice.

La “nuova” scienza dei dati nei Big Data Analytics

Dalle sue origini ad oggi, il concetto di Data Science si è evoluto fino a trasformarlo, come accennato, verso quella scienza multidisciplinare come la intendiamo oggi il cui fulcro sta nella valorizzazione dei dati ai fini del business.

L’importanza dell’analisi dei dati ai fini di business, per prendere decisioni più efficaci

Da qui la sua stretta correlazione con i Big Data Analytics che, volendo provare a darne una sorta di definizione, fanno riferimento alla scienza dell’analisi dei dati (in particolare grandi moli di dati eterogeni, destrutturati e provenienti da più fonti, soprattutto esterne all’azienda) il cui obiettivo è l’interpretazione da cui estrapolare conoscenza utile al processo decisionale.

Scienza che guarda sempre più intensamente alle analisi predittive e prescrittive per definire possibili scenari futuri ed avere informazioni utili e di valore per definire azioni strategiche e/o operative, e che sempre più frequentemente si innerva di nuove tecniche e tecnologie, come il Machine Learning.

Cookie	Durata	Descrizione
_GRECAPTCHA	5 months 27 days	This cookie is set by the Google recaptcha service to identify bots to protect the website against malicious spam attacks.
_GRECAPTCHA	5 months 27 days	This cookie is set by the Google recaptcha service to identify bots to protect the website against malicious spam attacks.
cookielawinfo-checkbox-advertisement	1 year	Set by the GDPR Cookie Consent plugin, this cookie is used to record the user consent for the cookies in the "Advertisement" category .
cookielawinfo-checkbox-advertisement	1 year	Set by the GDPR Cookie Consent plugin, this cookie is used to record the user consent for the cookies in the "Advertisement" category .
cookielawinfo-checkbox-analytics	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Analytics".
cookielawinfo-checkbox-analytics	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Analytics".
cookielawinfo-checkbox-functional	11 months	The cookie is set by GDPR cookie consent to record the user consent for the cookies in the category "Functional".
cookielawinfo-checkbox-functional	11 months	The cookie is set by GDPR cookie consent to record the user consent for the cookies in the category "Functional".
cookielawinfo-checkbox-necessary	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookies is used to store the user consent for the cookies in the category "Necessary".
cookielawinfo-checkbox-necessary	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookies is used to store the user consent for the cookies in the category "Necessary".
cookielawinfo-checkbox-others	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Other.
cookielawinfo-checkbox-others	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Other.
cookielawinfo-checkbox-performance	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Performance".
cookielawinfo-checkbox-performance	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Performance".
CookieLawInfoConsent	1 year	Records the default button state of the corresponding category & the status of CCPA. It works only in coordination with the primary cookie.
CookieLawInfoConsent	1 year	Records the default button state of the corresponding category & the status of CCPA. It works only in coordination with the primary cookie.
viewed_cookie_policy	11 months	The cookie is set by the GDPR Cookie Consent plugin and is used to store whether or not user has consented to the use of cookies. It does not store any personal data.
viewed_cookie_policy	11 months	The cookie is set by the GDPR Cookie Consent plugin and is used to store whether or not user has consented to the use of cookies. It does not store any personal data.

Cookie	Durata	Descrizione
bcookie	2 years	LinkedIn sets this cookie from LinkedIn share buttons and ad tags to recognize browser ID.
bscookie	2 years	LinkedIn sets this cookie to store performed actions on the website.
lang	session	LinkedIn sets this cookie to remember a user's language setting.
lidc	1 day	LinkedIn sets the lidc cookie to facilitate data center selection.
UserMatchHistory	1 month	LinkedIn sets this cookie for LinkedIn Ads ID syncing.

Cookie	Durata	Descrizione
__kla_id	2 years	Cookie set to track when someone clicks through a Klaviyo email to a website.
SRM_B	1 year 24 days	Used by Microsoft Advertising as a unique ID for visitors.

Cookie	Durata	Descrizione
_ga	2 years	The _ga cookie, installed by Google Analytics, calculates visitor, session and campaign data and also keeps track of site usage for the site's analytics report. The cookie stores information anonymously and assigns a randomly generated number to recognize unique visitors.
_ga	2 years	The _ga cookie, installed by Google Analytics, calculates visitor, session and campaign data and also keeps track of site usage for the site's analytics report. The cookie stores information anonymously and assigns a randomly generated number to recognize unique visitors.
_gat_UA-137720848-1	1 minute	A variation of the _gat cookie set by Google Analytics and Google Tag Manager to allow website owners to track visitor behaviour and measure site performance. The pattern element in the name contains the unique identity number of the account or website it relates to.
_gat_UA-35242002-1	1 minute	A variation of the _gat cookie set by Google Analytics and Google Tag Manager to allow website owners to track visitor behaviour and measure site performance. The pattern element in the name contains the unique identity number of the account or website it relates to.
_gcl_au	3 months	Provided by Google Tag Manager to experiment advertisement efficiency of websites using their services.
_gid	1 day	Installed by Google Analytics, _gid cookie stores information on how visitors use a website, while also creating an analytics report of the website's performance. Some of the data that are collected include the number of visitors, their source, and the pages they visit anonymously.
_gid	1 day	Installed by Google Analytics, _gid cookie stores information on how visitors use a website, while also creating an analytics report of the website's performance. Some of the data that are collected include the number of visitors, their source, and the pages they visit anonymously.
_hjAbsoluteSessionInProgress	30 minutes	Hotjar sets this cookie to detect the first pageview session of a user. This is a True/False flag set by the cookie.
_hjFirstSeen	30 minutes	Hotjar sets this cookie to identify a new user’s first session. It stores a true/false value, indicating whether it was the first time Hotjar saw this user.
_hjIncludedInPageviewSample	2 minutes	Hotjar sets this cookie to know whether a user is included in the data sampling defined by the site's pageview limit.
_hjIncludedInSessionSample	2 minutes	Hotjar sets this cookie to know whether a user is included in the data sampling defined by the site's daily session limit.
_hjTLDTest	session	To determine the most generic cookie path that has to be used instead of the page hostname, Hotjar sets the _hjTLDTest cookie to store different URL substring alternatives until it fails.
ajs_anonymous_id	1 year	This cookie is set by Segment to count the number of people who visit a certain site by tracking if they have visited before.
CONSENT	2 years	YouTube sets this cookie via embedded youtube-videos and registers anonymous statistical data.

Cookie	Durata	Descrizione
_fbp	3 months	This cookie is set by Facebook to display advertisements when either on Facebook or on a digital platform powered by Facebook advertising, after visiting the website.
_fbp	3 months	This cookie is set by Facebook to display advertisements when either on Facebook or on a digital platform powered by Facebook advertising, after visiting the website.
ANONCHK	10 minutes	The ANONCHK cookie, set by Bing, is used to store a user's session ID and also verify the clicks from ads on the Bing search engine. The cookie helps in reporting and personalization as well.
fr	3 months	Facebook sets this cookie to show relevant advertisements to users by tracking user behaviour across the web, on sites that have Facebook pixel or Facebook social plugin.
fr	3 months	Facebook sets this cookie to show relevant advertisements to users by tracking user behaviour across the web, on sites that have Facebook pixel or Facebook social plugin.
MUID	1 year 24 days	Bing sets this cookie to recognize unique web browsers visiting Microsoft sites. This cookie is used for advertising, site analytics, and other operations.
test_cookie	15 minutes	The test_cookie is set by doubleclick.net and is used to determine if the user's browser supports cookies.
VISITOR_INFO1_LIVE	5 months 27 days	A cookie set by YouTube to measure bandwidth that determines whether the user gets the new or old player interface.
YSC	session	YSC cookie is set by Youtube and is used to track the views of embedded videos on Youtube pages.
yt-remote-connected-devices	never	YouTube sets this cookie to store the video preferences of the user using embedded YouTube video.
yt-remote-device-id	never	YouTube sets this cookie to store the video preferences of the user using embedded YouTube video.
yt.innertube::nextId	never	This cookie, set by YouTube, registers a unique ID to store data on what videos from YouTube the user has seen.
yt.innertube::requests	never	This cookie, set by YouTube, registers a unique ID to store data on what videos from YouTube the user has seen.

Data Science, la scienza alla base dei Big Data Analytics

Cos’è e cosa significa esattamente Data Science?

Chi è e cosa fa il Data Scientist

La “nuova” scienza dei dati nei Big Data Analytics

Tag

Articoli

Parla con un Nostro Esperto

Siamo disponibili per ogni chiarimento e problema, non esitare a contattarci

Hanno scelto IT Impresa

Contatti e Indirizzi

Sedi

Restiamo in contatto

Categorie Blog

Cookie	Durata	Descrizione
__awc_tld_test__	session	No description
_clck	1 year	No description
_clsk	1 day	No description
_hjSession_1956240	30 minutes	No description
_hjSessionUser_1956240	1 year	No description
AnalyticsSyncHistory	1 month	No description
CLID	1 year	No description
last_pys_landing_page	7 days	No description
last_pysTrafficSource	7 days	No description
li_gc	2 years	No description
pys_first_visit	7 days	No description
pys_landing_page	7 days	No description
pys_session_limit	1 hour	No description
pys_start_session	session	No description
pysTrafficSource	7 days	No description
SM	session	No description available.