Structured data offers a predetermined structure which makes them storable in databases. Being already “structured”, these data are collected in fixed “spots” making it easier to store them, and to run queries and analysis.
On the contrary, unstructured data don’t have a predetermined structure, which is the reason for them being stored as they are.
As of today, 80% of data flows are unstructured in the form of text, audio, video or images, but also as web pages, comments, and surveys.
Looking more closely at the example of a YouTube video, there are some metadata, such as the time of upload, the date of upload, the number of views (partial or complete), the number of likes and dislikes, etc. But the content within the title of the video, the description of the video and the video itself are not structured. They have a qualitative aspect that cannot be acquired only through numbers.
All these flows of data require the use of the cloud. But what is a cloud and why is it important?
Let’s start from the basics. The term cloud can be thought of as an “ecosystem” where remote servers connect. We often name it Cloud computing because it offers different computing services. The user, indeed, does not need to install any software except for a good internet connection to get access to data.
Cloud computing works thanks to a technology known as virtualization, and it offers three type of services:
IaaS (infrastructure as a service): In this case, the company outsources the IT infrastructure. IaaS offers indeed all the necessary hardware components such as RAM, space, server, network. As a result, the company doesn’t have to neither buy physical servers nor maintain IT infrastructure, which is borne by the cloud.
Addresses to: Network Architects, IT administrators
Examples: Google Compute Engine, AWS, Oracle
PaaS (Platform as a service): In this case, the cloud platform provides a platform including all the (hardware and software) instruments needed to build a software. Hence, the software developer has “only” to develop the on-cloud application.
Addresses to: Software developer
Example: Google App Engine, Microsoft Azure
SaaS (Software as a service): Amongst them, it’s the most popular one since it’s ready-to-use by users on the web by the companies.. Salesforce, Slack, Mailchimp, Gmail, Trello, and Office365 are all examples of SaaS applications.
Common questions
A brief history of cloud computing
In 1961, a person called John McCarthy delivered a speech at MIT saying that “Computing can be sold as a utility, like water and electricity”.
In the following years, American computer scientist J.C.R. Licklider - which is believed to be the father of cloud computing technology - imagined a world where connecting was possible regardless of the location. His intuition indeed brought to the creation of ARPANET (Advanced Research Projects Agency Network), which allowed the sharing of digital resources between computers that were physically distant.
As for the term cloud computing, it was coined in 1997 by the professor Ramnath Chellappa during a talk on the “new computing paradigm”.
Eventually, the first global company to implement Cloud computing services was Salesforce. Since then, Cloud computing services have been in huge demand, so that the big organizations like Amazon, Microsoft, and Google are all providing Cloud Computing services.
Sources
https://www.talend.com/resources/data-science-vs-business-intelligence/
https://www.purestorage.com/it/knowledge/big-data/structured-vs-unstructured-data.html
https://www.salesforce.com/it/learning-centre/tech/cloudcomputing/
https://www.cloudflare.com/it-it/learning/cloud/what-is-the-cloud/
https://www.ficpa.org/what-cloud-computing-all-you-need-know-numbers
https://www.naquadria.it/it/blog/cloud-computing-cos-nascita-e-vantaggi