In this information rich age, data visualizations are designed to make the knowledge transfer between deliverers and receivers easier. Therefore, it is crucial for the dashboard creators to know which chart is aligned with the key delivery objectives. On the other hand, having a basic understanding of the underlying meaning of each chart also helps the audience to interpret dashboards effectively. In this article, I introduced a way that may help to better understand some common charts and graphs, e.g. scatter plot, map, pie graph and stacked bar chart etc, by categorising them into four main types: distribution, comparison, composition…
Exploratory Data Analysis, also known as EDA, has become an increasingly hot topic in data science. Just as the name suggests, it is the process of trial and error in an uncertain space, with the goal of finding insights. It usually happens at the early stage of the data science lifecycle. Although there is no clear-cut between the definition of data exploration, data cleaning, or feature engineering. EDA is generally found to be sitting right after the data cleaning phase and before feature engineering or model building. EDA assists in setting the overall direction of model selection and it helps…
While taking the first step into the field of machine learning, it is so easy to get overwhelmed by all kinds of complex algorithms and ugly symbols. Therefore, hopefully, this article can lower the entry barrier by providing a beginner-friendly guide. Allow you to get a sense of achievement by building your own ML model using BigQuery and SQL. That’s right, we can use SQL to implement machine learning. If you are looking for several lines of code to get your hands dirty in the ML field, please continue reading :)
Learning data science is a long journey, following a rigid course curriculum inevitably makes learning a mundane task. Therefore, I have compiled a list of data science blogs that are able to bring you daily does of inspiration on various domains: AI and Machine Learning, Data Engineering, Data Visualization, and Business Acumen.
I have created an infographic as a summary, feel free to steal it at the end of this article. Additionally, if you are looking for data science podcasts to follow, have a read of the podcast list I collected :).
Towards Data Science gathers a large community of…
To perform advanced analytical processing and data discovery, one table is often not enough to bring valuable insights, hence combining multiple tables together is unavoidable. SQL, as a tool to communicate with relational database, provides the functionality to build relationships among tables. This article introduces how to use SQL to link tables together. If you want to learn more about the basics of SQL, I suggest have a read of my first article about learning SQL in everyday language. It gives a comprehensive SQL introduction for absolute beginners.
Maybe you haven’t even realized, we frequently come across joining in Excel…
If we only learn data science through a rigid curriculum created by universities or online courses from Coursera or Udemy, we may find the learning process too boring. If you ever find yourself losing motivation in this long journey of studying data science, you may just need some podcasts to break the routine and get some inspiration. The major difference between these two approaches of learning is that the former focuses on theory and concepts, whereas the latter introduces more practical experience and projects that add flesh to the bones.
Listening to podcasts is a great way to absorb knowledge…
SQL stands for “Structured Query Language”, so it is a programming language just like python, java or R. What it differs from most common language is that it is a type of declarative language, which means it tells computers what to do instead of how to do it. As its name suggests, SQL is the language used to communicate with the database for the purpose of requesting and extracting the data we want. Let’s first understand what is a database? …
Missing data is one of the most common data quality issues among three most common issues: Missing Value, Duplicated Value and Inconsistent Value.
Many people may confused landing page with website home page. The main difference is that all landing pages have a strong intention — “convert”. Convert viewers into buyers, or let me restate it straightforwardly, convert traffic into money. Landing page is a standalone page built for marketing campaigns. Due to this nature, it needs to deliver information more effectively and grab viewers attention more efficiently, compared to other kinds of website pages. So how can we design a landing page that converts better?
Needless to say, call to action is the most important element of a landing page. Call to…
Don’t let money become the barrier that stops you from learning whatever you are passionate about. This article introduces some free visualization tools to create dashboards, e.g. Tableau, PowerBI, Google Data Studio ... Additionally, there are many free resources and tutorials that help you to master these tools.
on my way to become a data storyteller