How To Start Using PySpark On Your BigQuery Data Using Google Dataproc

Google BigQuery is a great big data storage – it is simple, fast, and highly scalable. BigQuery’s SQL allows you to satisfy most of the needs that usually buisness may have – join tables, data transformation, calculations and so on. But sometimes you may need more complicated cases of data transformation when SQL does not Read more about How To Start Using PySpark On Your BigQuery Data Using Google Dataproc[…]

How To Deploy A Simple Dockerized Flask App On Google Cloud Run

Google Cloud Platform offers a wide range of serviced which allows you to deploy your own web applications and share them with the whole world. Here you have: App Engine, Compute Engine, Kubernetes clusters or even Cloud Functions for simple endpoints. All this solutions are mainly differ with technical complexity, flexibility and of course costs. Read more about How To Deploy A Simple Dockerized Flask App On Google Cloud Run[…]

How To Get Google Analytics 4 Property Report to Pandas Dataframe Using API

If your work is even slightly related to online marketing – you probably have heard already about new version of Google Analytics – Google Analytics 4 Property. Everybody are talking about it now because Google Analytics is the most popular web analytics tool, so a lot of businesses worldwide are depend on it. One of Read more about How To Get Google Analytics 4 Property Report to Pandas Dataframe Using API[…]

Import Google Sheets Data Into Google BigQuery Using Python And Pandas

Google Sheets is a great tool for storing not too large amount of data, with possibility of easy adding and editing data manually without deep technical knowledge of SQL or programming. On the other hand Google offers a BigQuery – a big data warehouse with super-fast SQL wich allows to manipulate and analyse huge amount Read more about Import Google Sheets Data Into Google BigQuery Using Python And Pandas[…]

How To Import Cost Data Into Google Analytics Using API and Python

Google Analytics have different built-in integrations with other Google advertising platforms like Google Ads or Campaign Manager (in case of GA360) in order to import data from those platforms and present more complete reports. But in case of non-Google ad platforms like Facebook Ads – there are no built-in solutions within Google Analytics. It can Read more about How To Import Cost Data Into Google Analytics Using API and Python[…]

How To Make Complicated PDFs From Basic HTML And CSS

Many web applications require PDF reports and downloads. There are numerous open-source libraries that can build basic PDFs, but if you need a more complicated PDF document or are sending your document to a printer, you’ll need a commercial PDF generation tool. Unlike the browser-based open-source HTML to PDF libraries, commercial engines are designed specifically Read more about How To Make Complicated PDFs From Basic HTML And CSS[…]

Google Cloud Datastore – A Simple And Fast NoSQL Database For Your Project (with Python API examples)

Google Cloud Platform has a lot of different data storing options, but if you are looking for a simple NoSql solution with fast read-inserts – Google DataStore will be the right choice here. If you are writing some web application – Google Datastore will help you to easily handle your app’s data like data of Read more about Google Cloud Datastore – A Simple And Fast NoSQL Database For Your Project (with Python API examples)[…]

How To Deploy A Simple Flask App On Google App Engine In Couple Of Minutes

Google App Engine is a Google Cloud service which lets you to deploy your web site or web application and share it with other world in a very simple way without carrying about all these complicated server stuff and infrastructure. In this guide we will deploy our own app, the whole process can be described Read more about How To Deploy A Simple Flask App On Google App Engine In Couple Of Minutes[…]

Regular Expressions (RegEx) in Google Data Studio – Guide With Examples

Regular expressions are great helpers for every data analyst because they allows to transform dimensions applying quite complicated logic rules. In Google Data Studio I often use REGEXP text functions – so I decided to share with you some my most often use cases. In addition – we’ll find out the difference between REGEXP_REPLACE, REGEXP_EXTRACT Read more about Regular Expressions (RegEx) in Google Data Studio – Guide With Examples[…]

Pandas Python Library = Excel On Steroids (tutorial for beginners)

People that never tried programming often think, that programming is something very complicated and hard to start. Honestly I had the same thinking until I tried python – one of the most intuitive programming languages with a low entry barrier. While learning programming it’s important to be able instantly try your knowledge in practice. In Read more about Pandas Python Library = Excel On Steroids (tutorial for beginners)[…]

How You Can Create Your Own Bot Using Selenium WebDriver And Python

When I created my first bot with Selenium library – I could not believe how easy and intuitively Selenium is. It’s like writing a scenario for a movie – You just describing step by step what should be going on. In my particular case – Selenium helped me to automate reporting of some advertising platforms, Read more about How You Can Create Your Own Bot Using Selenium WebDriver And Python[…]

How To Automate Facebook Ads Reporting In Google Data Studio With Python And FB Marketing API

If You ever wanted to create a dashboard with Facebook Ads data – You probably had an issue with data connectors. The most popular like Supermetrics costs lot of money, although there are some cheap solutions like Pipelinica. But also You can try to create your own solution for free. There is a scheme of Read more about How To Automate Facebook Ads Reporting In Google Data Studio With Python And FB Marketing API[…]

Markov Chain Attribution – Simple Explanation Of Removal Effect

“Markov Chain Attribution” is one of the most popular data driven attribution models. The most important concept behind this model is removal effect. In this article I`ll try to explain the math behind removal effect in a simple way without any formulas. As an example we will take a very simple use case – four Read more about Markov Chain Attribution – Simple Explanation Of Removal Effect[…]

3 Helpful Python Functions For Data Manipulation

Working with Pandas dataframes It`s often needed to reshape the data, in order to prepare it for further analysis, visualization or library that requires particular data form. I`ve decided to share with you some functions that I used a lot working with customer journey paths, exported from Google Analytics. For experienced data scientist that could Read more about 3 Helpful Python Functions For Data Manipulation[…]

Parallel Coordinates For Multidimensional Data Visualization

Parallel coordinates were invented in far 1885 by French engineer and mathematician Philbert Maurice d’Ocagne. When I discovered this way of visualization – I was really impressed how it allows to visualize such a complicated thing as multidimensional data in a simple and intuitive way. This is how I visualized few dimensions of Mobile App Read more about Parallel Coordinates For Multidimensional Data Visualization[…]

5 Google Spreadsheet Formulas Which Every Web-Analyst Should Know

Google Spreadsheet formulas is a powerful tool which can make your life much more easier if you are working with data a lot. There are a lot of interesting extensions that can be very useful for a online marketeer, like Google Ads extension or Google Analytics extension for Google Spreadsheets, but output data which you`ll Read more about 5 Google Spreadsheet Formulas Which Every Web-Analyst Should Know[…]