If you’re working with weighted quantitative data, statistical software like SPSS (and its open source counterpart PSPP) have easy built-in functionality to handle the weights. But it’s less straightforward if you’re working in other programs, so this quick tutorial shows how to complete basic calculations with weighted survey data in Power BI. For this tutorial, […]
When I’m working with data, I probably spend 90% of my time cleaning. In a recent project, I ran into an issue where I needed to make ZIP codes consistent – some were five-digit, some had the plus-four, some had a hyphen while others didn’t, and some were just invalid. I searched for a simple […]
If you work with large volumes of data in Google Analytics, you have likely encountered sampled data at some point. Seeing that your data has been sampled might make you suspicious – can I really trust the data if it is sampled at 50%? What about only 5%? To get to the bottom of this, […]
What this does: Append rows from one Google sheet to another & schedule the task to run daily Requirements: Baseline understanding of Google sheets Concepts covered: Google Apps Scripts, Google Sheets One of the coolest features of Google Sheets is the ability to automate tasks via scripts. For example, say you need to perform a […]
What this does: Cleans text from meta tags using basic JavaScript to pass to Google Analytics reports Requirements: Google Analytics deployed through Tag Manager, understanding of regular expressions & JavaScript, admin access to Google Analytics Over the years I’ve found one of the more useful tools in Google Analytics is the ability to pass custom […]
What this is for: Extracting bulk data from the NPI registry and downloading as CSV Requirements: Knowledge of JSON, Pandas dataframes Concepts covered: Making an API request, flattening JSON, combining dataframes Download: Download the Jupyter notebook If you work with health care provider data, you are no doubt familiar with NPI (National Provider Identifier) data. […]
What this is for: Forecasting seasonal data Requirements: Python Anaconda distribution, Understanding of statistics and experience with machine learning Concepts covered: Calucating confidence intervals and forecasting future values with pmdarima library Download the Jupyter notebook One of the more helpful applications of data science to marketing is developing forecasts. You can use forecasts to predict […]
What this is for: Collecting comments from a public Facebook page and performing a basic content analysis Requirements: Python Anaconda distribution, basic understanding of HTML structure and Chrome inspector tool Concepts covered: Social listening, word clouds For anyone who works in strategic communications, it’s critical that you have a finger on the pulse of your […]
Google Analytics’ Demographics report can provide a wealth of valuable data about the users of your website, including age, gender, interests and affinity categories. One of the cool but lesser known features of Google Analytics is the ability to create pivot tables with your web data. In this quick tutorial, we’ll make a simple pivot […]
What this is for: Telling a Selenium web scraper to click on link or enter basic information into a form Requirements: Python Anaconda distribution, Basic knowledge of HTML structure and Chrome Inspector tool Concepts covered: Selenium Selenium is an incredibly useful tool for scraping websites with Python, but occasionally your scraper may need to interact […]