Hello! I'm Francisco Iago

Transforming Public Health Data into Actionable Insights | Python, R, & SQL Expert

Who am I?

A little about me

Data Scientist with a strong Statistical background (B.Sc.) and currently pursuing a Master's in Applied Computing. I specialize in Health Tech and Epidemiological Surveillance, currently managing large-scale laboratory data pipelines (GAL) for the Brazilian Ministry of Health. My focus is bridging the gap between raw public health data and strategic decision-making using Python, SQL, and Cloud Engineering. Previously optimized judicial workflows at Federal Labor Courts using BI tools

But what do I know?

Hard Skills
Rstudio

Main libraries for data analysis and machine learning, as well as libraries for creating dashboards.

Python

Pandas, Scikit-learn, NumPy, TensorFlow, Matplotlib, Seaborn, among others. Learning Streamlit.

SQL

Relational databases, querying, updating, insertion, filtering, aggregation, creation, and data modification.

Excel

Advanced formulas, automation via macros, pivot tables, and charts.ㅤㅤㅤㅤㅤㅤㅤㅤㅤ

Power BI

Creation of interactive visualizations and integration with R and Python.

Tableau

Creation of interactive visualizations and integration with R and Python².

Looker

Creation of interactive visualizations.ㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤ

AWS

Initial concepts on cloud computing.ㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤ

GitHub

Working with remote repositories.ㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤ

Databricks

Using the platform for creating analytical solutions.ㅤ

Developed Projects

Portfolio
Sports Analytics: Multivariate Analysis

By Francisco Ferreira

This work was developed to obtain the degree of Bachelor in Statistics from the University of Brasília. My idea was to analyze the performance of players in the 2022 Brasileirão Série A (Brazilian League) using Multivariate Analysis techniques. The work was presented on Dec 18, 2023, the same day I formally obtained my bachelor's degree in statistics!

Read more
Ad A/B Testing

By Francisco Ferreira

A/B is a fundamental approach in statistical experimentation, often used to compare two versions of a product, marketing strategy, or whatever the variant of interest may be. In this context, I will apply an A/B analysis to the data found in this link. The aim will be to assess whether the responses between the control and exposed groups, using appropriate statistical tests to determine whether there is a significant difference in the proportions of 'yes' responses.

Notebook link
Judicial Efficiency Predictor (R Shiny)

By Francisco Ferreira

During my internship at the Regional Labor Court (10th Region), I developed an app in Shiny to simulate the placement of labor courts in the IGEST ranking. At the time, we used a dashboard made in Tableau, but it was barely functional. The biggest challenge was to create a dashboard that was fast, could filter values added by the user, and was able to show the ranking on a single page. The development process for this work can be found on my Github profile.

Read more Info about IGEST
Which is the best forecast?

By Francisco Ferreira

During the Time Series course, one thing my professor used to say was that the best model is not always the one chosen by the software.

In this small study, I use one of the assignments I did during the course to try to demonstrate this point.

Read more
National Health Monitoring System

By Francisco Ferreira

Statistics Laboratory 2 was one of the last courses I took during my undergraduate studies. In it, my colleagues (Arthur Rodrigues, Ana Carolina Gomez, Daniel Miranda and Rayssa Lorrane) and I developed a dashboard using Looker. Until then, this was my first experience with this software. Initially, this represented a barrier, but with study and dedication, I managed to overcome these obstacles. The course syllabus, information about Previne, and the dashboard can be consulted below.

Course Syllabus Info about Previne Dashboard
Big Data Processing with PySpark

By Francisco Ferreira

In a world where data analysis/data science is growing strongly, choosing the most appropriate tool is crucial for exploring the maximum amount of information. In this small study, I explore some possibilities of using the platform, utilizing the SQL language.

Markdown
Ready to accelerate your data team?

Get in touch via my LinkedIn.