Data Analyst/Data Scientist with 15+ years of experience (mostly in insurance and banking).
Looking for a new role: Python, SQL, SAS, PySpark, Big Data, ML, AML, Gen AI, etc.
I'm a US citizen, had a green card through National General (NY) previously.
I have degrees in both statistics and math from the University of São Paulo (the best in Brazil).
I also have an MBA from EAESP-FGV in São Paulo (an accredited business school).
| Accreditor |
Country |
| AACSB (Association to Advance Collegiate Schools of Business) |
USA |
| AMBA (Association of MBAs) |
UK |
| EQUIS (EFMD Quality Improvement System) |
Belgium |
Resumes:
Data Analysis
Data Science
Python projects developed in VS Code
I have a knack for programming (it's about algorithm development, not memorization). 🐍
Here are some projects that I've developed in Python:
- Creation of video/images with Gen AI/AI Prompting/TTS. AI video
- Using n8n (a NoCode orchestration tool) to automate Gen AI. n8n WF
- Cloning my own voice with the TTS module.
- Checking a stock price online and sending an e-mail alert if price above target (scheduled job).
- Speech recognition and transcription/translation (PT to EN).
- Automatic creation of subtitle file (.srt) from transcribed/translated audio. Ana Paula
- Web scraping of tables on Wikipedia using BeautifulSoup.
- Automatic creation of formatted PowerPoint slides with data scraped from Wikipedia. 200 Brazilian actresses
- Automating tasks of apps that don't expose an API (e.g., applying auto correct to images in Office Picture Manager).
- Video enhancement using AI: Enhanced vs. Original
- Given a person's name in a given language, predict the gender of the person.
- Installing, configuring and debugging PySpark on Windows (Python 3.11.8, Java 11.0, Hadoop 3.3.5), for Big Data.
-
Fitting models such as Logistic Regression using PySpark ML
(Logistic Regression)
- AI video/image enhancement using Real-ESRGAN (Generative Adversarial Network).
- Checking if NVIDIA CUDA is enabled after installation.
- Play counts syncing between iTunes and Windows Media Player. GUI applet
- Moving mp3 files into their correct folders and creating full log files of changes.
- Searching, downloading, and attaching mp3 artwork (Apple/Discogs) using REST API's and logging results.
- Image pattern recognition for mp3 covers (detecting generic vinyl/LP covers and replacing them).
- Auto-populating mp3 tags using Discogs data using REST API.
- Synchronizing mp3 tag names with file names to resolve inconsistencies.
- Incremental backup of roughly 63k mp3 files from SSD to MicroSD (mirrored backup system).
- Downloading videos from YouTube automatically (cookies may be required).
- Creating standalone executables from Python scripts.
- Passing functions as parameters using lambda and dynamic execution.
- Image, audio, and video processing using ffmpeg.
- Running Windows shell commands from Python.
Here's a little guide to some of the iTunes/WMP scripts ➔
Music Scripts
Reports created in SAS
A dashboard created with SAS/GRAPH
A SAS stored process created to automate data extracts
A Proc Report Excel output (redacted)
Personal math research
I plan to upload my latest paper in Number Theory here as soon as it's finished (very close now).
It introduces a new symmetry relation for the Lerch Φ function that may have implications for proving if a number is algebraic or transcendental (still checking). It is a generalization of the Riemann functional equation with three parameters.
Using GitHub for the first time as an option to pre-print repositories (using Google Analytics to track page hits).
Research papers on arXiv
© Jose R. Sousa. All rights reserved.
Applets on Gumroad
Links to profiles