I'm
Competent in Python programming, with a focus on data processing and the development of data pipelines (CLI syntax).
Knowledgeable about data preprocessing, building machine learning models to predict outcomes, and evaluating those models.
Able to process data with the programming languages above, visualize it with Redash, Looker Studio, Power BI, Excel, and similar tools, and answer business questions.
Familiar with the basics of working with big data; used Hadoop and Apache Spark in school projects.
Skilled in developing ETL pipelines using Python and modern tools such as Airbyte and n8n, ensuring data quality and integrity.
Proficient with query languages such as SQL, MySQL, MDX, and AQL; can process data in multiple file formats (CSV, JSON, TXT, ...) and across multiple database types (SQL, NoSQL).
What I've learned
(Follow the link to go to my GitHub repository)