Data Scientist - Beymen
Hi, I’m Taner Sekmen 👋
I am a Data Scientist & Software Developer, camp lover, and learner, living in Istanbul, Turkey.
I’m currently developing things at Beymen and trying to contribute to open-source projects and company repositories from organizations such as Microsoft, Google Research, HuggingFace, GitHub Education, EthicalML and Data Talks Club.
I have also opened issues to find out whether I can support the development of OpenAI, obsei and networkx.
Additionally, I contributed to the pandas-ai examples section.
In addition, I translated the dair.ai site into Turkish and deployed it. The site covers artificial intelligence, natural language processing, and prompt engineering, and has over 20k followers.
Lastly, I made a direct contribution to LangChain to enhance the documentation of examples.
- Developed a retrieval-augmented generation (RAG) system for text-to-SQL.
- Created a report for the unique products in the marketplace and retail vertical.
- Generated customized reports, conducted in-depth data analysis, and tailored existing studies to align with specific organizational requirements.
- Predicted customers' pin locations for over 10M unique clients using machine learning algorithms such as Gaussian Mixture Models, hierarchical clustering, K-Means, and BIRCH.
- Achieved an annual cost reduction of over $500k through the pin location project.
- Worked with AWS Redshift, PostgreSQL, and Athena database systems.
- Built reports and cron jobs.
- Enhanced the ad filtering system for GetirJobs with natural language processing methods, using HuggingFace.
- Developed topic models with BERTopic, a natural language processing method, based on feedback obtained from couriers.
- Created a baseline NLP model to detect profanity.
- Utilized APIs to write data into our databases from third-party services.
- Developed interactive dashboards in Qlik Sense to enhance data visualization and facilitate insightful business analysis.
- Examined shop and delivery data sourced from SAP, implementing comprehensive data validation processes to ensure accuracy and reliability.
- Conducted in-depth analysis of missing data, identifying optimization opportunities and implementing strategic solutions to enhance overall data quality.
- Wrote object-oriented code to optimize the customer portfolio.
- Applied financial theory to investment decisions.
It is the 2nd most star-rated notebook among the notebooks created for this dataset.