Siddharth Deshpande
Verified Expert in Engineering
数据科学家和开发人员
Siddharth is an interdisciplinary researcher with unique perspectives derived from translational projects and his combined educational background in materials engineering, biochemistry, healthcare, 自然语言处理(NLP), and data science. He has extensive experience working with biological structured and unstructured data and using state-of-art AI techniques to solve complex healthcare problems.
Portfolio
Experience
Availability
Preferred Environment
GPT, 自然语言处理(NLP), 生成预训练变压器(GPT), Biomedical Skills, Machine Learning, Language Models, 非结构化数据分析, Data Visualization, 人工智能(AI), Biochemistry, Amazon Web Services (AWS), Python
The most amazing...
...thing I've developed is an NLP framework that extracts and visualizes biomedical entities from documents as network graphs to discover new biomedical relations.
Work Experience
临时总技术主任
Immersely
- Worked for Immersely, which was building a platform to unlock the ability for game developers to create hyper-personalized games that adapt in real time to player emotion, 促进参与,创造更好的, 更多商业上成功的游戏.
- Took charge of developing ML models that use physiological signals to detect the emotions of a person while he is gaming to develop an interactive gaming experience.
- Tasked with developing a technical roadmap and back-end tech infrastructure for the company.
Deep Tech Venture Builder
Post Urban Ventures
- 在融资前验证新创业理念的技术可行性, 建立了技术原型(MVP),用于种子期前和种子轮投资者推介, 并为早期创业公司提供必要的技术基础设施.
- Worked as an interim CTO of four startups and as a technical advisor for two startups within Post Urban Ventures.
- 成功为创业公司获得500万英镑的资助.
- 参与准备技术演示文稿, 提供专家建议和指导, 并促进了创业的成功. 为创业公司在预种子期和种子期后的扩张设计技术路线图.
高级AI/ML和NLP聊天机器人开发人员
Richmond Ayirebide
- Developed an accountant chatbot based on the client's requirements using ChatGPT, finetuned GPT-3, and Telegram.
- Streamlined the preprocessing and postprocessing to format results into easy-to-view Excel sheets for the client.
- Helped set up a plan for the future deployment of the chatbot into the cloud infrastructure.
临时总技术主任
Bioleap
- 为biolap开发技术框架, 这家初创公司专注于开发基于人工智能的单细胞模型.
- 管理AWS中云功能的构建, 聘请了一个有能力的技术团队, 改进了现有的机械模型.
- Established several strategic technological partnerships with leading bio-modeling labs. 为Bioleap模型构建了基于云的自动化策略.
- 建立技术战略(技术栈), technical roadmap, 以及支持增长战略的商业计划.
NLP Data Scientist
Evaluate Ltd
- Developed a press-release classifier that categorizes news articles into 40 technology classes, 在30岁左右拯救公司,每年第三方API许可费用为1000英镑.
- 从临床试验中确定数字健康创新, news articles, and deal documents for a custom analytics project that reduced workforce hours of manual document classification for a Japanese client.
- Created a core NLP framework to extract biomedical entities from unstructured texts and visualize them as a graphical network; the framework became popular for discovering new biomedical relations and was subsequently used in many Evaluate products.
Data Scientist
Patsnap
- Developed PatSnap Bio, a core product that is one of the largest sequence searching platforms and is actively being used by large pharmaceutical companies.
- 创建了另一个在中国进行Beta测试的核心产品PatSnap Materials.
- Engaged actively in the product development and client feedback process for PatSnap Bio and PatSnap Materials.
- 提交了涉及我的技术的五项专利申请.
Experience
2019冠状病毒病科学期刊分析
http://github.com/siddharth0112358/coronavirus_19GitHub上的研究论文:
• AutoDetect_COVID_FakeNews - Classification model for detecting Fake news regarding COVID
• BERT_semantic_search - Semantic search which finds similar sentences in COVID corpus in response to query question
• Biorelated_sentence_extraction_COVID - extract bio-related sentences from COVID corpus
• COVID_19_topic_modelling_Top2Vec - Topic modelling on COVID_19 corpus using Top2Vec
•COVID_explore_drugs -浏览COVID语料库中的药物
• CoVID19_Ques_and_Ans - Covid papers Questions and Answering system based on doc2vec
• CoVID_19_NER_text_summarization_and_topic_modelling - BART summarization and LDA topic modelling and NER
•Covid_19_genome_analysis - covid - 19基因组分析
•Covid_paper_rank_display -基于主题的NER和covid论文恢复
•Medical_NER_Corona -冠状病毒数据集上的NER
•Mining_COVID_keywords—使用双字母和三字母挖掘关键字
阿里云全球人工智能创新挑战赛
The goal of my project was to analyze the effect of weather on energy generation and demand and find a solution that can predict renewable energy generation and energy demand using weather parameters.
SOLUTION HIGHLIGHTS
• Solar, wind, and hydro energy generation prediction using climate and time parameters.
• Energy demand prediction was done using time and energy parameters (Model 1) and time, energy, 及气候参数(模式二). 模型2的准确率略高于模型1. It shows that climate parameters do not affect energy demand as significantly as energy parameters.
• Energy price prediction was done using time and energy parameters (Model 1) and time, energy, 及气候参数(模式二). 模型2的准确率高于模型1. 结果表明,气候参数对能源价格有显著影响.
对于上述所有情况,我们测试了1000万种回归算法. The ExtraTreeRegressor algorithm showed the best performance and was used to build the regression model.
URL: http://www.alibabacloud.com/blog/project-showcase-%7C-effect-of-weather-on-energy-generation-and-demand_598252
Conversational Chatbots
• Conversation helper - This bot helps to simulate tough conversations so that the clients can practice the conversations beforehand. 客户在2-3个会话技巧上得分, and a report is generated at the end that shows his score and how to improve his conversation ability.
• Fashion assistant - This bot recommends fashion items based on client needs and stock inventory of the business. 它使用GPT-3和DALL-E的组合.
• Google bot - This bot has a Google search engine capability and acts as an advisor/friend to whom you can ask any questions, and it will run a Google search in the back end to provide you with the most updated answers.
面试时可以播放预览.
Skillset
Languages
Python, Python 3
Industry Expertise
生物信息学、医疗
Storage
JSON
Other
自然语言处理(NLP), Machine Learning, Data Visualization, Biochemistry, Analytics, Biology, Pharmacology, R&D, Engineering, CSV File Processing, Excel Expert, Interactive Charts, Spark NLP, Chatbots, Patents, GPT, 生成预训练变压器(GPT), Biomedical Skills, Language Models, 非结构化数据分析, 人工智能(AI), Biomaterial, Composite Materials, Deep Learning, Dash, Deep Neural Networks, 卷积神经网络(CNN), Sequence Models, Entrepreneurship, Web Scraping, Time Series Analysis, Computational Biology, Game AI, Emotion Recognition, 聊天机器人对话设计, LangChain, Weviate, Pinecone, Cell Biology, Materials Science, 3D Printing, Product Development, Model Deployment, 生成对抗网络(GANs), Single-cell Modeling, CTO, Pitch Preparation, Medical Diagnostics, OpenAI, 生成预训练变压器3 (GPT-3), Google Custom Search
Libraries/APIs
TensorFlow, PySpark, Spark ML
Tools
微软Excel, SOLIDWORKS
Paradigms
Data Science
Platforms
Amazon Web Services (AWS)
Education
Doctorate in Medicine
新加坡国立大学-新加坡
材料科学与工程专业硕士学位
新加坡国立大学-新加坡
冶金与材料科学专业本科以上学历
工程学院浦那-浦那,印度
Certifications
数据科学家的医疗保健NLP
John Snow Labs
数据科学家的Spark NLP
John Snow Labs
TensorFlow:高级技术专业化
DeepLearning.AI | via Coursera
医疗保健专业的深度学习
伊利诺伊大学厄巴纳-香槟分校(Coursera
用TensorFlow定制你的模型
伦敦帝国理工学院|来自Coursera
生成对抗网络(GANs)专业化
DeepLearning.AI | via Coursera
机器学习模型的部署
Udemy
Python中的自然语言处理
DataCamp
自然语言处理专业化
DeepLearning.AI | via Coursera
医疗保健专业化中的人工智能
斯坦福大学|来源:Coursera
深度学习专业化
DeepLearning.AI | via Coursera
How to Work with Toptal
Toptal matches you directly with global industry experts from our network in hours—not weeks or months.
Share your needs
Choose your talent
开始你的无风险人才试验
对顶尖人才的需求很大.
Start hiring