Data Scientist - II, Trust Sensitive Content &Intelligence Job ID: 2850305 | ADCI - BLR 14 SEZ - F07 Alexa is thevoice activated digital assistant powering devices like AmazonEcho, Echo Dot, Echo Show, and Fire TV, which are at the forefrontof this latest technology wave. To preserve our customers’experience and trust, the Alexa Sensitive Content Intelligence(ASCI) team builds services and tools through Machine Learningtechniques to implement our policies to detect and mitigatesensitive content in across Alexa. We are looking for a passionate,talented, and inventive Data Scientist-II to help buildindustry-leading technology with Large Language Models (LLMs) andmultimodal systems, requiring good learning and generative modelsknowledge. You will be working with a team of exceptional DataScientists working in a hybrid, fast-paced organization wherescientists, engineers, and product managers work together to buildcustomer facing experiences. You will collaborate with other datascientists while understanding the role data plays in developingdata sets and exemplars that meet customer needs. You will analyzeand automate processes for collecting and annotating LLM inputs andoutputs to assess data quality and measurement. You will applystate-of-the-art Generative AI techniques to analyze how well ourdata represents human language and run experiments to gaugedownstream interactions. You will work collaboratively with otherdata scientists and applied scientists to design and implementprincipled strategies for data optimization. Key jobresponsibilities A Data Scientist-II should have a reasonably goodunderstanding of NLP models (e.g. LSTM, LLMs, other transformerbased models) or CV models (e.g. CNN, AlexNet, ResNet, GANs, ViT)and know of ways to improve their performance using data. Youleverage your technical expertise in improving and extendingexisting models. Your work will directly impact our customers inthe form of products and services that make use of speech,language, and computer vision technologies. You will be joining aselect group of people making history producing one of the mosthighly rated products in Amazon's history, so if you are lookingfor a challenging and innovative role where you can solve importantproblems while growing in your career, this may be the place foryou. A day in the life You will be working with a group of talentedscientists on running experiments to test scientificproposal/solutions to improve our sensitive contents detection andmitigation for worldwide coverage. This will involve collaborationwith partner teams including engineering, PMs, data annotators, andother scientists to discuss data quality, policy, modeldevelopment, and solution implementation. You will work with otherscientists, collaborating and contributing to extending andimproving solutions for the team. About the team The mission of theAlexa Sensitive Content Intelligence (ASCI) team is to (1) minimizenegative surprises to customers caused by sensitive content, (2)detect and prevent potential brand-damaging interactions, and (3)build customer trust through appropriate interactions on sensitivetopics. The term “sensitive content” includes within its scope awide range of categories of content such as offensive content(e.g., hate speech, racist speech), profanity, content that issuitable only for certain age groups, politically polarizingcontent, and religiously polarizing content. The term “content”refers to any material that is exposed to customers by Alexa(including both 1P and 3P experiences) and includes text, speech,audio, and video. BASIC QUALIFICATIONS - 3+ years of data scientistexperience - 3+ years of data querying languages (e.g. SQL),scripting languages (e.g. Python) or statistical/mathematicalsoftware (e.g. R, SAS, Matlab, etc.) experience - 3+ years ofmachine learning/statistical modeling data analysis tools andtechniques, and parameters that affect their performance experience- Experience applying theoretical models in an applied environment- Experience with big data: processing, filtering, and presentinglarge quantities (100K to Millions of rows) of data PREFERREDQUALIFICATIONS - Experience in Python, Perl, or another scriptinglanguage - Experience diving into data to discover hidden patternsand of conducting error/deviation analysis Our inclusive cultureempowers Amazonians to deliver the best results for our customers.If you have a disability and need a workplace accommodation oradjustment during the application and hiring process, includingsupport for the interview or onboarding process, please visit thislink for more information. #J-18808-Ljbffr