Character’s mission is to empower everyone with AGI. Our vision is to enable people with our technology so that they can use Character.AI any moment of any day.
Character.AI is one of the world’s leading personal AI platforms. Founded in 2021 by AI pioneers Noam Shazeer and Daniel De Freitas, Character.AI is a full-stack AI company with a globally scaled direct-to-consumer platform. As of 2023 that platform was #2 in the space in user engagement. Character.AI is uniquely centered around people, letting users personalize their experience by interacting with AI “Characters.” The company achieved unicorn status in 2023 and was named Google Play’s AI App of the Year.
Noam co-invented the key tech powering LLMs and was recently named to TIME100’s Most Influential People in AI list. TIME called him “one of the most important and impactful people of the space’s past, present, and future.” Daniel created and led LaMDA, the breakthrough conversational tech project currently powering Bard.
To learn more, please visit beta.character.ai.
The data platform team at Character seeks to accomplish the following:
Provide high quality data for analytics and model training
Remove the accidental complexity of Data Engineering tasks to empower ML Researchers, SWEs, and Data Scientists to move as quickly and confidently as possible.
Make data go vroom while gpus go brrr!
Responsibilities:
Make our Data Warehouse, Data Lake, and ML Research pipelines a joy to use with your infrastructure expertise, knowledge of big data systems, and keen eye for organization.
Build internal systems for our Product and ML Research teams to manage datasets, run large-scale inference jobs, manage crowdworker data, and anything else under the DataOps umbrella.
Make the most of vast compute resources to enable rapid development of AI products.
Help manage and scale our Kubernetes deployments for training, data engineering, and data acquisition.
Ensure our data platform can scale reliably to the next several orders of magnitude
Qualifications:
Expertise with Data Platform architecture and big data tools (Spark, Apache Beam, Ray, PubSub/Kafka, etc).
Solid understanding of Spark and ability to write, debug and optimize Spark code. Apache Spark on K8s a plus.
Experience building production systems and data pipelines in languages like Python, SQL, Go, Java, Scala
5+ years experience with Kubernetes. Experience as a Cluster Administrator a plus.
Experience with production cloud networking, service meshes (istio), CNIs (cilium, eBPF), and managing contention in shared clusters a plus.
Character is an equal opportunity employer and does not discriminate on the basis of race, religion, national origin, gender, sexual orientation, age, veteran status, disability or any other legally protected status. We value diversity and encourage applicants from a range of backgrounds to apply.