Job Description
- Build customer-facing features which make Cloud Dataproc the best place to run Hadoop and Spark in the cloud.
- Drive technical design and execution for differentiated Performance and LakeHouse features and enhancements in an ambiguous problem space.
- Enhance Apache Spark for performance, reliability, security, and monitoring, and simultaneously enhance Lake House technologies like Iceberg, Hudi, or Delta Lake for performance, security, and monitoring.
- Contribute to and adapt existing documentation or educational content based on product and program updates, as well as user feedback, while also extending open-source technologies like Apache Spark, Hive, and Trino to improve their debuggability, observability, and supportability.
- Review code developed by other developers and provide feedback to ensure best practices (e.g., style guidelines, checking code in, accuracy, testability, and efficiency).
