#ml #pipeline #ml_pipeline #gpu #hadoop #hive #apache_spark #apache_airflow #tensorflow #apache_kafka #pytorch
https://www.youtube.com/watch?v=neb1C6JlEXc
https://www.youtube.com/watch?v=neb1C6JlEXc
YouTube
ROCm and Distributed Deep Learning on Spark and TensorFlowJim Dowling Logical Clocks AB,Ajit Mathews
ROCm, the Radeon Open Ecosystem, is an open-source software foundation for GPU computing on Linux. ROCm supports TensorFlow and PyTorch using MIOpen, a library of highly optimized GPU routines for deep learning. In this talk, we describe how Apache Spark…
#kubernetes #apache_spark #tensorflow #kubeflow #apache_arrow
#multilanguage_pipeline #pipeline
https://www.youtube.com/watch?v=dXPvlocXo34
#multilanguage_pipeline #pipeline
https://www.youtube.com/watch?v=dXPvlocXo34
YouTube
Accelerating Tensorflow with Apache Arrow on Spark (Holden Karau)
Holden, an open source developer advocate at Google, discusses how the Apache Arrow is new in Spark 2.3, and offers faster interchange between Spark and Python. Apache Arrow also has connections to Tensorflow (and even without those can be fed from Pandas).…
#kubernetes #apache_spark #tensorflow #kubeflow #apache_arrow
https://www.youtube.com/watch?v=jdBbFSghM2s
https://www.youtube.com/watch?v=jdBbFSghM2s
YouTube
Building Cross-Cloud ML Pipelines with Kubeflow with Spark & Tensorflow - Holden Karau
Join us for Kubernetes Forums Seoul, Sydney, Bengaluru and Delhi - learn more at kubecon.io
Don't miss KubeCon + CloudNativeCon 2020 events in Amsterdam March 30 - April 2, Shanghai July 28-30 and Boston November 17-20! Learn more at kubecon.io. The conference…
Don't miss KubeCon + CloudNativeCon 2020 events in Amsterdam March 30 - April 2, Shanghai July 28-30 and Boston November 17-20! Learn more at kubecon.io. The conference…
#google #team #code_review #podcasts #scala #apache_spark #python #databricks #team
https://www.twitch.tv/holdenkarau
https://www.twitch.tv/holdenkarau
Twitch
holdenkarau - Twitch
Holden is a transgender Canadian open source developer with a focus on Apache Spark and related "big data" tools. She is the co-author of Learning Spark, High Performance Spark, and Kubeflow for Machine Learning. She is a committer and PMC on Apache Spark
#apache_spark #kubernetes #spark_operator
https://www.zdnet.com/article/google-announces-kubernetes-operator-for-apache-spark/
https://www.zdnet.com/article/google-announces-kubernetes-operator-for-apache-spark/
ZDNet
Google announces Kubernetes Operator for Apache Spark
The beta release of "Spark Operator" allows native execution of Spark applications on Kubernetes clusters -- no Hadoop or Mesos required.
#ml #apache_beam #tfx #tensorflow #airflow #apache_flink #python #java #scala #team #google #apache_spark #dataflow
https://www.youtube.com/watch?v=6p8UXjNg1oc
https://www.youtube.com/watch?v=6p8UXjNg1oc
YouTube
Apache Beam for Production Machine Learning: TensorFlow Extended (Beam Summit Europe 2019)
Developing ML and deep learning applications to be deployed in production is much more than just training a model. Google has taken years of experience in de...
#ml #apache_beam #tfx #tensorflow #airflow #apache_flink #python #java #scala #team #google #apache_spark #dataflow #pipeline
https://www.youtube.com/watch?v=v1DrnY8caVU
https://www.youtube.com/watch?v=v1DrnY8caVU
YouTube
End-to-End ML pipelines with Beam, Flink, TensorFlow, and Hopsworks (Beam Summit Europe 2019)
Apache Beam is a key technology for building scalable End-to-End ML pipelines, as it is the data preparation and model analysis engine for TensorFlow Extended (TFX), a framework for horizontally scalable Machine Learning (ML) pipelines based on TensorFlow.…
#apache_spark #kubernetes #operator #monitoring #deployment #team #lightbend
https://www.lightbend.com/blog/how-to-manage-monitor-spark-on-kubernetes-introduction-spark-submit-kubernetes-operator
https://www.lightbend.com/blog/how-to-manage-monitor-spark-on-kubernetes-introduction-spark-submit-kubernetes-operator
Lightbend
How To Manage And Monitor Apache Spark On Kubernetes - Part 1: Spark-Submit VS Kubernetes Operator | @lightbend
In this two-part blog series, we introduce the concepts and benefits of working with both spark-submit and the Kubernetes Operator for Spark. In Part 1, we introduce both tools and review how to get started monitoring and managing your Spark clusters on Kubernetes.…
#python #scala #jupyter #notebook #kernel #apache_spark
https://medium.com/@bogdan.cojocar/how-to-run-scala-and-spark-in-the-jupyter-notebook-328a80090b3b
https://medium.com/@bogdan.cojocar/how-to-run-scala-and-spark-in-the-jupyter-notebook-328a80090b3b
Medium
How to run Scala and Spark in the Jupyter notebook
The Jupyter notebook is one of the most used tools in data science projects. It’s a great tool for developing software in python and has…
#apache_spark #apache_spark3 #spark3 #overview #delta_lake #koalas
https://www.youtube.com/watch?v=scM_WQMhB3A
https://www.youtube.com/watch?v=scM_WQMhB3A
YouTube
New Developments in the Open Source Ecosystem: Apache Spark 3 0, Delta Lake, and Koalas
In this talk, we will highlight major efforts happening in the Spark ecosystem. In particular, we will dive into the details of adaptive and static query optimizations in Spark 3.0 to make Spark easier to use and faster to run. We will also demonstrate how…
#apache_spark #from #apache_sparkML #ml #to #tensorflow #kuberflow #holden #team #google #demo #databricks #team #spark_operator #demo
https://www.youtube.com/watch?v=0P5WO8f8qJg
https://www.youtube.com/watch?v=0P5WO8f8qJg
YouTube
Migrating Apache Spark ML Jobs to Spark + Tensorflow on Kubeflow - Holden Karau (Independent)
This talk will take an two existing Spark ML pipeline (Frank The Unicorn, for predicting PR comments (Scala) - https://github.com/franktheunicorn/predict-pr-...
#scala #tensorflow #python #apache_spark #spark #gpu
https://portal.klewel.com/watch/webcast/scala-days-2019/talk/37/
https://portal.klewel.com/watch/webcast/scala-days-2019/talk/37/
Klewel
Flare & Lantern: Accelerators for Spark and Deep Learning
Frameworks like Spark and TensorFlow have commoditized cluster computing and training of neural networks. However, they leave precious performance on the table, especially when used together. Flare is a new back-end for Spark SQL that yields significant speedups…
#serverless #serverless_ETL #ETL #graalVM #apache_spark #spark #kubernetes
https://www.youtube.com/watch?v=kDzsv6brQjo
https://www.youtube.com/watch?v=kDzsv6brQjo
YouTube
Scale By The Bay 2019: Rose Toomey, Moonshot Spark: serverless with GraalVM
Can Apache Spark slip its earthly bounds and go serverless, clusterless? Popular cloud services are becoming more capable. AWS Lamba now runs three times lon...