研究

MLlib: Apache Spark中的機器學習

作者:Xiangrui孟,Joseph Bradley, Burak Yavuz, Evan Sparks, Shivaram Venkataraman, Davies Liu, Jeremy Freeman, DB Tsai, Manish Amde, Sean Owen, Doris Xin, Reynold Xin, Michael J. Franklin, Reza Zadeh, Matei Zaharia, Ameet Talwalkar

下載論文

摘要

Apache Spark是一個流行的用於大規模數據處理的開源平台,非常適合迭Beplay体育安卓版本代機器學習任務。在本文中,我們介紹了Spark的開源分布式機器學習庫MLlib。MLlib為廣泛的學習設置提供了有效的功能,並包括一些基礎的統計、優化和線性代數原語。隨Spark一起發布的MLlib支持多種語言,並提供了高級API,利用Spark豐富的生態係統簡化端到端機器學習管道的開發。由於其活躍的開源社區有超過140個貢獻者,MLlib經曆了快速增長,並包括廣泛的文檔來支持進一步的增長,並讓用戶快速跟上速度。

相關內容

作者:Andrew Chen, Andy Chow, Aaron Davidson, Arjun DCunha, Ali Ghodsi, Sue Ann Hong, Andy Konwinski, Clemens Mewald, Siddharth Murching, Tomas Nykodym, Paul Ogilvie, Mani Parkhe, Avesh Singh, Fen Xie, Matei Zaharia, Richard Zang, Juntai Zheng, Corey Zumar, Databricks, Inc。

作者:Matei Zaharia, Andrew Chen, Aaron Davidson, Ali Ghodsi, Sue Ann Hong, Andy Konwinski, Siddharth Murching, Tomas Nykodym, Paul Ogilvie, Mani Parkhe, Fen Xie, Corey Zumar, Databricks Inc。

作者:Philipp Moritz, Robert Nishihara, Stephanie Wang, Alexey Tumanov, Richard Liaw, Eric Liang, Melih Elibol,楊宗恒,William Paul, Michael I. Jordan, Ion Stoica, UC Berkeley

作者:Roy Fox, Richard Shin, Sanjay Krishnan, Ken Goldberg, Dawn Song, Ion Stoica

作者:Firas Abuzaid, Joseph Bradley, Feynman Liang, Andrew Feng, Lee Yang, Matei Zaharia, Ameet Talwalkar

作者:Cody Coleman, Deepak Narayanan, Daniel Kang, Zhao Tian, Zhang Jian, Luigi Nardi, Peter Bailis, Kunle Olukotun, Chris Ré, Matei Zaharia

作者:Daniel Crankshaw, Wang Xin, Giulio Zhou, Michael J. Franklin, Joseph E. Gonzalez, Ion Stoica

作者:Reza Bosagh Zadeh, Xiangrui孟,Alexander Ulanov, Burak Yavuz, Li Pu, Shivaram Venkataraman, Evan Sparks, Aaron Staple, Matei Zaharia

作者:Eric Liang, Richard Liaw, Philipp Moritz, Robert Nishihara, Roy Fox, Ken Goldberg, Joseph E. Gonzalez, Michael I. Jordan, Ion Stoica

Baidu
map