Английская Википедия:Databricks
Шаблон:Short description Шаблон:Infobox company
Databricks, Inc. is an American software company founded by the original creators of Apache Spark.[1] Databricks develops a web-based platform for working with Spark, that provides automated cluster management and IPython-style notebooks. The company develops Delta Lake, an open-source project to bring reliability to data lakes for machine learning and other data science use cases.[2]
History
Databricks grew out of the AMPLab project at University of California, Berkeley that was involved in making Apache Spark, an open-source distributed computing framework built atop Scala. The company was founded by Ali Ghodsi, Andy Konwinski, Arsalan Tavakoli-Shiraji, Ion Stoica, Matei Zaharia,[3] Patrick Wendell, and Reynold Xin.
In November 2017, the company was announced as a first-party service on Microsoft Azure via integration Azure Databricks.[4]
In June 2020, Databricks acquired Redash, an open-source tool designed to help data scientists and analysts visualize and build interactive dashboards of their data.[5]
In February 2021 together with Google Cloud, Databricks provided integration with the Google Kubernetes Engine and Google's BigQuery platform.[6] Fortune ranked Databricks as one of the best large "Workplaces for Millennials" in 2021.[7] At the time, the company said more than 5,000 organizations used its products.[8]
In August 2021, Databricks finished its eighth round of funding by raising $1.6 billion and valuing the company at $38 billion.[9]
In October 2021, Databricks made its second acquisition of German no-code company 8080 Labs. 8080 Labs makes bamboolib, a data exploration tool that does not require coding to use.[10]
In response to the popularity of OpenAI's ChatGPT, in March 2023, the company introduced an open-source language model, named Dolly after Dolly the sheep, that developers could use to create their own chatbots. Their model uses fewer parameters to produce similar results as ChatGPT, but Databricks had not released formal benchmark tests to show whether its bot actually matched the performance of ChatGPT.[11][12][13]
Databricks acquired data security startup Okera in May 2023 to extend its data governance capabilities.[14] The next month, it acquired an open-source generative AI startup MosaicML for $1.4Шаблон:Nbspbillion.[15][16]
In October 2023, Databricks acquired data replication startup Arcion for $100 million.[17]
Funding
In September 2013, Databricks announced it raised $13.9 million from Andreessen Horowitz and said it aimed to offer an alternative to Google's MapReduce system.[18][19] Microsoft was a noted investor of Databricks in 2019, participating in the company's Series E at an unspecified amount.[20][21] The company has raised $1.9 billion in funding, including a $1 billion Series G led by Franklin Templeton at a $28 billion post-money valuation in February 2021. Other investors include Amazon Web Services, CapitalG (a growth equity firm under Alphabet Inc.) and Salesforce Ventures.[8]
Series | Date | Amount (million $) | Lead Investors |
---|---|---|---|
A | 2013 | 13.9[18] | Andreessen Horowitz |
B | 2014 | 33[22] | New Enterprise Associates |
C | 2016 | 60[23] | New Enterprise Associates |
D | 2017 | 140[24] | Andreessen Horowitz |
E | Feb. 2019 | 250[25] | Andreessen Horowitz |
F | Oct. 2019 | 400[26] | Andreessen Horowitz |
G | Jan. 2021 | 1,000[27] | Franklin Templeton Investments |
H | Aug. 2021 | 1,600[28] | Morgan Stanley |
I | Sep. 2023 | 500[29] | Capital One Ventures, Nvidia |
Products
Databricks develops and sells a cloud data platform using the marketing term "lakehouse", a portmanteau based on the terms "data warehouse" and "data lake".[30] Databricks' lakehouse is based on the open source Apache Spark framework that allows analytical queries against semi-structured data without a traditional database schema.[31] In October 2022, Lakehouse received FedRAMP authorized status for use with the U.S. federal government and contractors.[32]
Databricks' Delta Engine launched in June 2020 as a new query engine that layers on top of Delta Lake to boost query performance.[33] It is compatible with Apache Spark and MLflow, which are also open source projects Databricks employees helped create.[34]
In November 2020, Databricks introduced Databricks SQL (previously known as SQL Analytics) for running business intelligence and analytics reporting on top of data lakes. Analysts can query data sets directly with standard SQL or use product connectors to integrate directly with business intelligence tools like Tableau, Qlik, SigmaComputing, Looker, and ThoughtSpot.[35]
Databricks offers a platform for other workloads, including machine learning, data storage and processing, streaming analytics, and business intelligence.[36]
The company has also created Delta Lake, MLflow and Koalas, open source projects that span data engineering, data science and machine learning.[37] In addition to building the Databricks platform, the company has co-organized massive open online courses about Spark[38] and a conference for the Spark community called the Data + AI Summit,[39] formerly known as Spark Summit.
Operations
Databricks is headquartered in San Francisco.[40] It also has operations in Canada, the United Kingdom, Netherlands, Singapore, Australia, Germany, France, Japan, China, South Korea, India, Brazil, Switzerland, Costa Rica and Serbia.[41]
References
- ↑ Шаблон:Cite news
- ↑ Шаблон:Cite web
- ↑ Шаблон:Cite web
- ↑ Шаблон:Cite webШаблон:Dead link
- ↑ Шаблон:Cite webШаблон:Dead link
- ↑ Шаблон:Cite webШаблон:Dead link
- ↑ Шаблон:Cite magazine
- ↑ 8,0 8,1 Шаблон:Cite web
- ↑ Шаблон:Cite news
- ↑ Шаблон:Cite web
- ↑ Шаблон:Cite news
- ↑ Шаблон:Cite news
- ↑ Шаблон:Cite news
- ↑ Шаблон:Cite web
- ↑ Шаблон:Cite news
- ↑ Шаблон:Cite news
- ↑ Шаблон:Cite web
- ↑ 18,0 18,1 Шаблон:Cite web
- ↑ Шаблон:Cite web
- ↑ Шаблон:Cite webШаблон:Dead link
- ↑ Шаблон:Cite web
- ↑ Шаблон:Cite web
- ↑ Шаблон:Cite web
- ↑ Шаблон:Cite web
- ↑ Шаблон:Cite web
- ↑ Шаблон:Cite webШаблон:Dead link
- ↑ Шаблон:Cite web
- ↑ Шаблон:Cite web
- ↑ Шаблон:Cite web
- ↑ Шаблон:Cite journal
- ↑ Шаблон:Cite web
- ↑ Шаблон:Cite news
- ↑ Шаблон:Cite web
- ↑ Шаблон:Cite web
- ↑ Шаблон:Cite webШаблон:Dead link
- ↑ Шаблон:Cite web
- ↑ Шаблон:Cite web
- ↑ Шаблон:Cite news
- ↑ Шаблон:Cite web
- ↑ Шаблон:Cite web
- ↑ Шаблон:Cite web
- Английская Википедия
- Страницы с неработающими файловыми ссылками
- Big data companies
- Companies based in San Francisco
- Free software companies
- Privately held companies based in California
- Software companies based in the San Francisco Bay Area
- Software companies established in 2013
- Software companies of the United States
- Страницы, где используется шаблон "Навигационная таблица/Телепорт"
- Страницы с телепортом
- Википедия
- Статья из Википедии
- Статья из Английской Википедии