Gengliang Wang 395860a986 [SPARK-24768][SQL] Have a built-in AVRO data source implementation

## What changes were proposed in this pull request?

Apache Avro (https://avro.apache.org) is a popular data serialization format. It is widely used in the Spark and Hadoop ecosystem, especially for Kafka-based data pipelines. Using the external package https://github.com/databricks/spark-avro, Spark SQL can read and write the avro data. Making spark-Avro built-in can provide a better experience for first-time users of Spark SQL and structured streaming. We expect the built-in Avro data source can further improve the adoption of structured streaming.

The proposal is to inline code from spark-avro package (https://github.com/databricks/spark-avro). The target release is Spark 2.4.

[Built-in AVRO Data Source In Spark 2.4.pdf](https://github.com/apache/spark/files/2181511/Built-in.AVRO.Data.Source.In.Spark.2.4.pdf)

## How was this patch tested?

Unit test

Author: Gengliang Wang <gengliang.wang@databricks.com>

Closes #21742 from gengliangwang/export_avro.

2018-07-12 13:55:25 -07:00

create-release	[SPARK-23698] Remove raw_input() from Python 2	2018-07-04 09:40:58 +08:00
deps	[SPARK-24420][BUILD] Upgrade ASM to 6.1 to support JDK9+	2018-07-03 10:13:48 -07:00
sparktestsupport	[SPARK-24768][SQL] Have a built-in AVRO data source implementation	2018-07-12 13:55:25 -07:00
tests	[MINOR] Fix typos in dev/* scripts.	2018-01-31 07:37:25 +09:00
.gitignore	[SPARK-23174][BUILD][PYTHON][FOLLOWUP] Add pycodestyle*.py to .gitignore file.	2018-01-31 00:51:00 +09:00
.rat-excludes	[SPARK-24654][BUILD] Update, fix LICENSE and NOTICE, and specialize for source vs binary	2018-06-30 19:27:16 -05:00
appveyor-guide.md	[MINOR] Fix typos in dev/* scripts.	2018-01-31 07:37:25 +09:00
appveyor-install-dependencies.ps1	[MINOR][BUILD] Download RAT and R version info over HTTPS; use RAT 0.12	2017-08-12 14:31:05 +09:00
change-scala-version.sh	[SPARK-19810][BUILD][CORE] Remove support for Scala 2.10	2017-07-13 17:06:24 +08:00
check-license	[SPARK-22511][BUILD] Update maven central repo address	2017-11-14 17:58:07 -06:00
checkstyle-suppressions.xml	[HOTFIX][BUILD] Fix finalizer checkstyle error and re-disable checkstyle	2017-09-27 13:40:21 -07:00
checkstyle.xml	[HOTFIX][BUILD] Fix finalizer checkstyle error and re-disable checkstyle	2017-09-27 13:40:21 -07:00
github_jira_sync.py	[MINOR] Fix a bunch of typos	2018-01-02 07:10:19 +09:00
lint-java	[SPARK-23063][K8S] K8s changes for publishing scripts (and a couple of other misses)	2018-01-13 21:34:28 -08:00
lint-python	[MINOR] Fix typos in dev/* scripts.	2018-01-31 07:37:25 +09:00
lint-r	[SPARK-10328] [SPARKR] Fix generic for na.omit	2015-08-28 00:37:50 -07:00
lint-r.R	[SPARK-22063][R] Fixes lint check failures in R by latest commit sha1 ID of lint-r	2017-10-01 18:42:45 +09:00
lint-scala	[SPARK-2627] [PySpark] have the build enforce PEP 8 automatically	2014-08-06 12:58:24 -07:00
make-distribution.sh	[SPARK-24654][BUILD] Update, fix LICENSE and NOTICE, and specialize for source vs binary	2018-06-30 19:27:16 -05:00
merge_spark_pr.py	[SPARK-23698] Remove raw_input() from Python 2	2018-07-04 09:40:58 +08:00
mima	[SPARK-23063][K8S] K8s changes for publishing scripts (and a couple of other misses)	2018-01-13 21:34:28 -08:00
pip-sanity-check.py	[SPARK-19064][PYSPARK] Fix pip installing of sub components	2017-01-25 14:43:39 -08:00
README.md	Merge pull request #565 from pwendell/dev-scripts. Closes #565 .	2014-02-08 23:13:34 -08:00
requirements.txt	[MINOR] Add Sphinx into dev/requirements.txt	2018-07-10 13:54:04 +08:00
run-pip-tests	[PYSPARK] Update py4j to version 0.10.7.	2018-05-09 10:47:35 -07:00
run-tests	[SPARK-22302][INFRA] Remove manual backports for subprocess and print explicit message for < Python 2.7	2017-10-22 02:22:35 +09:00
run-tests-jenkins	[MINOR] Fix typos in dev/* scripts.	2018-01-31 07:37:25 +09:00
run-tests-jenkins.py	[SPARK-23028] Bump master branch version to 2.4.0-SNAPSHOT	2018-01-13 00:37:59 +08:00
run-tests.py	[SPARK-24768][SQL] Have a built-in AVRO data source implementation	2018-07-12 13:55:25 -07:00
sbt-checkstyle	[SPARK-22269][BUILD] Run Java linter via SBT for Jenkins	2018-05-24 14:19:32 +08:00
scalastyle	[SPARK-23063][K8S] K8s changes for publishing scripts (and a couple of other misses)	2018-01-13 21:34:28 -08:00
test-dependencies.sh	[SPARK-23807][BUILD] Add Hadoop 3.1 profile with relevant POM fix ups	2018-04-24 09:57:09 -07:00
tox.ini	[SPARK-23010][K8S] Initial checkin of k8s integration tests.	2018-06-08 15:15:24 -07:00

README.md

Spark Developer Scripts

This directory contains scripts useful to developers when packaging, testing, or committing to Spark.

Many of these scripts require Apache credentials to work correctly.