7892f88f84
### What changes were proposed in this pull request? This PR makes the following refinements to the workflow for building docs: * Install Python and Ruby consistently using pyenv and rbenv across both the docs README and the release Dockerfile. * Pin the Python and Ruby versions we use. * Pin all direct Python and Ruby dependency versions. * Eliminate any use of `sudo pip`, which the Python community discourages, or `sudo gem`. ### Why are the changes needed? This PR should increase the consistency and reproducibility of the doc-building process by managing Python and Ruby in a more consistent way, and by eliminating unused or outdated code. Here's a possible example of an issue building the docs that would be addressed by the changes in this PR: https://github.com/apache/spark/pull/27459#discussion_r376135719 ### Does this PR introduce any user-facing change? No. ### How was this patch tested? Manual tests: * I was able to build the Docker image successfully, minus the final part about `RUN useradd`. * I am unable to run `do-release-docker.sh` because I am not a committer and don't have the required GPG key. * I built the docs locally and viewed them in the browser. I think I need a committer to more fully test out these changes. Closes #27534 from nchammas/SPARK-30731-building-docs. Authored-by: Nicholas Chammas <nicholas.chammas@liveramp.com> Signed-off-by: Sean Owen <srowen@gmail.com>
104 lines
1.4 KiB
Plaintext
104 lines
1.4 KiB
Plaintext
*#*#
|
|
*.#*
|
|
*.iml
|
|
*.ipr
|
|
*.iws
|
|
*.pyc
|
|
*.pyo
|
|
*.swp
|
|
*~
|
|
.DS_Store
|
|
.cache
|
|
.classpath
|
|
.ensime
|
|
.ensime_cache/
|
|
.ensime_lucene
|
|
.generated-mima*
|
|
.idea/
|
|
.idea_modules/
|
|
.project
|
|
.pydevproject
|
|
.python-version
|
|
.ruby-version
|
|
.scala_dependencies
|
|
.settings
|
|
/lib/
|
|
R-unit-tests.log
|
|
R/unit-tests.out
|
|
R/cran-check.out
|
|
R/pkg/vignettes/sparkr-vignettes.html
|
|
R/pkg/tests/fulltests/Rplots.pdf
|
|
build/*.jar
|
|
build/apache-maven*
|
|
build/scala*
|
|
build/zinc*
|
|
cache
|
|
checkpoint
|
|
conf/*.cmd
|
|
conf/*.conf
|
|
conf/*.properties
|
|
conf/*.sh
|
|
conf/*.xml
|
|
conf/java-opts
|
|
conf/slaves
|
|
dependency-reduced-pom.xml
|
|
derby.log
|
|
dev/create-release/*final
|
|
dev/create-release/*txt
|
|
dev/pr-deps/
|
|
dist/
|
|
docs/_site/
|
|
docs/api
|
|
sql/docs
|
|
sql/site
|
|
lib_managed/
|
|
lint-r-report.log
|
|
log/
|
|
logs/
|
|
out/
|
|
project/boot/
|
|
project/build/target/
|
|
project/plugins/lib_managed/
|
|
project/plugins/project/build.properties
|
|
project/plugins/src_managed/
|
|
project/plugins/target/
|
|
python/lib/pyspark.zip
|
|
python/.eggs/
|
|
python/deps
|
|
python/docs/_site/
|
|
python/test_coverage/coverage_data
|
|
python/test_coverage/htmlcov
|
|
python/pyspark/python
|
|
reports/
|
|
scalastyle-on-compile.generated.xml
|
|
scalastyle-output.xml
|
|
scalastyle.txt
|
|
spark-*-bin-*.tgz
|
|
spark-tests.log
|
|
src_managed/
|
|
streaming-tests.log
|
|
target/
|
|
unit-tests.log
|
|
work/
|
|
docs/.jekyll-metadata
|
|
|
|
# For Hive
|
|
TempStatsStore/
|
|
metastore/
|
|
metastore_db/
|
|
sql/hive-thriftserver/test_warehouses
|
|
warehouse/
|
|
spark-warehouse/
|
|
|
|
# For R session data
|
|
.RData
|
|
.RHistory
|
|
.Rhistory
|
|
*.Rproj
|
|
*.Rproj.*
|
|
|
|
.Rproj.user
|
|
|
|
# For SBT
|
|
.jvmopts
|