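How to install XGBoost on OSX with multi-threading
I'm trying to install xgboost on my Mac (OSX 10.12.1) by following the installation guide, but I'm running into some problems.
Step 1
Obtain gcc-6.x.x with OpenMP support by running
brew install gcc --without-multilib
Terminal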
Ben$ brew install gcc --without-multilib
Error: gcc-5.3.0 already installed
To install this version, first `brew unlink gcc`
Ben$ brew unlink gcc
Unlinking /usr/local/Cellar/gcc/5.3.0... 1288 symlinks removed
Ben$ brew install gcc --without-multilib
[26 minutes later]
==> Summary
🍺  /usr/local/Cellar/gcc/6.2.0: 1,358 files, 238.3M, built in 26 minutes 20 seconds
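Before moving on, it can help to confirm that the freshly installed compiler really accepts -fopenmp. The sketch below is not part of the guide. It relies on the fact that compilers with OpenMP enabled predefine the _OPENMP macro, and has_openmp is a hypothetical helper name chosen for this example.

```shell
#!/bin/sh
# Return 0 if the given compiler defines _OPENMP when run with -fopenmp.
# has_openmp is a hypothetical helper, not something brew or xgboost provides.
has_openmp() {
    cc="$1"
    # Fail early if the compiler is not on PATH.
    command -v "$cc" >/dev/null 2>&1 || return 1
    # -dM -E dumps the predefined macros; _OPENMP only appears with OpenMP enabled.
    "$cc" -fopenmp -dM -E - </dev/null 2>/dev/null | grep -q '_OPENMP'
}

# gcc-6 is the compiler name from the brew log above; adjust to your version.
if has_openmp gcc-6; then
    echo "gcc-6 supports OpenMP"
else
    echo "gcc-6 is missing or lacks OpenMP support"
fi
```

A compiler that rejects -fopenmp produces no macro dump, so the check fails for it just as it does for a missing binary.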
Step 2
Clone the repository
git clone --recursive https://github.com/dmlc/xgboost
Terminal
Ben$ git clone --recursive https://github.com/dmlc/xgboost
Cloning into 'xgboost'...
remote: Counting objects: 18754, done.
remote: Compressing objects: 100% (21/21), done.
remote: Total 18754 (delta 1), reused 0 (delta 0), pack-reused 18733
Receiving objects: 100% (18754/18754), 6.23 MiB | 3.74 MiB/s, done.
Resolving deltas: 100% (11347/11347), done.
Checking connectivity... done.
Submodule 'dmlc-core' (https://github.com/dmlc/dmlc-core) registered for path 'dmlc-core'
Submodule 'rabit' (https://github.com/dmlc/rabit) registered for path 'rabit'
Cloning into '/Users/Ben/xgboost/dmlc-core'...
Cloning into '/Users/Ben/xgboost/rabit'...
Submodule path 'dmlc-core': checked out 'f35f14f30835af238257b979cc1fac3e41ff3291'
Submodule path 'rabit': checked out 'a9a2a69dc1144180a43f7d2d1097264482be7817'
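Step 3
Build using the following commands
Note: If you use OSX El Capitan, brew installs the latest version of gcc, gcc-6. You may therefore need to modify Makefile#L46 and change gcc-5 to gcc-6. After that, change gcc-5/g++-5 to gcc-6/g++-6 in make/config.mk, then build using the following commands:
cd xgboost; cp make/config.mk ./config.mk; make -j4
Hmm. Not entirely sure what to do here. In xgboost/ I see a file named "Makefile". Inside it I see a lot of stuff I don't understand, but these lines look relevant:
Makefile (snippet)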
# on Mac OS X, force brew gcc-6, since the Xcode c++ fails anyway
# it is useful for pip install compiling-on-the-fly
OS := $(shell uname)
ifeq ($(OS), Darwin)
export CC = $(if $(shell which gcc-6),gcc-6,$(if $(shell which gcc-mp-6), gcc-mp-6, clang))
export CXX = $(if $(shell which g++-6),g++-6,$(if $(shell which g++-mp-6),g++-mp-6, clang++))
endif
Going to ignore this and move on...
Terminal
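For readers who are equally unsure what the Darwin block in the Makefile snippet does: as far as I can tell, it picks the first compiler name it can find on the PATH and falls back to clang otherwise. Here is a rough shell equivalent, where pick_compiler is a hypothetical helper written only for illustration.

```shell
#!/bin/sh
# Print the first candidate found on PATH, or the fallback if none is found.
# Usage: pick_compiler FALLBACK CANDIDATE...
# (pick_compiler is a made-up name for this sketch, not part of xgboost.)
pick_compiler() {
    fallback="$1"
    shift
    for candidate in "$@"; do
        if command -v "$candidate" >/dev/null 2>&1; then
            echo "$candidate"
            return 0
        fi
    done
    echo "$fallback"
}

# Same preference order as the Makefile: brew gcc-6, then MacPorts gcc-mp-6, then clang.
CC=$(pick_compiler clang gcc-6 gcc-mp-6)
CXX=$(pick_compiler clang++ g++-6 g++-mp-6)
echo "CC=$CC CXX=$CXX"
```

If this reading is right, a machine with gcc-6 on the PATH needs no Makefile change, which matches the g++-6 invocation that appears later in the build log.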
Ben$ cd xgboost; cp make/config.mk ./config.mk; make -j4
[Tons of log output. Appears to finish successfully]
...
g++-6 -std=c++0x -Wall -O3 -msse2 -Wno-unknown-pragmas -funroll-loops -Iinclude -Idmlc-core/include -Irabit/include -fPIC -fopenmp -shared -o lib/libxgboost.so build/learner.o build/logging.o build/c_api/c_api.o build/c_api/c_api_error.o build/common/common.o build/data/data.o build/data/simple_csr_source.o build/data/simple_dmatrix.o build/data/sparse_page_dmatrix.o build/data/sparse_page_raw_format.o build/data/sparse_page_source.o build/data/sparse_page_writer.o build/gbm/gblinear.o build/gbm/gbm.o build/gbm/gbtree.o build/metric/elementwise_metric.o build/metric/metric.o build/metric/multiclass_metric.o build/metric/rank_metric.o build/objective/multiclass_obj.o build/objective/objective.o build/objective/rank_obj.o build/objective/regression_obj.o build/tree/tree_model.o build/tree/tree_updater.o build/tree/updater_colmaker.o build/tree/updater_histmaker.o build/tree/updater_prune.o build/tree/updater_refresh.o build/tree/updater_skmaker.o build/tree/updater_sync.o dmlc-core/libdmlc.a rabit/lib/librabit.a -pthread -lm -fopenmp
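Step 4 (instructions here?)
If you want to use the latest xgboost version and have already compiled xgboost, use library(devtools); install('xgboost/R-package') to install the xgboost package manually (change the path accordingly to where you compiled xgboost).
New R session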
library(devtools)
install("/Users/Ben/xgboost/R-package")
Installing xgboost
'/Library/Frameworks/R.framework/Resources/bin/R' --no-site-file --no-environ --no-save --no-restore --quiet CMD INSTALL
'/Users/Ben/xgboost/R-package' --library='/Library/Frameworks/R.framework/Versions/3.3/Resources/library' --install-tests
* installing *source* package ‘xgboost’ ...
** libs
g++-5 -std=c++11 -I/Library/Frameworks/R.framework/Resources/include -DNDEBUG -I../..//include -I../..//dmlc-core/include -I../..//rabit/include -I../../ -DXGBOOST_STRICT_R_MODE=1 -DDMLC_LOG_BEFORE_THROW=0 -DDMLC_ENABLE_STD_THREAD=1 -DDMLC_DISABLE_STDIN=1 -DDMLC_LOG_CUSTOMIZE=1 -DXGBOOST_CUSTOMIZE_LOGGER=1 -DRABIT_CUSTOMIZE_MSG_ -DRABIT_STRICT_CXX98_ -I/usr/local/opt/llvm/include -fopenmp -fPIC -Wall -mtune=core2 -g -O2 -c xgboost_R.cc -o xgboost_R.o
make: g++-5: No such file or directory
make: *** [xgboost_R.o] Error 1
ERROR: compilation failed for package ‘xgboost’
* removing ‘/Library/Frameworks/R.framework/Versions/3.3/Resources/library/xgboost’
* restoring previous ‘/Library/Frameworks/R.framework/Versions/3.3/Resources/library/xgboost’
Error: Command failed (1)
Fail :( Where did I go wrong?
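Solution
Updated solution
As of March 2019, with R version 3.5.3 and xgboost version 0.82.0.1, things have changed since my previous answer. The following steps are based on the installation guide, with slight differences.
brew install cmake or brew upgrade cmake (cmake --version shows 3.14.0 for me)
brew install gcc or brew upgrade gcc (note the version of gcc that gets installed; it should be located in /usr/local/bin. I got gcc 8.3.0, per gcc-8 --version)
cd some/unimportant/directory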
git clone --recursive https://github.com/dmlc/xgboost
cd xgboost/
mkdir build
cd build
CC=gcc-8 CXX=g++-8 cmake .. -DR_LIB=ON
make -j4
sudo make install
Now restart/refresh RStudio and xgboost should be installed.
Test in R
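The cmake line above hardcodes gcc-8 because that is what brew gave me. If you are not sure which version you have, something like the following sketch can find the highest gcc-N in a directory and print the matching cmake command. newest_gcc_major is a hypothetical helper, and /usr/local/bin is assumed to be where brew links its compilers.

```shell
#!/bin/sh
# Print the highest N for which DIR/gcc-N exists; return non-zero if none.
# (newest_gcc_major is a made-up helper name for this sketch.)
newest_gcc_major() {
    dir="$1"
    best=""
    for path in "$dir"/gcc-[0-9]*; do
        [ -e "$path" ] || continue        # the glob matched nothing
        name="${path##*/}"                # strip the directory part
        ver="${name#gcc-}"                # strip the "gcc-" prefix
        case "$ver" in
            *[!0-9]*) continue ;;         # skip anything that is not purely numeric
        esac
        if [ -z "$best" ] || [ "$ver" -gt "$best" ]; then
            best="$ver"
        fi
    done
    [ -n "$best" ] && echo "$best"
}

# Only print the command; run it yourself from xgboost/build once it looks right.
if major=$(newest_gcc_major /usr/local/bin); then
    echo "CC=gcc-$major CXX=g++-$major cmake .. -DR_LIB=ON"
else
    echo "no versioned gcc found in /usr/local/bin"
fi
```

The comparison is numeric, so gcc-10 ranks above gcc-8 even though it sorts earlier alphabetically.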
library(xgboost)
set.seed(222)
N <- 2*10^5
p <- 350
x <- matrix(rnorm(N * p), ncol = p)
y <- rnorm(N)
system.time(mymodel <- xgboost(
# nthread = 4,
data = x,
label = y,
nrounds = 5,
objective = "reg:linear",
tree_method = "exact",
max_depth = 10,
min_child_weight = 1,
eta = 1,
subsample = 0.66,
colsample_bytree = 0.33
))
# Tested on 2018 MBP, xgboost version 0.82.0.1, multi-threaded version
# nthread = default: 7.4 seconds (elapsed)
# nthread = 1: 24.0 seconds (elapsed)
# nthread = 2: 13.7 seconds (elapsed)
# nthread = 4: 7.5 seconds (elapsed)
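To put the timings above in perspective, here is a small sketch that turns them into speedup and parallel-efficiency figures relative to the single-threaded run. The numbers are copied from the comments above. The speedups function name and the output format are my own choices for this example.

```shell
#!/bin/sh
# Read "threads seconds" pairs on stdin and print the speedup versus the
# single-thread baseline given as $1, plus efficiency (speedup / threads).
speedups() {
    awk -v base="$1" '{
        printf "nthread=%s speedup=%.2f efficiency=%.0f%%\n",
               $1, base / $2, 100 * base / ($2 * $1)
    }'
}

# Elapsed seconds for nthread = 1, 2 and 4, taken from the test comments.
printf '%s\n' '1 24.0' '2 13.7' '4 7.5' | speedups 24.0
```

With these inputs, two threads come out at about 1.75x and four threads at 3.2x, so the build is clearly multi-threaded even though it does not scale perfectly.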
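Old answer
Amazingly, writing up a question on StackOverflow often leads you straight to the answer. After spending hours on this, I figured out that I needed to change these lines in ~/.R/Makevars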
CC=gcc-5
CXX=g++-5
CXX1X = g++-5
to
CC=gcc-6
CXX=g++-6
CXX1X = g++-6
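If you would rather script that edit than do it by hand, a sketch along these lines should work. bump_makevars is a hypothetical helper. It keeps a .bak copy and avoids sed -i, whose syntax differs between BSD and GNU sed. The demo runs on a temporary file instead of touching the real ~/.R/Makevars.

```shell
#!/bin/sh
# Replace gcc-5/g++-5 with gcc-6/g++-6 in FILE, keeping the original as FILE.bak.
# (bump_makevars is a made-up name for this sketch.)
bump_makevars() {
    file="$1"
    cp "$file" "$file.bak"
    # In basic regular expressions "+" is a literal character, so g++-5 matches as written.
    sed -e 's/gcc-5/gcc-6/g' -e 's/g++-5/g++-6/g' "$file.bak" > "$file"
}

# Demonstrate on a temporary file that holds the three lines from above.
tmp=$(mktemp)
printf '%s\n' 'CC=gcc-5' 'CXX=g++-5' 'CXX1X = g++-5' > "$tmp"
bump_makevars "$tmp"
cat "$tmp"
```

To apply it for real, you would call bump_makevars "$HOME/.R/Makevars" and then check the result before reinstalling the package.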
Additionally, I ended up installing xgboost from the "drat" repository
install.packages("drat", repos="https://cran.rstudio.com")
drat:::addRepo("dmlc")
install.packages("xgboost", repos="http://dmlc.ml/drat/", type = "source")