Sphinx (3): Compiling and Installing sphinx-for-chinese on CentOS 7

2022-03-21 00:00:00

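The official Coreseek website can no longer be reached, and it is very likely that nobody will maintain the project from here on. That makes it a poor choice for the long term.

So my blog uses sphinx-for-chinese instead, which is developed on top of Sphinx 2.3.1.

Download address: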

gitee.com/mz/sphinx

Enter the source directory:

cd /usr/local/download/sphinx

Install the dependency:

yum install mysql-devel

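Configure the build. One note first: my MySQL (MariaDB) was itself compiled from source, and all of its files were installed under /usr/local/mariadb. If your MySQL was not compiled from source, the configure command below may not work for you as written.

./configure --prefix=/usr/local/sphinxforchinese --with-mysql=/usr/local/mariadb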
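If your MySQL or MariaDB lives somewhere else, it can help to confirm where the header and the client library actually are before running configure. The following is only a sketch: `find_mysql_paths` is a hypothetical helper name, and the file name patterns it searches for are assumptions about a typical install.

```shell
#!/usr/bin/env bash
# Hypothetical helper: report where mysql.h and the client library live under a prefix.
# Usage: find_mysql_paths /usr/local/mariadb

find_mysql_paths() {
    local prefix="$1"
    local header lib
    # mysql.h may sit in include/ or include/mysql/ depending on the build.
    header=$(find "$prefix/include" -name mysql.h 2>/dev/null | head -n 1)
    # The client library is usually named libmysqlclient or libmariadb (assumption).
    lib=$(find "$prefix/lib" "$prefix/lib64" \
        \( -name 'libmysqlclient.so*' -o -name 'libmariadb.so*' \) 2>/dev/null | head -n 1)
    if [ -z "$header" ] || [ -z "$lib" ]; then
        echo "missing"
        return 1
    fi
    echo "includes=$(dirname "$header") libs=$(dirname "$lib")"
}
```

If the two directories do not share one prefix, Sphinx's configure script also documents `--with-mysql-includes` and `--with-mysql-libs`; check `./configure --help` in your copy of the source to confirm they are available.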

Build and install:

make && make install

The installation succeeded.

1: Configure Chinese support:

(1): Copy the Chinese dictionary

cp /usr/local/download/sphinx/xdict_1.1.txt /usr/local/sphinxforchinese/xdict_1.1.txt

(2): Enter the Sphinx installation directory

cd /usr/local/sphinxforchinese

(3): Build the dictionary

/usr/local/sphinxforchinese/bin/mkdict xdict_1.1.txt etc/xdict

On success it prints:

[root@iZuf60ynur81p6k0ysvtneZ sphinxforchinese]# /usr/local/sphinxforchinese/bin/mkdict  xdict_1.1.txt  etc/xdict
Preparing...
Making Chinese dictionary:   |******************************|
Total words:                 284757
File size:                2854912 bytes
Compression ratio:              100 %
Chinese dictionary was successfully created!

2: Start searchd:

/usr/local/sphinxforchinese/bin/searchd

Error:

/usr/local/sphinxforchinese/bin/searchd: error while loading shared libraries: libmariadb.so.3: cannot open shared object file: No such file or directory

Fix:

ln -s /usr/local/mariadb/lib/libmariadb.so.3 /usr/lib64/libmariadb.so.3 # create a symlink
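To see up front which shared libraries a binary cannot resolve, the output of `ldd` can be filtered for "not found" entries. This is a small sketch; `missing_libs` is a hypothetical helper name, and it assumes the usual `name => not found` format that glibc's `ldd` prints.

```shell
#!/usr/bin/env bash
# Hypothetical helper: read `ldd` output on stdin and print unresolved library names.
# Usage: ldd /usr/local/sphinxforchinese/bin/searchd | missing_libs

missing_libs() {
    # Unresolved entries look like "libmariadb.so.3 => not found"; print the first field.
    awk '/=> not found/ { print $1 }'
}
```

An alternative to the symlink is to list /usr/local/mariadb/lib in a file under /etc/ld.so.conf.d/ and run `ldconfig`, which makes every library in that directory visible to the loader.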

3: Start it again:

/usr/local/sphinxforchinese/bin/searchd

Error:

[root@iZuf60ynur81p6k0ysvtneZ sphinx]# /usr/local/sphinxforchinese/bin/searchd
Sphinx 2.2.9-id64-release (r5006)
Copyright (c) 2001-2015, Andrew Aksyonoff
Copyright (c) 2008-2015, Sphinx Technologies Inc (http://sphinxsearch.com)
  
FATAL: no readable config file (looked in /usr/local/sphinxforchinese/etc/sphinx.conf, ./sphinx.conf).

There is no config file. Fix:

cp /usr/local/sphinxforchinese/etc/sphinx-min.conf.dist /usr/local/sphinxforchinese/etc/sphinx.conf

4: Start it again:

/usr/local/sphinxforchinese/bin/searchd

This time searchd starts, but reports a warning:

[root@iZuf60ynur81p6k0ysvtneZ sphinx]# /usr/local/sphinxforchinese/bin/searchd
Sphinx 2.2.9-id64-release (r5006)
Copyright (c) 2001-2015, Andrew Aksyonoff
Copyright (c) 2008-2015, Sphinx Technologies Inc (http://sphinxsearch.com)
  
using config file './sphinx.conf'...
listening on all interfaces, port=9312
listening on all interfaces, port=9306
precaching index 'test1'
WARNING: index 'test1': preload: failed to open /usr/local/sphinxforchinese/var/data/test1.sph: No such file or directory; NOT SERVING
precaching index 'testrt'
precached 2 indexes in 0.000 sec

It cannot find the index files, because no index has been built yet.

Let's edit the sphinx.conf file we just copied:

#
# Minimal Sphinx configuration sample (clean, simple, functional)
#
  
source src1
{
    # your database connection settings
    type            = mysql
  
    sql_host        = localhost
    sql_user        = mysql
    sql_pass        =
    sql_db          = test
    sql_port        = 3306  # optional, default is 3306
  
    # data source query
    sql_query       = \
        SELECT id, group_id, UNIX_TIMESTAMP(date_added) AS date_added, title, content \
        FROM documents
    # attribute declarations
    # unsigned integer attribute
    sql_attr_uint       = group_id
    # timestamp attribute
    sql_attr_timestamp  = date_added
}
  
index test1
{
    source          = src1
    # make sure this directory exists; create it if it does not
    path            = /usr/local/sphinxforchinese/var/data/test1
    # path to the Chinese dictionary
    chinese_dictionary = /usr/local/sphinxforchinese/etc/xdict
}
  
index testrt
{
    type            = rt
    rt_mem_limit    = 128M
    # make sure this directory exists; create it if it does not
    path            = /usr/local/sphinxforchinese/var/data/testrt
  
    rt_field        = title
    rt_field        = content
    rt_attr_uint    = gid
}
  
indexer
{
    mem_limit       = 128M
}
  
searchd
{
    listen          = 9312
    listen          = 9306:mysql41
    # make sure this directory exists; create it if it does not
    log             = /usr/local/sphinxforchinese/var/log/searchd.log
    # make sure this directory exists; create it if it does not
    query_log       = /usr/local/sphinxforchinese/var/log/query.log
    read_timeout    = 5
    max_children    = 30
    # make sure this directory exists; create it if it does not
    pid_file        = /usr/local/sphinxforchinese/var/log/searchd.pid
    seamless_rotate = 1
    preopen_indexes = 1
    unlink_old      = 1
    workers         = threads # for RT to work
    # make sure this directory exists; create it if it does not
    binlog_path     = /usr/local/sphinxforchinese/var/data
}
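The config above repeatedly asks you to make sure the data and log directories exist. A minimal sketch of that step follows; `ensure_sphinx_dirs` is a hypothetical helper name, and the default prefix is simply the install path used in this article.

```shell
#!/usr/bin/env bash
# Hypothetical helper: create the directories that sphinx.conf points at.
# Usage: ensure_sphinx_dirs            (uses the default prefix)
#        ensure_sphinx_dirs /some/prefix

ensure_sphinx_dirs() {
    local prefix="${1:-/usr/local/sphinxforchinese}"
    # var/data holds the index files and binlog; var/log holds logs and the pid file.
    mkdir -p "$prefix/var/data" "$prefix/var/log"
}
```

`mkdir -p` does nothing when a directory already exists, so the helper is safe to run more than once.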

Import example.sql from /usr/local/sphinxforchinese/etc (under the installation directory) into the database:

# use the test database

MariaDB [(none)]> use test;
Database changed
MariaDB [test]> source /usr/local/sphinxforchinese/etc/example.sql
Query OK, 0 rows affected, 1 warning (0.005 sec)
  
Query OK, 0 rows affected (0.010 sec)
  
Query OK, 4 rows affected (0.003 sec)
Records: 4  Duplicates: 0  Warnings: 0
  
Query OK, 0 rows affected, 1 warning (0.002 sec)
  
Query OK, 0 rows affected (0.014 sec)
  
Query OK, 10 rows affected (0.002 sec)
Records: 10  Duplicates: 0  Warnings: 0

Build the indexes:

/usr/local/sphinxforchinese/bin/indexer -c /usr/local/sphinxforchinese/etc/sphinx.conf --all --rotate

Output on success:

Sphinx 2.2.9-id64-release (r5006)
Copyright (c) 2001-2015, Andrew Aksyonoff
Copyright (c) 2008-2015, Sphinx Technologies Inc (http://sphinxsearch.com)
  
using config file '/usr/local/sphinxforchinese/etc/sphinx.conf'...
indexing index 'test1'...
collected 4 docs, 0.0 MB
sorted 0.0 Mhits, 100.0% done
total 4 docs, 193 bytes
total 0.003 sec, 50168 bytes/sec, 1039.77 docs/sec
skipping non-plain index 'testrt'...
total 4 reads, 0.000 sec, 0.1 kb/call avg, 0.0 msec/call avg
total 12 writes, 0.000 sec, 0.1 kb/call avg, 0.0 msec/call avg
rotating indices: successfully sent SIGHUP to searchd (pid=4092).
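The earlier warning complained about a missing test1.sph, so one quick check after indexing is whether that header file now exists. This is a sketch only: `index_built` is a hypothetical helper name, and it assumes that an index built with `--rotate` is first written as `.new.sph` until searchd picks it up.

```shell
#!/usr/bin/env bash
# Hypothetical helper: succeed if the header file of a plain index exists.
# Usage: index_built /usr/local/sphinxforchinese/var/data/test1 && echo "index present"

index_built() {
    local path="$1"
    # With --rotate, the fresh copy may still be named <path>.new.sph (assumption).
    [ -f "$path.sph" ] || [ -f "$path.new.sph" ]
}
```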

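Testing with PHP:

The sample PHP files are in the api directory of the downloaded package (/usr/local/download/sphinx/api).

I tested with plain PHP here, so you need to put sphinxapi.php somewhere your script can include it from.

index.php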

<?php
// Sample search page: ask Sphinx for matching IDs, then load the rows from MariaDB.

require("/usr/local/nginx/html/sphinxapi.php");
$sphinx = new SphinxClient();
$sphinx->setServer('127.0.0.1', 9312);
$keyword = 'test';  // keyword to search for
$index = 'test1';   // index name
// Look up the primary-key IDs of the documents that contain the keyword
$sphinx->_limit = 2000;
$res = $sphinx->Query($keyword, $index);
// $res = $sphinx->Query($keyword, '*');
if (isset($res['matches'])) {
    $ids = array_keys($res['matches']);
    $ids = implode(',', $ids);
} else {
    print_r('No matching content');
    exit;
}
// The IDs are integer document IDs returned by Sphinx
$sql = "SELECT * FROM documents WHERE id IN ($ids)";
$mysqli_con = mysqli_connect('127.0.0.1', 'mysql', '', 'test', 3306, '/usr/local/mariadb/tmp/mysql.sock');
$res = mysqli_query($mysqli_con, $sql);
$data = array();  // initialize so the loop below is safe when no rows come back
while ($row = mysqli_fetch_assoc($res)) {
    $data[] = $row;
}
// Highlight the keyword in every column of every row
foreach ($data as $key => $v) {
    $v = str_replace($keyword, "<font color='red'>{$keyword}</font>", $v);
    $data[$key] = $v;
}
echo "<pre>";
print_r($data);

Visiting the page returns the expected results, but there is one deprecation notice:

Deprecated: Methods with the same name as their class will not be constructors in a future version of PHP; SphinxClient has a deprecated constructor in /usr/local/nginx/html/sphinxapi.php on line 406

Solution:

Find the method that has the same name as the class SphinxClient and rename it to __construct.
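The rename can also be scripted. The sketch below assumes the constructor is declared as `function SphinxClient (` with optional whitespace before the parenthesis; `fix_constructor` is a hypothetical helper name, and a `.bak` copy of the file is kept in case the pattern does not match your version.

```shell
#!/usr/bin/env bash
# Hypothetical helper: rename the PHP4-style constructor in sphinxapi.php.
# Usage: fix_constructor /usr/local/nginx/html/sphinxapi.php

fix_constructor() {
    local file="$1"
    # -i.bak edits in place and keeps the original as <file>.bak.
    # Only "function SphinxClient(" is rewritten; "class SphinxClient" is left alone.
    sed -i.bak -E 's/function[[:space:]]+SphinxClient[[:space:]]*\(/function __construct(/' "$file"
}
```

Compare the file with its `.bak` copy afterwards (for example with `diff`) to confirm that exactly one line changed.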

[Figure: search results for a Chinese keyword]

5: Common Sphinx commands

(1): Start

/usr/local/sphinxforchinese/bin/searchd

(2): Stop

/usr/local/sphinxforchinese/bin/searchd --stop

(3): Build the indexes

/usr/local/sphinxforchinese/bin/indexer -c /usr/local/sphinxforchinese/etc/sphinx.conf --all --rotate
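The three commands above can be collected into one small wrapper. This is only a sketch: `sphinxctl`, `SPHINX_HOME`, `SPHINX_CONF`, and `DRY_RUN` are names introduced here for illustration, and the wrapper passes the config file explicitly with `-c` so it does not depend on the current directory.

```shell
#!/usr/bin/env bash
# Hypothetical wrapper around the searchd and indexer commands from this article.
# Usage: sphinxctl start | stop | reindex
#        DRY_RUN=1 sphinxctl stop     (prints the command instead of running it)

SPHINX_HOME="${SPHINX_HOME:-/usr/local/sphinxforchinese}"
SPHINX_CONF="${SPHINX_CONF:-$SPHINX_HOME/etc/sphinx.conf}"

sphinxctl() {
    local run=""
    # With DRY_RUN=1 the command line is echoed rather than executed.
    [ "${DRY_RUN:-0}" = "1" ] && run="echo"
    case "$1" in
        start)   $run "$SPHINX_HOME/bin/searchd" -c "$SPHINX_CONF" ;;
        stop)    $run "$SPHINX_HOME/bin/searchd" -c "$SPHINX_CONF" --stop ;;
        reindex) $run "$SPHINX_HOME/bin/indexer" -c "$SPHINX_CONF" --all --rotate ;;
        *)       echo "usage: sphinxctl {start|stop|reindex}" >&2; return 2 ;;
    esac
}
```

Running with `DRY_RUN=1` first is a cheap way to confirm the paths before touching a live searchd.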

That is roughly how to install and use sphinx-for-chinese.
