Lucene-like search through JSON objects in JavaScript

2022-01-15 00:00:00 indexing json lucene javascript

I have a fairly large array of JSON objects (it's a music library with properties such as artist and album, feeding a jqGrid with loadonce=true), and I want to implement Lucene-like (Google-like) querying across the whole set, but locally, i.e. in the browser, without communicating with the web server. Are there any JavaScript frameworks that will help me?

Recommended answer

  1. Go through your records once and create a one-time index by combining all searchable fields into a single string field called index.

  2. Store these indexed records in an Array.

  3. Partition the Array on the index, e.g. all the a's in one array, and so on.

  4. Use the JavaScript function indexOf() against the index to match the query entered by the user and find records in the partitioned Array.
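The steps above can be sketched roughly as follows. This is only a minimal illustration, not the answerer's actual code: the sample `library` data, the field list, and the function names (`buildIndex`, `partitionByFirstChar`, `search`, `searchPrefix`) are all made up for the example, and partitioning by the first character of the index string is just one reading of "all a's in one array".

```javascript
// Hypothetical sample data standing in for the music library.
const library = [
  { artist: 'Miles Davis', album: 'Kind of Blue', title: 'So What' },
  { artist: 'Nina Simone', album: 'Pastel Blues', title: 'Sinnerman' },
  { artist: 'Massive Attack', album: 'Mezzanine', title: 'Teardrop' },
];
const SEARCHABLE_FIELDS = ['artist', 'album', 'title'];

// Steps 1 and 2: build the index once, one lowercase string per record.
function buildIndex(records, fields) {
  return records.map((record) => ({
    record,
    index: fields
      .map((f) => (record[f] == null ? '' : String(record[f])))
      .join(' ')
      .toLowerCase(),
  }));
}

// Step 3: partition the entries by the first character of the index string.
function partitionByFirstChar(entries) {
  const partitions = new Map();
  for (const entry of entries) {
    const key = entry.index.charAt(0);
    if (!partitions.has(key)) partitions.set(key, []);
    partitions.get(key).push(entry);
  }
  return partitions;
}

// Step 4a: substring match anywhere in the index; scans every partition.
function search(partitions, query) {
  const needle = query.trim().toLowerCase();
  if (needle === '') return [];
  const hits = [];
  for (const bucket of partitions.values()) {
    for (const entry of bucket) {
      if (entry.index.indexOf(needle) !== -1) hits.push(entry.record);
    }
  }
  return hits;
}

// Step 4b: prefix match; only the partition for the first typed character is scanned.
function searchPrefix(partitions, query) {
  const needle = query.trim().toLowerCase();
  if (needle === '') return [];
  const bucket = partitions.get(needle.charAt(0)) || [];
  return bucket
    .filter((entry) => entry.index.indexOf(needle) === 0)
    .map((entry) => entry.record);
}

const partitions = partitionByFirstChar(buildIndex(library, SEARCHABLE_FIELDS));
console.log(search(partitions, 'teardrop').map((r) => r.artist));
console.log(searchPrefix(partitions, 'mi').map((r) => r.artist));
```

Note that in this sketch the partitioning only saves work for prefix-style lookups (as in autocomplete). A substring match anywhere in the index still has to visit every partition.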

That was the easy part, but it supports all simple queries very efficiently, because the index does not have to be re-created for every query and the native indexOf operation is fast. I have used it to search up to 2000 records. I used a pre-sorted Array. Actually, that's how Gmail and Yahoo Mail work: they store your contacts in the browser in a pre-sorted array with an index that lets you see the contact names as you type.
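To illustrate why a pre-sorted array suits as-you-type lookup, here is a small sketch that binary-searches for the first entry with the typed prefix and then reads forward. The contact names and the helpers `lowerBound` and `autocomplete` are hypothetical; this is not how any particular mail client implements it.

```javascript
// Hypothetical contact names; sorted once up front by a lowercase key.
const contacts = ['Alice Smith', 'Alan Turing', 'Bob Jones', 'Albert King', 'Carol White'];
const sortedContacts = contacts
  .map((name) => ({ key: name.toLowerCase(), name }))
  .sort((a, b) => (a.key < b.key ? -1 : a.key > b.key ? 1 : 0));

// Binary search: first position whose key is not less than the prefix.
function lowerBound(sorted, prefix) {
  let lo = 0;
  let hi = sorted.length;
  while (lo < hi) {
    const mid = (lo + hi) >>> 1;
    if (sorted[mid].key < prefix) lo = mid + 1;
    else hi = mid;
  }
  return lo;
}

// All entries sharing a prefix are adjacent in a sorted array,
// so read forward from the lower bound until the prefix stops matching.
function autocomplete(sorted, typed, limit = 10) {
  const prefix = typed.toLowerCase();
  if (prefix === '') return [];
  const out = [];
  for (let i = lowerBound(sorted, prefix); i < sorted.length && out.length < limit; i++) {
    if (!sorted[i].key.startsWith(prefix)) break;
    out.push(sorted[i].name);
  }
  return out;
}

console.log(autocomplete(sortedContacts, 'al'));
```

Because the array is sorted only once, each keystroke costs a binary search plus the number of matches returned, rather than a scan of the whole list.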

This also gives you a base to build on. You can now write more advanced query-parsing logic on top of it. For example, supporting a few simple conditional keywords such as AND, OR and NOT takes about 20-30 lines of custom JavaScript. Or you can find a JS library that does the parsing for you the way Lucene does.
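As a rough idea of what such custom parsing might look like, the sketch below handles uppercase AND, OR and NOT on top of indexOf. The grammar is an assumption chosen for brevity (no parentheses or quoted phrases, NOT binds to the next term, AND binds tighter than OR, whitespace between terms means AND), and the names `parseQuery`, `matchesQuery` and `searchWithOperators` and the sample entries are hypothetical.

```javascript
// Hypothetical pre-built index entries: { record, index } with a lowercase index string.
const indexedEntries = [
  { record: { id: 1 }, index: 'miles davis kind of blue' },
  { record: { id: 2 }, index: 'nina simone pastel blues' },
  { record: { id: 3 }, index: 'miles davis bitches brew' },
];

// Parse into OR-groups; each group is a list of { term, negate } that must all hold.
function parseQuery(query) {
  const tokens = query.trim().split(/\s+/).filter(Boolean);
  const groups = [[]];
  let negateNext = false;
  for (const token of tokens) {
    if (token === 'OR') {
      groups.push([]); // start a new alternative
      negateNext = false;
    } else if (token === 'AND') {
      // AND is the default between terms, so nothing to do.
    } else if (token === 'NOT') {
      negateNext = true; // applies to the next term only
    } else {
      groups[groups.length - 1].push({ term: token.toLowerCase(), negate: negateNext });
      negateNext = false;
    }
  }
  return groups.filter((group) => group.length > 0);
}

// A record matches if any group has all of its terms satisfied.
function matchesQuery(index, groups) {
  return groups.some((group) =>
    group.every(({ term, negate }) => (index.indexOf(term) !== -1) !== negate)
  );
}

function searchWithOperators(entries, query) {
  const groups = parseQuery(query);
  return entries.filter((e) => matchesQuery(e.index, groups)).map((e) => e.record);
}

console.log(searchWithOperators(indexedEntries, 'miles NOT blue'));
```

Requiring the keywords in uppercase mirrors Lucene's query syntax and keeps ordinary words such as "and" searchable as plain terms.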

For a reference implementation of the logic above, take a look at how ZmContactList.js sorts and searches contacts for autocomplete.
