ElasticSearch之Merge-天翼云

ElasticSearch之Merge

2024-04-17 08:21:15 阅读次数：50

Elasticsearch的shard，即对应Lucene的index。
Lucene的index由多个segment组成。
segment是index保存数据的最小单位，不支持修改。

Elasticsearch在运行过程中，启动后台任务，周期性检测并将占用空间小的segment自动合并至大一些的segment，避免存在过多的segment对象，同时在合并过程中，会剔除掉已删除的记录。

合并操作的过程可能消耗较多的资源，比如CPU和I/O，因此在合并操作运行的过程中，Elasticsearch会自动调整合并操作的吞吐量，优先保证其它业务的正常运行。

Elasticsearch提供了ConcurrentMergeScheduler作为合并操作的调度器，管理合并操作的产生和运行。

ConcurrentMergeScheduler在新的线程中提交合并操作，同时控制合并操作的并发数。当合并操作占用的线程的数量达到index.merge.scheduler.max_thread_count，ConcurrentMergeScheduler将后续待执行的合并操作放至队列中，避免合并操作占用过多的资源，影响其它操作。

相关参数

index.merge.scheduler.max_thread_count
在一个shard上执行merge操作时允许使用的线程的数量。
默认值为Math.max(1, Math.min(4, node.processors / 2))。

修改参数的取值，执行命令如下：

curl -X PUT "https://localhost:9200/_settings?pretty" -H 'Content-Type: application/json' -d'
{
    "index.merge.scheduler.max_thread_count": 2
}
' --cacert $ES_HOME/config/certs/http_ca.crt -u "elastic:ohCxPH=QBE+s5=*lo7F9"

假如当前没有创建index，则报错信息如下：

{
  "error" : {
    "root_cause" : [
      {
        "type" : "index_not_found_exception",
        "reason" : "no such index [[]]",
        "index_uuid" : "_na_",
        "index" : "[]"
      }
    ],
    "type" : "index_not_found_exception",
    "reason" : "no such index [[]]",
    "index_uuid" : "_na_",
    "index" : "[]"
  },
  "status" : 404
}

假如当前已有创建好的index，执行结果的样例，如下：

{
  "acknowledged" : true
}

相关资料

Merge
Elasticsearch性能优化实战指南
es索引调优
源码剖析：Elasticsearch 段合并调度及优化手段

活动

智算服务

应用商城

合作伙伴

开发者

支持与服务

了解天翼云

ElasticSearch之Merge

ElasticSearch之Merge

相关文章

小课2：筛选信息命令

shell脚本实现查询代码中定义了多少宏的方法

【漏洞复现】CVE-2015-5531 Arbitrary File Reading

spring cloud系统安装涉及的技术说明

【Python】使用numpy库实现Tic-Tac-Toe井字棋

【linux】linux C 程序 注册信号处理函数

课时3：处理信息命令

SpringBoot项目在linux下部署脚本实例

ElasticSearch中的分页（size、from）

SSH port forwarding: bind: Cannot assign requested

作者介绍

最新文章

【漏洞复现】CVE-2015-5531 Arbitrary File Reading

ElasticSearch中的分页（size、from）

SSH port forwarding: bind: Cannot assign requested

linux从入门到精通—— vim使用

lrzsz——一款好用的文件互传工具

linux查询磁盘是否做raid

热门文章

Linux crontab 任务误删恢复及备份步骤

Linux 趣味小知识--软硬连接以及应用

Linux常用命令总结

linux-压缩与解压缩

linux基本命令（47）——iostat命令

Linux中文本搜索命令grep用法详解

热门标签

相关产品

弹性云主机

天翼云电脑（公众版）

对象存储

云硬盘

随机文章

linux IO 技术体系

Kali linux安装SSH服务

Linux之cp和mv命令选项

kickstart安装linux

ElasticSearch中的中文分词详解

Unix/Linux shell脚本中 “set -e” 的作用

【linux】linux C 程序注册信号处理函数