ElasticSearch_dsl: Multi-Field Query Deduplication and Filtering Explained (script)

2023-05-24 08:13:51

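For single-field deduplication in ElasticSearch, see the post "ElasticSearch Single-Field Query Deduplication Explained" (IT之一小佬's blog, CSDN).

For multi-field deduplication in ElasticSearch, see the post "ElasticSearch Multi-Field Query Deduplication and Filtering Explained" (IT之一小佬's blog, CSDN).

This post explains in detail how to deduplicate on multiple fields with elasticsearch_dsl. The sample data is the same as in the single-field post referenced above.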

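1. Querying by a condition

Example code:

from elasticsearch_dsl import connections, Search, Q

# Connect to Elasticsearch
es = connections.create_connection(hosts=['192.168.124.49:9200'], timeout=20)
print(es)

# Match documents whose `provience` field (name as used in the sample data) contains '北京'
s = Search(using=es, index='person_info')
q = Q('match', provience='北京')
res = s.query(q)
# Iterating the Search object executes it; by default only the first 10 hits are returned
for data in res:
    print(data.to_dict())

# count() sends a separate count request and returns the total number of matches
print("Found %d records in total" % res.count())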

Output: [image of the run result]
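
For readers who want to see what this query looks like on the wire, the sketch below builds the JSON request body that a `match` query on `provience` corresponds to, using only the standard library. The helper name `build_match_body` and the explicit `size` parameter are illustrative assumptions and not part of the original post. `size` is shown because Elasticsearch returns only 10 hits per request unless told otherwise, so the loop in the example can print fewer documents than `count()` reports.

```python
import json


def build_match_body(field, value, size=10):
    """Return a search request body for a single-field match query.

    Hypothetical helper for illustration; `size` defaults to 10 to mirror
    the number of hits Elasticsearch returns when no size is given.
    """
    return {
        "query": {"match": {field: value}},
        "size": size,
    }


body = build_match_body("provience", "北京")
print(json.dumps(body, ensure_ascii=False, indent=2))
```

With elasticsearch_dsl, a larger page can be requested by slicing the Search object, for example `s.query(q)[0:100]`.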

2. Multi-field deduplication with a script_fields script

Example code:

from elasticsearch_dsl import connections, Search, Q

# Connect to Elasticsearch
es = connections.create_connection(hosts=['192.168.124.49:9200'], timeout=20)
print(es)

s = Search(using=es, index='person_info')
q = Q('match', provience='北京')
# Alternative script that combines the two values without labels:
# res = s.query(q).script_fields(age_gender_aggs={'script': {'lang': 'painless', 'source': "doc['age'].value + doc['gender'].value"}})
# Compute a combined age/gender key for every hit with a Painless script.
# script_fields only adds the key to each hit; it does not drop duplicate hits by itself.
res = s.query(q).script_fields(age_gender_aggs={'script': {'lang': 'painless', 'source': "'age:' + doc['age'].value + ',gender:' + doc['gender'].value"}})

count = 0
for data in res:
    print(data.to_dict(), type(data.to_dict()))
    count += 1
print("Found %d records in total" % count)

Output: [image of the run result]
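
Script field values come back from Elasticsearch as lists under each hit's `fields` key, so a computed key such as `age_gender_aggs` arrives wrapped in a one-element list. The sketch below works on a hand-written sample hit (the values are made up for illustration and are not taken from the post's data) and shows one way to unwrap the key. `script_field_value` is a hypothetical helper name.

```python
def script_field_value(hit, name):
    """Return the first value of a script field from a raw search hit, or None."""
    values = hit.get("fields", {}).get(name, [])
    return values[0] if values else None


# Hypothetical raw hit, shaped like one entry of an Elasticsearch response
sample_hit = {
    "_index": "person_info",
    "_id": "1",
    "fields": {"age_gender_aggs": ["age:25,gender:male"]},
}

key = script_field_value(sample_hit, "age_gender_aggs")
print(key)
```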

3. Multi-field deduplication with script_fields, returning selected fields

Example code:

from elasticsearch_dsl import connections, Search, Q

# Connect to Elasticsearch
es = connections.create_connection(hosts=['192.168.124.49:9200'], timeout=20)
print(es)

s = Search(using=es, index='person_info')
q = Q('match', provience='北京')
# Add the combined age/gender key and return only the listed _source fields
res = s.query(q)\
    .script_fields(age_gender_aggs={'script': {'lang': 'painless', 'source': "'age:' + doc['age'].value + ',gender:' + doc['gender'].value"}})\
    .source(['name', 'age', 'gender', 'address'])

count = 0
for data in res:
    print(data.to_dict(), type(data.to_dict()))
    count += 1
print("Found %d records in total" % count)

Output: [image of the run result]
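
A detail worth spelling out: when a request contains `script_fields` and no explicit `_source` setting, Elasticsearch leaves `_source` out of the hits, which is why `.source(...)` is chained in this example. As a rough sketch, the request body corresponding to the code above looks like the following. `build_body` is a hypothetical helper, written with plain dictionaries so that it runs without a cluster.

```python
import json

SCRIPT = "'age:' + doc['age'].value + ',gender:' + doc['gender'].value"


def build_body(province, source_fields):
    """Assemble a request body with a match query, one script field and a _source filter."""
    return {
        "query": {"match": {"provience": province}},
        "script_fields": {
            "age_gender_aggs": {
                "script": {"lang": "painless", "source": SCRIPT},
            },
        },
        # Explicitly list the _source fields to return with each hit
        "_source": source_fields,
    }


body = build_body("北京", ["name", "age", "gender", "address"])
print(json.dumps(body, ensure_ascii=False, indent=2))
```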

4. Multi-field deduplication with script_fields, returning all fields

Example code:

from elasticsearch_dsl import connections, Search, Q

# Connect to Elasticsearch
es = connections.create_connection(hosts=['192.168.124.49:9200'], timeout=20)
print(es)

s = Search(using=es, index='person_info')
q = Q('match', provience='北京')
# source([]) is used here to return every _source field alongside the script field
res = s.query(q)\
    .script_fields(age_gender_aggs={'script': {'lang': 'painless', 'source': "'age:' + doc['age'].value + ',gender:' + doc['gender'].value"}})\
    .source([])\
    .execute()  # optional: iterating a Search object also executes it

count = 0
for data in res:
    print(data.to_dict(), type(data.to_dict()))
    count += 1
print("Found %d records in total" % count)

Output: [image of the run result]

5. Counting distinct combinations with a script_fields script

Example code:

from elasticsearch_dsl import connections, Search, Q

# Connect to Elasticsearch
es = connections.create_connection(hosts=['192.168.124.49:9200'], timeout=20)
print(es)

s = Search(using=es, index='person_info')
q = Q('match', provience='北京')
res = s.query(q).script_fields(age_gender_aggs={'script': {'lang': 'painless', 'source': "doc['age'].value + doc['gender'].value"}})

lst = []
for data in res:
    print(data.to_dict(), type(data.to_dict()))
    # Each hit holds only the script field, so equal age/gender pairs give equal strings
    lst.append(str(data.to_dict()))
# Converting the list to a set drops the duplicate combinations
print(set(lst))
print("Found %d records in total" % len(set(lst)))

Output: [image of the run result]
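
The loop above deduplicates on the client by turning each hit into a string. A slightly more robust variant, sketched below on made-up sample rows, uses a tuple of the fields of interest as the key and keeps the first document seen for each combination. The function name `dedupe_by_fields` and the sample records are assumptions for illustration. Client-side deduplication only sees the hits actually returned, so the page size of the request matters.

```python
def dedupe_by_fields(records, fields):
    """Keep the first record for each distinct combination of `fields`."""
    seen = set()
    unique = []
    for record in records:
        key = tuple(record.get(f) for f in fields)
        if key not in seen:
            seen.add(key)
            unique.append(record)
    return unique


# Hypothetical rows standing in for data.to_dict() results
rows = [
    {"name": "A", "age": 25, "gender": "male"},
    {"name": "B", "age": 25, "gender": "male"},
    {"name": "C", "age": 30, "gender": "female"},
]

unique_rows = dedupe_by_fields(rows, ["age", "gender"])
print("Found %d distinct combinations" % len(unique_rows))
```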

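6. Counting distinct combinations with a script inside an aggregation

Example code:

from elasticsearch_dsl import connections, Search, Q, A

# Connect to Elasticsearch
es = connections.create_connection(hosts=['192.168.124.49:9200'], timeout=20)
print(es)

s = Search(using=es, index='person_info')
q = Q('match', provience='北京')
search = s.query(q)
# cardinality is a metric aggregation, so it is registered with metric();
# the script builds the combined age/gender value that is counted
search.aggs.metric('age_gender_agg',
                   A('cardinality', script={'lang': 'painless', 'source': "doc['age'].value + doc['gender'].value"}))
ret = search.execute()
print(ret)
print(ret.aggregations.age_gender_agg)
# The (approximate) number of distinct age/gender combinations
print(ret.aggregations.age_gender_agg.value)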

Output: [image of the run result]
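
Two caveats apply to the script used for counting. First, the `cardinality` aggregation returns an approximate distinct count (it is based on HyperLogLog++). Second, `doc['age'].value + doc['gender'].value` only behaves as a composite key when at least one operand is a string; if both fields were numeric, `+` would add them, and different pairs could collapse into the same value. The plain-Python sketch below illustrates that pitfall and the usual remedy of joining the values with a separator. The sample pairs assume a numerically encoded gender, which may not match the post's data, and both function names are hypothetical.

```python
def naive_key(age, gender):
    """Mimic `+` on two numeric fields: the values are summed."""
    return age + gender


def separated_key(age, gender, sep="|"):
    """Join the values as text with a separator so each pair stays distinct."""
    return f"{age}{sep}{gender}"


# Hypothetical (age, gender_code) pairs; the first and third are true duplicates
pairs = [(20, 1), (21, 0), (20, 1)]

naive_distinct = len({naive_key(a, g) for a, g in pairs})
safe_distinct = len({separated_key(a, g) for a, g in pairs})
print(naive_distinct, safe_distinct)
```

In Painless the same idea is expressed by the labelled script used in sections 2 to 4, `'age:' + doc['age'].value + ',gender:' + doc['gender'].value`, where the string literals force concatenation and separate the two values.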

