计算给定字符串中重复字符的数量

2021-09-10 00:00:00 sql tsql sql-server

如何计算给定字符串中重复 $ 字符的出现次数.

How do I count the number of occurrences of repeated $ character in the given strings.

例如:

  1. String = '$$$$ABC$$$DE$$$' -->答案是 4,3,3
  2. String = '###$$%%ANE$$$$$' -->答案是 2,5

我不知道怎么做所以没有做任何尝试.

I have no idea how to do it so did not do any attempts.

感谢您的帮助.

用于复制:

  1. DDL 和插入:

Create table xyz(text varchar(200));
Insert into xyz values('$$$$ABC$$$DE$$$');
Insert into xyz values('###$$%%ANE$$$$$');

  1. 我需要做的:计算'$'的重复次数

  1. What I need to do: Count the repeated number of '$'

所需的输出,基于上面 #1 中的示例数据.

Desired output, based on the sample data in #1 above.

text = '$$$$ABC$$$DE$$$' -->答案是 4,3,3
text = '###$$%%ANE$$$$$' -->答案是 2,5

text = '$$$$ABC$$$DE$$$' --> Answer is 4,3,3
text = '###$$%%ANE$$$$$' --> Answer is 2,5

SQL Server 版本:Microsoft SQL Server 2019 (RTM) - 15.0.2000.5

SQL Server version: Microsoft SQL Server 2019 (RTM) - 15.0.2000.5

推荐答案

请尝试以下解决方案.它将从 SQL Server 2017 开始工作.

Please try the following solution. It will work starting from SQL Server 2017 onwards.

它基于 TRANSLATE() 函数以及 XML 和 XQuery 的使用.

It is based on use of the TRANSLATE() function, and XML and XQuery.

SQL

-- DDL and sample data population, start
DECLARE @tbl TABLE (ID INT IDENTITY PRIMARY KEY, tokens VARCHAR(30));
INSERT INTO @tbl (tokens) VALUES
('$$$$ABC$$$DE$$$'), --> Answer is 4,3,3
('###$$%%ANE$$$$$'); --> Answer is 2,5
-- DDL and sample data population, end

DECLARE @separator CHAR(1) = SPACE(1);

;WITH cte AS 
(
    SELECT *
        , REPLACE(TRANSLATE(tokens, '$', SPACE(1)),' ','') AS JunkCharacters
    FROM @tbl
)
SELECT *
, REPLACE(TRY_CAST('<root><r><![CDATA[' +
    REPLACE(TRANSLATE(tokens, TRIM(JunkCharacters), SPACE(LEN(TRIM(JunkCharacters)))), @separator, ']]></r><r><![CDATA[') + 
    ']]></r></root>' AS XML)
        .query('
        for $x in /root/r[text()]
        return data(string-length($x))
        ').value('.', 'VARCHAR(20)'), SPACE(1), ',') AS CleansedTokensCounter
FROM cte;

输出

+----+-----------------+----------------+-----------------------+
| ID |     tokens      | JunkCharacters | CleansedTokensCounter |
+----+-----------------+----------------+-----------------------+
|  1 | $$$$ABC$$$DE$$$ | ABCDE          |                 4,3,3 |
|  2 | ###$$%%ANE$$$$$ | ###%%ANE       |                   2,5 |
+----+-----------------+----------------+-----------------------+

相关文章