mysql中distinct和group by過濾刪除重復行

2024-07-24 12:38:05

字體：大中小

來源：轉載

供稿：網友

在mysql中distinct就是可以直接去重的而group by 是分組顯示的,但是有朋友在應用中可能會發現distinct并不像官方講得那有實用了,下面我來介紹一下它們是怎么過濾刪除重復行.

下面先來看看例子,代碼如下:

table

id name

1 a

2 b

3 c

4 c

5 b

庫結構大概這樣,這只是一個簡單的例子,實際情況會復雜得多,比如我想用一條語句查詢得到name不重復的所有數據,那就必須使用distinct去掉多余的重復記錄,代碼如下:

select distinct name from table

得到的結果是:

name

a

b

c

好像達到效果了,可是,我想要得到的是id值呢？改一下查詢語句吧:

select distinct name, id from table

結果會是:

id name

1 a

2 b

3 c

4 c

5 b

distinct怎么沒起作用？作用是起了的,不過他同時作用了兩個字段,也就是必須得id與name都相同的才會被排除.

我們再改改查詢語句:select id, distinct name from table

現在將完整語句放出:

select *, count(distinct name) from table group by name

結果:

id name count(distinct name)

1 a 1

2 b 1

3 c 1

上面簡單但有些地方是不能完成我們的需要的,下面記錄了些常用的重復記錄操作語句

查詢及刪除重復記錄的方法.

1、查找表中多余的重復記錄,重復記錄是根據單個字段（peopleId）來判斷,代碼如下:

select * from people

where peopleId in (select peopleId from people group by peopleId having count(peopleId) > 1)

2、刪除表中多余的重復記錄,重復記錄是根據單個字段（peopleId）來判斷,只留有rowid最小的記錄,代碼如下:

delete from people

where peopleId in (select peopleId from people group by peopleId having count(peopleId) > 1)

and rowid not in (select min(rowid) from people group by peopleId having count(peopleId )>1)

3、查找表中多余的重復記錄,多個字段,代碼如下:

select * from vitae a

where (a.peopleId,a.seq) in (select peopleId,seq from vitae group by peopleId,seq having count(*) > 1)

4、刪除表中多余的重復記錄,多個字段,只留有rowid最小的記錄,代碼如下:

delete from vitae a

where (a.peopleId,a.seq) in (select peopleId,seq from vitae group by peopleId,seq having count(*) > 1)

and rowid not in (select min(rowid) from vitae group by peopleId,seq having count(*)>1)

5、查找表中多余的重復記錄,多個字段,不包含rowid最小的記錄,代碼如下:

select * from vitae a

where (a.peopleId,a.seq) in (select peopleId,seq from vitae group by peopleId,seq having count(*) > 1)

and rowid not in (select min(rowid) from vitae group by peopleId,seq having count(*)>1) --Vevb.com