在mysql中distinct就是可以直接去重的而group by 是分組顯示的,但是有朋友在應用中可能會發現distinct并不像官方講得那有實用了,下面我來介紹一下它們是怎么過濾刪除重復行.
下面先來看看例子,代碼如下:
- table
- id name
- 1 a
- 2 b
- 3 c
- 4 c
- 5 b
庫結構大概這樣,這只是一個簡單的例子,實際情況會復雜得多,比如我想用一條語句查詢得到name不重復的所有數據,那就必須使用distinct去掉多余的重復記錄,代碼如下:
select distinct name from table
得到的結果是:
- name
- a
- b
- c
好像達到效果了,可是,我想要得到的是id值呢?改一下查詢語句吧:
select distinct name, id from table
結果會是:
- id name
- 1 a
- 2 b
- 3 c
- 4 c
- 5 b
distinct怎么沒起作用?作用是起了的,不過他同時作用了兩個字段,也就是必須得id與name都相同的才會被排除.
我們再改改查詢語句:select id, distinct name from table
現在將完整語句放出:
select *, count(distinct name) from table group by name
結果:
- id name count(distinct name)
- 1 a 1
- 2 b 1
- 3 c 1
上面簡單但有些地方是不能完成我們的需要的,下面記錄了些常用的重復記錄操作語句
查詢及刪除重復記錄的方法.
1、查找表中多余的重復記錄,重復記錄是根據單個字段(peopleId)來判斷,代碼如下:
- select * from people
- where peopleId in (select peopleId from people group by peopleId having count(peopleId) > 1)
2、刪除表中多余的重復記錄,重復記錄是根據單個字段(peopleId)來判斷,只留有rowid最小的記錄,代碼如下:
- delete from people
- where peopleId in (select peopleId from people group by peopleId having count(peopleId) > 1)
- and rowid not in (select min(rowid) from people group by peopleId having count(peopleId )>1)
3、查找表中多余的重復記錄,多個字段,代碼如下:
- select * from vitae a
- where (a.peopleId,a.seq) in (select peopleId,seq from vitae group by peopleId,seq having count(*) > 1)
4、刪除表中多余的重復記錄,多個字段,只留有rowid最小的記錄,代碼如下:
- delete from vitae a
- where (a.peopleId,a.seq) in (select peopleId,seq from vitae group by peopleId,seq having count(*) > 1)
- and rowid not in (select min(rowid) from vitae group by peopleId,seq having count(*)>1)
5、查找表中多余的重復記錄,多個字段,不包含rowid最小的記錄,代碼如下:
- select * from vitae a
- where (a.peopleId,a.seq) in (select peopleId,seq from vitae group by peopleId,seq having count(*) > 1)
- and rowid not in (select min(rowid) from vitae group by peopleId,seq having count(*)>1) --Vevb.com
新聞熱點
疑難解答