Deleting Duplicate Data in Oracle

程序员文章站 2024-02-08 19:50:34

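Duplicate data comes in two forms: rows that match on only some columns, and rows that are identical in every column.
I. Deleting rows duplicated on some columns
1. Finding the duplicates
select col1, col2, count(*) from table_name group by col1, col2 having count(*) > 1;
Example: select owner from dba_tables group by owner having count(*) > 1;
select owner from dba_tables group by owner having count(*) = 1; -- values that appear exactly once (no duplicates)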
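To see what the grouping query returns, here is a minimal sketch on a small throwaway table. The table dup_demo and its columns are hypothetical names made up for this illustration; they are not from the original article.

```sql
-- Hypothetical demo table; all names here are illustrative only.
create table dup_demo (col1 number, col2 varchar2(10), note varchar2(20));

insert into dup_demo values (1, 'a', 'first');
insert into dup_demo values (1, 'a', 'second');
insert into dup_demo values (2, 'b', 'only one');
insert into dup_demo values (3, 'c', 'x');
insert into dup_demo values (3, 'c', 'y');
insert into dup_demo values (3, 'c', 'z');
commit;

-- List each (col1, col2) combination that occurs more than once,
-- together with how many rows share it.
select col1, col2, count(*) as cnt
from dup_demo
group by col1, col2
having count(*) > 1
order by col1;
-- Expected: (1, 'a', 2) and (3, 'c', 3); the (2, 'b') group is not listed.

-- Clean up the demo table.
drop table dup_demo;
```

The note column differs between the rows of a group, which is exactly the "only some columns match" case: the rows count as duplicates only with respect to the columns named in the group by clause.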
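2. Deleting the duplicates
delete from table_name a where (a.col1, a.col2) in (select col1, col2 from table_name group by col1, col2 having count(*) > 1);
Note that this removes every row in each duplicated group, not just the extra copies. It is also very inefficient; on a large table it can effectively hang the database.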
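A more efficient approach is to first save the duplicated key values into a temporary table and then delete against that table.
create table temp_table as
(
select col1, col2, count(*) as row_num
from table_name
group by col1, col2
having count(*) > 1
);
The statement above creates the temporary table and fills it with the query result.
The delete can then be written as:
delete from table_name a
where (a.col1, a.col2) in (select col1, col2 from temp_table);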
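3. Keeping only the newest row of each duplicate group
In Oracle, rowid is a pseudocolumn that uniquely identifies each row in a table, so it is enough to keep the row with the largest rowid in each group. (The largest rowid is usually, though not necessarily, the most recently inserted row.)
Finding the duplicates:
select a.rowid, a.* from table_name a
where a.rowid != (
select max(b.rowid) from table_name b
where a.col1 = b.col1 and a.col2 = b.col2 );
Example: select a.* from test a
where a.rowid != (
select max(b.rowid) from test b
where a.owner = b.owner);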
Deleting the duplicates and keeping only the newest row:
delete from table_name a
where a.rowid != (
select max(b.rowid) from table_name b
where a.col1 = b.col1 and a.col2 = b.col2 );
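A small worked example may make the correlated subquery easier to follow. This is only a sketch: the table dup_demo and its columns are hypothetical names chosen for the illustration.

```sql
-- Hypothetical demo table; all names here are illustrative only.
create table dup_demo (col1 number, col2 varchar2(10), note varchar2(20));

insert into dup_demo values (1, 'a', 'first');
insert into dup_demo values (1, 'a', 'second');
insert into dup_demo values (2, 'b', 'only one');
insert into dup_demo values (3, 'c', 'x');
insert into dup_demo values (3, 'c', 'y');
insert into dup_demo values (3, 'c', 'z');

-- For every row, the subquery finds the largest rowid among the rows
-- sharing the same (col1, col2). A row is deleted unless it is that one.
delete from dup_demo a
where a.rowid != (
    select max(b.rowid) from dup_demo b
    where a.col1 = b.col1 and a.col2 = b.col2
);
-- 3 rows are deleted: one from the (1, 'a') group, two from the (3, 'c') group.

-- Verify that each combination now occurs exactly once.
select col1, col2, count(*) as cnt
from dup_demo
group by col1, col2
order by col1;
-- Expected: (1, 'a', 1), (2, 'b', 1), (3, 'c', 1)

commit;

-- Clean up the demo table.
drop table dup_demo;
```

One caveat with this pattern: if col1 or col2 is NULL, the equality test in the subquery never matches, so rows with NULL in the compared columns are never deleted, even when they look like duplicates.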
Using a temporary table to make this more efficient:
create table temp_table as
(select a.col1, a.col2, max(a.rowid) as dataid from table_name a
group by a.col1, a.col2);
delete from table_name a
where a.rowid !=
( select b.dataid from temp_table b
where a.col1 = b.col1 and
a.col2 = b.col2 );
commit;
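As an alternative that the original article does not cover, the same "keep the row with the largest rowid" rule can be written with the analytic function row_number(), which needs neither a correlated subquery nor a helper table. The sketch below uses a hypothetical table dup_demo; the names are assumptions made for the illustration.

```sql
-- Hypothetical demo table; all names here are illustrative only.
create table dup_demo (col1 number, col2 varchar2(10), note varchar2(20));

insert into dup_demo values (1, 'a', 'first');
insert into dup_demo values (1, 'a', 'second');
insert into dup_demo values (2, 'b', 'only one');
insert into dup_demo values (3, 'c', 'x');
insert into dup_demo values (3, 'c', 'y');

-- Number the rows inside each (col1, col2) group, largest rowid first.
-- The row numbered 1 is kept; every row numbered 2 or higher is deleted.
delete from dup_demo
where rowid in (
    select rid
    from (
        select rowid as rid,
               row_number() over (partition by col1, col2
                                  order by rowid desc) as rn
        from dup_demo
    )
    where rn > 1
);
-- 2 rows are deleted: one from the (1, 'a') group, one from the (3, 'c') group.

commit;

-- Clean up the demo table.
drop table dup_demo;
```

Unlike the equality join used above, partition by places NULL values in the same group, so rows whose compared columns are NULL are deduplicated as well. Changing the order by inside the over clause (for example to a timestamp column, if the table has one) changes which row of each group survives.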
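II. Deleting fully identical rows
When two rows are identical in every column, the following statement returns the data with the duplicates removed:
select distinct * from table_name;
You can store that result in a temporary table, delete the rows of the original table, and then load the temporary table's data back into it:
create table temp_table as (select distinct * from table_name);
truncate table table_name;
insert into table_name (select * from temp_table);
commit;
drop table temp_table;
Alternatively, to remove duplicates from a table, first load the deduplicated data into a separate table and then load it from there back into the original table. Assuming t_table_bak already exists with the same structure as t_table:
insert into t_table_bak
select distinct * from t_table;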
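The full round trip for identical rows can be sketched end to end as follows. The tables dup_demo and dup_demo_tmp are hypothetical names used only for this illustration.

```sql
-- Hypothetical demo table with fully identical rows; names are illustrative.
create table dup_demo (col1 number, col2 varchar2(10));

insert into dup_demo values (1, 'a');
insert into dup_demo values (1, 'a');
insert into dup_demo values (2, 'b');
insert into dup_demo values (3, 'c');
insert into dup_demo values (3, 'c');
insert into dup_demo values (3, 'c');

-- Step 1: copy the distinct rows into a helper table.
create table dup_demo_tmp as select distinct * from dup_demo;

-- Step 2: empty the original table but keep its definition.
truncate table dup_demo;

-- Step 3: load the deduplicated rows back and drop the helper table.
insert into dup_demo select * from dup_demo_tmp;
commit;
drop table dup_demo_tmp;

select count(*) from dup_demo;
-- Expected: 3, one row each for (1, 'a'), (2, 'b') and (3, 'c').

-- Clean up the demo table.
drop table dup_demo;
```

Emptying the table with truncate rather than dropping it keeps its indexes, constraints and grants in place. Truncate cannot be rolled back, so if that matters, delete from dup_demo can be used for step 2 instead, at the cost of more undo and redo on a large table.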

Supplement:

Finding duplicate data in an Oracle database:

select emp_name, count(*) from employee group by emp_name having count(*) > 1;

Finding the duplicate rows that can be deleted (every row except the one with the smallest emp_id for each name):

select t1.* from employee t1
where t1.emp_name in (select t2.emp_name from employee t2 group by t2.emp_name having count(*) > 1)
and t1.emp_id not in (select min(t3.emp_id) from employee t3 group by t3.emp_name having count(*) > 1);

Deleting the duplicate data:

delete from employee t1
where t1.emp_name in (select t2.emp_name from employee t2 group by t2.emp_name having count(*) > 1)
and t1.emp_id not in (select min(t3.emp_id) from employee t3 group by t3.emp_name having count(*) > 1);
