bedtools subtract 基因区段取差集

article/2025/9/10 13:47:30

基本概述：

bedtools subtract 通俗的说，得到 A - B 的区段。如果在A中发现了B区段，就把 B 扣除，通过不同的参数，扣除的标准不一样。其中，参数 -A 可以达成 Remove features with any overlap 的效果（第四行）。
在这里插入图片描述

使用方法：

bedtools subtract [OPTIONS] -a <BED/GFF/VCF> -b <BED/GFF/VCF>

或者简写成

subtractBed [OPTIONS] -a <BED/GFF/VCF> -b <BED/GFF/VCF>

参数一览

Option	Description
-f	Minimum overlap required as a fraction of A. Default is 1E-9 (i.e. 1bp).
-F	Minimum overlap required as a fraction of B. Default is 1E-9 (i.e., 1bp).
-r	Require that the fraction of overlap be reciprocal for A and B. In other words, if -f is 0.90 and -r is used, this requires that B overlap at least 90% of A and that A also overlaps at least 90% of B.
-e	Require that the minimum fraction be satisfied for A _OR_ B. In other words, if -e is used with -f 0.90 and -F 0.10 this requires that either 90% of A is covered OR 10% of B is covered. Without -e, both fractions would have to be satisfied.
-s	Force “strandedness”. That is, only report hits in B that overlap A on the same strand. By default, overlaps are reported without respect to strand.
-S	Require different strandedness. That is, only report hits in B that overlap A on the _opposite_ strand. By default, overlaps are reported without respect to strand.
-A	Remove entire feature if any overlap. That is, by default, only subtract the portion of A that overlaps B. Here, if any overlap is found (or -f amount), the entire feature is removed.
-N	Same as -A except when used with -f, the amount is the sum of all features (not any single feature).

默认参数：

$ cat A.bed
chr1  10   20
chr1  100  200$ cat B.bed
chr1  0    30
chr1  180  300$ bedtools subtract -a A.bed -b B.bed
chr1  100  180

A 10 - 20 100 - 200
B 0-30 180-300
默认状态 A 第一个区段全扣除，第二个区段180之后的序列扣除，最终剩下100-180.

-f 重叠参数

$ cat A.bed
chr1  100  200$ cat B.bed
chr1  180  300$ bedtools subtract -a A.bed -b B.bed -f 0.10
chr1  100  180$ bedtools subtract -a A.bed -b B.bed -f 0.80
chr1  100  200

f 这个参数，如果B的区段覆盖度不满足这个参数，就不被扣除。0.10 扣了，0.80 没扣。

-s 考虑特征（正反链）

$ cat A.bed
chr1  100  200    a1  1   +$ cat B.bed
chr1  80   120    b1  1   +
chr1  180  300    b2  1   -$ bedtools subtract -a A.bed -b B.bed -S
chr1  100  180    a1  1   +

-A Remove features with any overlap：

重点来了，-A 模式下，那么 B 里面包含 A 里包含一个 1个BP，也是整段删除。“”沾边删” 模式

$ cat A.bed
chr1  100  200$ cat B.bed
chr1  180  300$ bedtools subtract -a A.bed -b B.bed
chr1  100  180$ bedtools subtract -a A.bed -b B.bed -A

bedtools subtract 基因区段取差集

基本概述：

使用方法：

默认参数：

-f 重叠参数

-s 考虑特征（正反链）

-A Remove features with any overlap：

相关文章

cv::subtract

opencv之subtract

OpenCV函数subtract()使用心得及需要注意的地方

moment系列一：add() 方法和subtract() 方法的使用

基于改进EAST算法的文本检测

自然文本检测主要模型

深度学习：多场景多尺度的文本检测

FOTS：自然场景的文本检测与识别

【文本检测】DBNet

openCV实践项目：图片文本检测

TextSnake文本检测

值得一看的文本检测方法

文本检测与识别

OCR文本检测模型—EAST

文本检测算法新思路：基于区域重组的文本检测

OCR文本检测模型—CTPN

OpenCV实战——文本检测

【文本检测与识别白皮书-3.1】第一节：常用的文本检测与识别方法

paddleocr文本检测模型的训练

文本检测实战：使用OpenCV实现文本检测（EAST 文本检测器）