关于blastn的讨论汇总 - 话题女王

g******1
发帖数: 295

来自主题: Biology版 - blastn 一问

For the following two sequences, I am doing blastn on Seq2 against Seq1
blastn -task blastn-short -outfmt "6 nident"
Seq1: ATTAGAWACCCBDGTAGTCC
Seq2: ATTAGATACCCTGGTAGTCC
The length of each sequence is 20. The blastn returns nident=17, with 3
mismatch.
However, T actually matches W and B, and G matches D.
With IUPAC characters, in this case, how can I also count ambiguous matches?
What I expect in this case is number of identity =20 instead of nident=17.
Is there any parameter for blastn so th... 阅读全帖

g******1
发帖数: 295

来自主题: Biology版 - blastn 一问

Doesn't blastn provide that output?

y**********n
发帖数: 478

来自主题: Biology版 - blastn 一问

check the parameters bellow to see if you can get that output, otherwise you
need to make a script yourself.
http://www.molquest.com/help/2.3/programs/BlastN/parameters.htm

C***U
发帖数: 2406

来自主题: Biology版 - 有没有熟悉blast的大牛啊？

我用的命令是
blastn -query ***.fa -db *** -out ***.out -num_threads 200 -outfmt 'std' -
word_size 11
blastn -query ***.fa -db *** -out ***.out -num_threads 200 -outfmt 'std'
所以说他其实用的是megablast,那么他的default word_size是28 而不是document上的
blastn的default word_size是11.

a***k
发帖数: 1038

来自主题: Military版 - 说武汉P4实验室是源头的，全部谣言。实锤来了。

2019年12月26日刚上班，还是如往常一样先大概浏览一下这一天的mNGS病原微生物自动
解读结果，没问题的话就开始一天的研发工作了。
意外的是，发现有一个样本报出了敏感病原体——SARS冠状病毒，有几十条的序列，且
这个样本只有这么一个有意义的病原体，如果是普通病毒，这已经是一个相当可靠的结
果了。心头一紧，赶紧后台查看详细的分析数据，发现相似度并不算很高，只有大约94
.5%（这跟卡相似度的阈值有关，相当于只筛选下了相似度比较高的序列）。想到有几
种可能：1、SARS不同毒株基因组有一定差异；2、RNA病毒容易突变，距离SRAS事件17
年了，变异比较大；3、近缘物种的错误比对等等。为了确认结果的可靠性，开始了详
细分析。
好在之前已经遇到过几次这种类似的敏感的病原体确认分析工作，而且领导也曾跟我讨
论过几次能不能做一个新发病原自动挖掘的分析流程，心里一直记着这个事情，在做其
他权重更高优先级更高的项目时也随手做了一个初步的版本出来，这个样本刚好可以
派上用场。我给它起了个名字，相比于日常生产用的分析流程，它多了个后缀：“探索
版”，包含了几乎所有已测序的病毒基因组。
探索版的分... 阅读全帖

l**********1
发帖数: 5204

来自主题: Biology版 - ？怎样Blast较大的序列？

>回复]
[ 1 ]
发信人: thomasz (thomasz), 信区: Biology
标题: ？怎样Blast较大的序列？
发信站: BBS 未名空间站 (Sat Feb 25 14:36:17 2012, 美东)
我有个6.6kb的DNA序列，是某个细菌基因组中的一段
。但是我
知道别的基因组里确实有整段的同源序列的。想来6.6kb也不是很大，那该怎么做好呢
？多谢！
----
and
cited:

------------
出一大堆 somewhat similar to Optimize for Somewhat similar sequences by (blastn)
then what is the next step?
Fishing most important hits?
//www.ncbi.nlm.nih.gov/blast/Blast.cgi?
CMD=Web&LAYOUT=TwoWindows&AUTO_FORMAT=Semiauto&ALIGNMENTS=50&ALIGNMENT_VIEW=Pairwi
se&
CLIENT=web&DATA... 阅读全帖

b******n
发帖数: 4225

来自主题: Biology版 - ？怎样Blast较大的序列？

在细菌中，长达6.6kb的片段不coding任何东西很少见的
如果有，一般是rRNA

VIEW=Pairwi
10&FILTER=L
Translations&
OVERVIEW=on&UNGAP
blastn)
VIEW=Pairwi
10&FILTER=L
PROGRAM=blastn&S
HTTPGET=Yes&SH

C***U
发帖数: 2406

来自主题: Biology版 - 有没有熟悉blast的大牛啊？

最近跑了几个例子，用的是blastn做nuc-nuc对比。发现一个很奇怪的现象。
如果我用word size 11跑和用default word size跑，两个的运行时间差距很大！word
size 11的那个2个小时都没做完，default的那个只要30分钟。
但是根据文档说明，blastn的default word size是11.
有没有哪个大牛可以解释一下？谢谢了！！

n******7
发帖数: 12463

来自主题: Biology版 - 有没有熟悉blast的大牛啊？

你没明白他问的什么
-W Word size, default if zero (blastn 11, megablast 28, all others 3) [
Integer]
default = 0
blastn 和 megablast 的default不一样

g**********t
发帖数: 475

来自主题: Biology版 - 有没有熟悉blast的大牛啊？

I checked the help document of blastn (version 2.2.23+). there's a flag, -
task,and its default value is megablast. so by default you use megablast.
change it to blastn.

word

c*********d
发帖数: 9770

来自主题: Military版 - 荒猫牛不耕微信号laoniu003005

荒猫牛不耕微信号laoniu003005
功能介绍
好文分享
来源：荒猫之舞
作者：荒猫
中美贸易战告一段落，毫不意外地，咱们又赢了。不过与以往不同的是，这次美国也赢
了，是双赢。据报道，咱们答应了美国减少贸易逆差的要求，所以美国赢了；而因为要
减少贸易逆差，中方将大量增加自美购买商品和服务，这样可以满足咱们不断增长的消
费需求和促进高质量经济发展，所以咱们也赢了。
但让我感觉不可思议的是，这么好的事情，为什么一直不做呢？为什么要在美国政府的
逼迫下去做呢？是因为没有想到？那岂不是说那些人能力低下？是因为想到了而不做，
那岂不是证明了那些人很坏？
而且在美国高调启动贸易战之初，咱们是抱着多大的决心，宁为玉碎不为瓦全，一定要
打赢的，一定要粉碎美国人阻挡咱们大国崛起的阴谋的。当美国提出500亿的清单时，
咱们针锋相对；当美国提出1000亿的清单时，咱们继续应战；而当美国把清单的总价提
高到2000亿的时候，一夜之间，美国就不是阻挡咱们大国崛起，而是有利于咱们大国崛
起了。美国的要求原来是有利于咱们的发展，而不是遏制咱们的发展了。我恍然大悟：
原来以前的针锋相对，是用的激将法，是为了刺激美... 阅读全帖

发帖数: 1

来自主题: Military版 - 浙江舟山蝙蝠类SARS病毒 CoVZXC21 和 CoVZC45 表示，我们不背这个锅。

一个加州理工毕业的说（引用他的话，是用来驳斥的。在没有RaTG13之前，他说的其实
有理。）
“
I performed BLASTn and Clustal alignment on the Wuhan coronavirus and cross-
annotated missing CDS regions. I found that Bat SARS-like CoVZXC21 and Bat
SARS-like CoVZC45 are the most similar coronaviruses. The sequences are
different at quite a few silent point mutations but sufficiently close (95-
93%) to indicate a lineage that has not diverged significantly- this is a
recent strain with a close common ancestor. Some regions don’t even have
sile... 阅读全帖

发帖数: 1

来自主题: Military版 - 洋爹认证: 武肺病毒不是来自病毒所不是合成的

Puma2019 发于 2020-01-29 08:58, 论坛:
标题: 跟一个朋友聊了会，这次武汉病毒不像合成的。
quote 如下：
I performed BLASTn and Clustal alignment on the Wuhan coronavirus and cross-
annotated missing CDS regions. I found that Bat SARS-like CoVZXC21 and Bat
SARS-like CoVZC45 are the most similar coronaviruses. The sequences are
different at quite a few silent point mutations but sufficiently close (95-
93%) to indicate a lineage that has not diverged significantly- this is a
recent strain with a close common ancestor. Some region... 阅读全帖

J**S
发帖数: 25790

来自主题: Military版 - 提出Rat13G和2019nCov 25年前共祖的，反对阿三的说法

Rat13G序列的登记号是啥，为啥BLASTn BLASTp 都找不到？
只能找到最近的是C21, C45？这两个序列是南京军事医学研究所上传的序列。

l*****g
发帖数: 263

来自主题: Biology版 - Re: Is it possible now to search human genome?

OK. Updated information. :)
I finally logged in the Celera web site and did a search.
I tried with a known protein sequence, but the result is
not satisfactory - it only give me partially aligned
sequence among their 26K predited sequence.
Details: I input a DNA sequence 600 bp, and the best fit
using BlastN is only three segments of perfect alignment.
(Note: I did the search against their transcript database.
So, there should be no intron in there.)
I kind of doubt what they said now - not the

a***e
发帖数: 1010

来自主题: Biology版 - How to analyze the interaction between two RNA molecules?

http://blast.ncbi.nlm.nih.gov/Blast.cgi?PAGE_TYPE=BlastSearch&PROG_DEF=blastn&BLAST_PROG_DEF=megaBlast&SHOW_DEFAULTS=on&BLAST_SPEC=blast2seq&LINK_LOC=align2seq

b******n
发帖数: 4225

来自主题: Biology版 - 关于引物问题的请教

“最离谱的是在这里面（新基因全部的基因序列）尽然找不到我的PCR引物序列”
你看看两端跟你的引物有没有部分相似性
其实PCR引物如果3'末端有13个左右碱基相同就有可能P出来的（Tm值不够高的话）
另外你使用BLAST中间的Somewhat similar sequences (blastn)选项
而不是缺省选项megablast

i*****l
发帖数: 51

来自主题: Biology版 - mega4求助

几乎所有的核酸分析软件都可以做，实在没有软件就在MEDLINE坐BLASTN分析

b*******e
发帖数: 288

来自主题: Biology版 - 怎么在一个genome里搜索一段相似的序列？

用ncbi的blastn --> align two or more sequences?

t*d
发帖数: 1290

来自主题: Biology版 - How can I install program on bioinfor cluster?

Did you see the folder named 'bin'?
blastn under 'bin' folder should be the program you will need.

applications
code"

b******n
发帖数: 4225

来自主题: Biology版 - ？怎样Blast较大的序列？

用blastx
不要用blastn，这个在以genome为subject进行blast的时候很坑爹
原因不知

y**********n
发帖数: 478

来自主题: Biology版 - blastn 一问

you can count the ambiguous matches yourself by a script based on the
alignment.

b****r
发帖数: 17995

来自主题: Biology版 - 有没有熟悉blast的大牛啊？

blastn我常用啊，怎么会要这么久，一般5分钟了不起了

g**********t
发帖数: 475

来自主题: Biology版 - 有没有熟悉blast的大牛啊？

which version and which command are you using? by default, blastn might use
the megablast method, in which the word size is 28. please check the
document.

word

C***U
发帖数: 2406

来自主题: Biology版 - 有没有熟悉blast的大牛啊？

我用的是blastn

C***U
发帖数: 2406

来自主题: Biology版 - 有没有熟悉blast的大牛啊？

我想我明白了他的问题，所以我才说我用的NCBI的版本，然后给了那个链接。
然后用的命令是blastn。

C***U
发帖数: 2406

来自主题: Biology版 - 有没有熟悉blast的大牛啊？

恩
我在机器上看了一下blastn -help
确实是你说的那样
大牛太感谢你了。。。。我的老板问我咋回事我都没找出来原因
拜谢了！！！

#	版面	帖数(主题数)
-	全站	4871 (796)
1	Military	3777 (569)
2	Stock	341 (51)
3	Joke	117 (17)
4	History	116 (3)
5	Automobile	100 (9)
6	USANews	55 (9)
7	Midlife	45 (1)
8	Headline	41 (41)
9	Dreamer	33 (13)
10	FleaMarket	32 (20)
11	Living	30 (7)

topics

未名新帖统计// 7月16日

历史上的今天