Advertisement

python&生物信息学 | kmer碱基序列生成

阅读量:

k-mer:在生物信息学中,k-mers是包含在生物序列中的长度为k的子序列。长度为L的序列对于一个给定的K可以得到L-k+1个k-mers。

复制代码
    def stream_kmers(seq, k):
    if k >= len(seq):
        return [seq]
    return (seq[i:i+k] for i in range(len(seq)-k+1))
    
    kmers = list(stream_kmers("CTCCCAGTTTACTATCGCACTGGTCGACTATACCTGTGCC", k=9))

[‘AACTGGGAG’,
‘AAACTGGGA’,
‘TAAACTGGG’,
‘GTAAACTGG’,
‘AGTAAACTG’,
‘TAGTAAACT’,
‘ATAGTAAAC’,
‘GATAGTAAA’,
‘CGATAGTAA’,
‘GCGATAGTA’,
‘TGCGATAGT’,
‘GTGCGATAG’,
‘AGTGCGATA’,
‘CAGTGCGAT’,
‘CCAGTGCGA’,
‘ACCAGTGCG’,
‘GACCAGTGC’,
‘CGACCAGTG’,
‘TCGACCAGT’,
‘GTCGACCAG’,
‘AGTCGACCA’,
‘TAGTCGACC’,
‘ATAGTCGAC’,
‘TATAGTCGA’,
‘GTATAGTCG’,
‘GGTATAGTC’,
‘AGGTATAGT’,
‘CAGGTATAG’,
‘ACAGGTATA’,
‘CACAGGTAT’,
‘GCACAGGTA’,
‘GGCACAGGT’]

全部评论 (0)

还没有任何评论哟~