本文關鍵給大家介紹了Python完成GB文件格式編碼序列編碼序列Fasta文件類型實例詳細說明,感興趣的小伙伴可以參考借鑒一下,希望可以有一定的幫助,祝愿大家盡可能發展,盡早工作上得到晉升
GB文件類型和FASTA文檔詳細介紹
在生物學中會有將GB文件格式編碼序列編碼序列成Fasta文件類型的需要,接下來我們運用python腳本制作來解決這些問題。
gb格式文檔是GenBank的文檔,用于儲存編碼序列的詳細資料。包括一個gene的名字,序號,發現人,論文參考文獻,外顯子部位,編碼區編碼序列,蛋白序列等等信息。
比如:
LOCUS NM_213806 849 bp mRNA linear MAM 24-SEP-2019 DEFINITION Sus scrofa Fas ligand(TNF superfamily,member 6)(FASLG),mRNA. ACCESSION NM_213806 VERSION NM_213806.1 KEYWORDS RefSeq. SOURCE Sus scrofa(pig) ORGANISM Sus scrofa Eukaryota;Metazoa;Chordata;Craniata;Vertebrata;Euteleostomi; Mammalia;Eutheria;Laurasiatheria;Cetartiodactyla;Suina;Suidae; Sus. REFERENCE 1(bases 1 to 849) AUTHORS Lin F,Fu YH,Han J,Shen M,Du CW,Li R,Ma XS and Liu HL. TITLE Changes in the expression of Fox O1 and death ligand genes during follicular atresia in porcine ovary JOURNAL Genet.Mol.Res.13(3),6638-6645(2014) PUBMED 25177944 REMARK GeneRIF:Data suggest forkhead box protein O1(FoxO1)involvement in the regulation of TNF-related apoptosis-inducing ligand TRAIL and Fas ligand FasL expression during follicular atresia. Publication Status:Online-Only REFERENCE 2(bases 1 to 849) AUTHORS Xie GH,Wang SJ,Wang Y,Zhang Y,Zhang HZ,Jin S,Wang QF,Liu ZC and Ge HL. TITLE Fas Ligand gene transfer enhances the survival of tissue-engineered chondrocyte allografts in mini-pigs JOURNAL Transpl.Immunol.19(2),145-151(2008) PUBMED 18503890 REMARK GeneRIF:the result indicates that the expression of FasL by chondrocytes is capable of inducing apoptosis of activated T cells REFERENCE 3(bases 1 to 849) AUTHORS Chang HW,Jeng CR,Lin CM,Liu JJ,Chang CC,Tsai YC,Chia MY and Pang VF. TITLE The involvement of Fas/FasL interaction in porcine circovirus type 2 and porcine reproductive and respiratory syndrome virus co-inoculation-associated lymphocyte apoptosis in vitro JOURNAL Vet.Microbiol.122(1-2),72-82(2007) PUBMED 17321702 REMARK GeneRIF:The expression of FAS and FAS ligand in splenic macrophages co-infected with porcine circovirus 2 and porcine reproductive and respiratory syndrome virus is reported REFERENCE 4(bases 1 to 849) AUTHORS Tayade C,Black GP,Fang Y and Croy BA. TITLE Differential gene expression in endometrium,endometrial lymphocytes,and trophoblasts during successful and abortive embryo implantation JOURNAL J.Immunol.176(1),148-156(2006) PUBMED 16365405 REFERENCE 5(bases 1 to 849) AUTHORS Bai L,Maedler K,Donath M and Tuch BE. TITLE Expression of Fas but not Fas ligand on fetal pig beta cells JOURNAL Xenotransplantation 11(5),426-435(2004) PUBMED 15303979 REMARK GeneRIF:FasL was not detected on fetal pig pancreatic cells but could be induced on both beta and non-beta cells when the cells were treated with IL1beta. Erratum:[Xenotransplantation.2016 Mar;23(2):171-2.PMID:27106874] REFERENCE 6(bases 1 to 849) AUTHORS Tsuyuki S,Kono M and Bloom ET. TITLE Cloning and potential utility of porcine Fas ligand:overexpression in porcine endothelial cells protects them from attack by human cytolytic cells JOURNAL Xenotransplantation 9(6),410-421(2002) PUBMED 12371937 REFERENCE 7(bases 1 to 849) AUTHORS Motegi-Ishiyama Y,Nakajima Y,Hoka S and Takagaki Y. TITLE Porcine Fas-ligand gene:genomic sequence analysis and comparison with human gene JOURNAL Mol.Immunol.38(8),581-586(2002) PUBMED 11792426 REFERENCE 8(bases 1 to 849) AUTHORS Muneta Y,Shimoji Y,Inumaru S and Mori Y. TITLE Molecular cloning,characterization,and expression of porcine Fas ligand(CD95 ligand) JOURNAL J.Interferon Cytokine Res.21(5),305-312(2001) PUBMED 11429161 COMMENT PROVISIONAL REFSEQ:This record has not yet been subject to final NCBI review.The reference sequence was derived from AB027297.1. ##Evidence-Data-START## Transcript exon combination::AB027297.1,AF397407.1[ECO:0000332] RNAseq introns::single sample supports all introns SAMN01893940,SAMN01915393 [ECO:0000348] ##Evidence-Data-END## FEATURES Location/Qualifiers source 1..849 /organism="Sus scrofa" /mol_type="mRNA" /db_xref="taxon:9823" /chromosome="9" /map="9" gene 1..849 /gene="FASLG" /gene_synonym="CD95-L;FASL;TNFSF6" /note="Fas ligand(TNF superfamily,member 6)" /db_xref="GeneID:396726" CDS 1..849 /gene="FASLG" /gene_synonym="CD95-L;FASL;TNFSF6" /note="CD95 ligand;tumor necrosis factor(ligand) superfamily,member 6;fas antigen ligand" /codon_start=1 /product="tumor necrosis factor ligand superfamily member 6" /protein_id="NP_998971.1" /db_xref="GeneID:396726" /translation="MQQPFNYPYPQIFWVDSSATSPWASPGSVFPCPASVPGRPGQRR PPPPPPPPPPPPTLLPSRPLPPLPPPSLKKKRDHNAGLCLLVMFFMVLVALVGLGLGM FQLFHLQKELTELRESASQRHTESSLEKQIGHPNLPSEKKELRKVAHLTGKPNSRSIP LEWEDTYGIALVSGVKYMKGSLVINDTGLYFVYSKVYFRGQYCNNQPLSHKVYTRNSR YPQDLVLMEGKMMNYCTTGQMWARSSYLGAVFNLTSADHLYVNVSELSLVNFEESKTF FGLYKL" mat_peptide 1..390 /gene="FASLG" /gene_synonym="CD95-L;FASL;TNFSF6" /product="ADAM10-processed FasL form.{ECO:0000250}" /experiment="experimental evidence,no additional details recorded" /note="propagated from UniProtKB/Swiss-Prot(Q9BEA8.1)" mat_peptide 1..249 /gene="FASLG" /gene_synonym="CD95-L;FASL;TNFSF6" /product="FasL intracellular domain.{ECO:0000250}" /experiment="experimental evidence,no additional details recorded" /note="propagated from UniProtKB/Swiss-Prot(Q9BEA8.1)" misc_feature 244..249 /gene="FASLG" /gene_synonym="CD95-L;FASL;TNFSF6" /experiment="experimental evidence,no additional details recorded" /note="Cleavage,by SPPL2A.{ECO:0000250};propagated from UniProtKB/Swiss-Prot(Q9BEA8.1);cleavage site" misc_feature 247..309 /gene="FASLG" /gene_synonym="CD95-L;FASL;TNFSF6" /experiment="experimental evidence,no additional details recorded" /note="propagated from UniProtKB/Swiss-Prot(Q9BEA8.1); transmembrane region" misc_feature 388..393 /gene="FASLG" /gene_synonym="CD95-L;FASL;TNFSF6" /experiment="experimental evidence,no additional details recorded" /note="Cleavage,by ADAM10.{ECO:0000250};propagated from UniProtKB/Swiss-Prot(Q9BEA8.1);cleavage site" mat_peptide 391..846 /gene="FASLG" /gene_synonym="CD95-L;FASL;TNFSF6" /product="Tumor necrosis factor ligand superfamily member 6,soluble form.{ECO:0000250}" /experiment="experimental evidence,no additional details recorded" /note="propagated from UniProtKB/Swiss-Prot(Q9BEA8.1)" misc_feature 553..555 /gene="FASLG" /gene_synonym="CD95-L;FASL;TNFSF6" /experiment="experimental evidence,no additional details recorded" /note="N-linked(GlcNAc...)asparagine.{ECO:0000255}; propagated from UniProtKB/Swiss-Prot(Q9BEA8.1); glycosylation site" misc_feature 751..753 /gene="FASLG" /gene_synonym="CD95-L;FASL;TNFSF6" /experiment="experimental evidence,no additional details recorded" /note="N-linked(GlcNAc...)asparagine.{ECO:0000255}; propagated from UniProtKB/Swiss-Prot(Q9BEA8.1); glycosylation site" misc_feature 781..783 /gene="FASLG" /gene_synonym="CD95-L;FASL;TNFSF6" /experiment="experimental evidence,no additional details recorded" /note="N-linked(GlcNAc...)asparagine.{ECO:0000255}; propagated from UniProtKB/Swiss-Prot(Q9BEA8.1); glycosylation site" exon 1..351 /gene="FASLG" /gene_synonym="CD95-L;FASL;TNFSF6" /inference="alignment:Splign:2.1.0" exon 352..397 /gene="FASLG" /gene_synonym="CD95-L;FASL;TNFSF6" /inference="alignment:Splign:2.1.0" exon 398..454 /gene="FASLG" /gene_synonym="CD95-L;FASL;TNFSF6" /inference="alignment:Splign:2.1.0" exon 455..849 /gene="FASLG" /gene_synonym="CD95-L;FASL;TNFSF6" /inference="alignment:Splign:2.1.0"
ORIGIN
1 atgcagcagc ccttcaatta cccatacccc caaatcttct gggtggacag cagtgctacc
61 tctccctggg cctccccagg ctcagtcttc ccctgtccag cttctgtgcc aggaaggcca
121 gggcaaagga ggccaccacc accaccgccg ccaccgccac caccaccaac actcctgcca
181 tcaagaccgc tgcctccact gccaccgcca tctctgaaga agaagaggga ccacaatgca
241 ggcctgtgtc tccttgtgat gttcttcatg gttctggtgg ccctggttgg attggggctg
301 gggatgtttc agctcttcca cctacagaag gagctgactg aactcagaga gtctgccagc
361 caaaggcata cagaatcatc tttggagaag caaataggtc accccaatct accctctgag
421 aaaaaggagc tgagaaaagt ggcccactta acaggcaagc ctaactcaag atccatccct
481 ctggaatggg aagacaccta tggaattgcc ttggtctctg gggtgaagta tatgaagggc
541 agccttgtga tcaatgacac tgggctgtat tttgtgtatt ccaaagtgta cttccggggt
601 cagtactgca acaaccagcc cctgagtcac aaggtataca caaggaactc taggtatccc
661 caggacctgg tgctgatgga gggaaagatg atgaactatt gcactactgg ccaaatgtgg
721 gcccgcagca gctacctggg ggctgtgttc aatctcacca gcgctgacca tttatatgtc
781 aacgtatctg agctctctct ggtcaatttt gaggaatcta agacattttt tggcttatat
841 aagctctga
//
fasta格式是一種基于文本用于表示核酸序列或多肽序列的格式。其中核酸或氨基酸均以單個字母來表示,且允許在序列前添加序列名及注釋。該格式已成為生物信息學領域的一項標準。
例如:
>NM_213806
ATGCAGCAGCCCTTCAATTACCCATACCCCCAAATCTTCTGGGTGGACAGCAGTGCTACC
TCTCCCTGGGCCTCCCCAGGCTCAGTCTTCCCCTGTCCAGCTTCTGTGCCAGGAAGGCCA
GGGCAAAGGAGGCCACCACCACCACCGCCGCCACCGCCACCACCACCAACACTCCTGCCA
TCAAGACCGCTGCCTCCACTGCCACCGCCATCTCTGAAGAAGAAGAGGGACCACAATGCA
GGCCTGTGTCTCCTTGTGATGTTCTTCATGGTTCTGGTGGCCCTGGTTGGATTGGGGCTG
GGGATGTTTCAGCTCTTCCACCTACAGAAGGAGCTGACTGAACTCAGAGAGTCTGCCAGC
CAAAGGCATACAGAATCATCTTTGGAGAAGCAAATAGGTCACCCCAATCTACCCTCTGAG
AAAAAGGAGCTGAGAAAAGTGGCCCACTTAACAGGCAAGCCTAACTCAAGATCCATCCCT
CTGGAATGGGAAGACACCTATGGAATTGCCTTGGTCTCTGGGGTGAAGTATATGAAGGGC
AGCCTTGTGATCAATGACACTGGGCTGTATTTTGTGTATTCCAAAGTGTACTTCCGGGGT
CAGTACTGCAACAACCAGCCCCTGAGTCACAAGGTATACACAAGGAACTCTAGGTATCCC
CAGGACCTGGTGCTGATGGAGGGAAAGATGATGAACTATTGCACTACTGGCCAAATGTGG
GCCCGCAGCAGCTACCTGGGGGCTGTGTTCAATCTCACCAGCGCTGACCATTTATATGTC
AACGTATCTGAGCTCTCTCTGGTCAATTTTGAGGAATCTAAGACATTTTTTGGCTTATAT
AAGCTCTGA
處理步驟
將文件夾下gb文件批量讀取
將各個gb文件中的登錄號和具體序列抽提出來,并寫入fasta文件
將fasta文件進一步處理去掉換行符,使一個完整的序列中沒有換行符
將所有處理好的fasta文件存入一個新建的子文件夾中
Python腳本代碼如下:
#!/usr/bin/env python #-*-encoding:utf-8-*- ''' File:gb2fasta.py Time:2020/07/04 14:15:13 Author:Ai Version:1.0 Contact:aqy0716 163.com License:(C)Copyright 2020 SCAU Desc:將gb文件轉換為fasta文件,同時轉成無換行符,最后存入新的子文件夾中 ''' #here put the import lib import os import shutil def gb2fasta(path): #從gb文件中抽取登錄號和具體序列信息存入同名fasta文件 #讀取文件夾相關信息 for root,dirs,files in os.walk(path): for file in files: #打印文件所屬目錄 print(root+''+file) #獲取文件路徑 path_gb=os.path.join(root,file) flag=0 if path_gb[-2:]=='gb': #打開新建fasta文件準備寫入 fasta=open(path_gb[:-2]+'fasta','w') #打開gb文件,準備讀取序列信息并寫入fasta文件 with open(path_gb,'r')as f: #逐行掃描 for line in f: #如果是ACCESSION行,則寫入fasta文件作為序列標題 if line[0:9]=='ACCESSION': fasta.writelines('>'+line.split()[1]+'n') #如果是ORGIN行,代表是序列 elif line[0:6]=='ORIGIN': flag=1 elif flag==1: #通過空格符(空格換行制表)對字符串進行切片 s=line.split() #非空切片字符打印 if s!=[]: #print(s) #去掉列表首個元素(數字序號)后,連接所有元素即為完整序列按行寫入fasta文件 seq=''.join(s[1:]) fasta.writelines(seq.upper()+'n') fasta.close() def multi2single(path): #此函數功能為:將多行序列轉換為單行序列(即去掉換行符),成為標準fasta文件 for root,dirs,files in os.walk(path): for file in files: path_full=os.path.join(root,file) #有fasta且不為single.fasta,才進行單行轉換,否則會重復創建文件夾 a=path_full[-5:]=='fasta' if path_full[-12:]=='single.fasta': b=True else: b=False b=bool(1-b) if a&b: fr=open(path_full,'r') fw=open(path_full[:-6]+'_single.fasta','w') seq={} for line in fr: if line.startswith('>'):#判斷字符串是否以‘>開始' name=line.split()[0]#以空格為分隔符,并取序列為0的項。 seq[name]='' else: seq[name]+=line.replace('n','') fr.close() for i in seq.keys(): fw.write(i) fw.write('n') fw.write(seq<i>) fw.write('n') fw.close() def copy2subdir(path): #將生成的_single.fasta文件存入新的子文件夾.singl_fasta中 root1=path for root,dirs,files in os.walk(path): subdir=os.path.join(root1,'singl_fasta') for file in files: oldfile=os.path.join(root1,str(file)) newfile=os.path.join(root1,'singl_fasta',str(file)) if not os.path.exists(subdir): os.makedirs(subdir) print("目錄創建成功!") if oldfile[-12:]=='single.fasta': if not os.path.exists(newfile): shutil.copyfile(oldfile,newfile) print("n你的fasta文件保存在:"+subdir+"文件夾下n") if __name__=="__main__": path=input("請輸入路徑")#此處輸入D:docugb2fasta gb2fasta(path) multi2single(path) copy2subdir(path)
運行情況
PS D:vscode_python_magic>&d:/ruanjiancangku/python_projectkotin/venv/Scripts/python.exe d:/vscode_python_magic/實驗室-magic/序列處理/gb2fasta.py D:docugb2fasta1-FASLG-swine-849bp-NM_213806.fasta D:docugb2fasta1-FASLG-swine-849bp-NM_213806.gb ['1','atgcagcagc','ccttcaatta','cccatacccc','caaatcttct','gggtggacag','cagtgctacc'] ['61','tctccctggg','cctccccagg','ctcagtcttc','ccctgtccag','cttctgtgcc','aggaaggcca'] ['121','gggcaaagga','ggccaccacc','accaccgccg','ccaccgccac','caccaccaac','actcctgcca'] ['181','tcaagaccgc','tgcctccact','gccaccgcca','tctctgaaga','agaagaggga','ccacaatgca'] ['241','ggcctgtgtc','tccttgtgat','gttcttcatg','gttctggtgg','ccctggttgg','attggggctg'] ['301','gggatgtttc','agctcttcca','cctacagaag','gagctgactg','aactcagaga','gtctgccagc'] ['361','caaaggcata','cagaatcatc','tttggagaag','caaataggtc','accccaatct','accctctgag'] ['421','aaaaaggagc','tgagaaaagt','ggcccactta','acaggcaagc','ctaactcaag','atccatccct'] ['481','ctggaatggg','aagacaccta','tggaattgcc','ttggtctctg','gggtgaagta','tatgaagggc'] ['541','agccttgtga','tcaatgacac','tgggctgtat','tttgtgtatt','ccaaagtgta','cttccggggt'] ['601','cagtactgca','acaaccagcc','cctgagtcac','aaggtataca','caaggaactc','taggtatccc'] ['661','caggacctgg','tgctgatgga','gggaaagatg','atgaactatt','gcactactgg','ccaaatgtgg'] ['721','gcccgcagca','gctacctggg','ggctgtgttc','aatctcacca','gcgctgacca','tttatatgtc'] ['781','aacgtatctg','agctctctct','ggtcaatttt','gaggaatcta','agacattttt','tggcttatat'] ['841','aagctctga'] ['//'] D:docugb2fasta2-LTA-swine-1584bp-NM_214453.fasta D:docugb2fasta2-LTA-swine-1584bp-NM_214453.gb ['1','agaaaggggc','ccacaggggt','cccgcacagc','aggtgagact','ctcccacccc','atctcctagg'] ['61','gctgtccggg','tgctggactc','ccccctcact','tcggtccctc','cgcccgctcc','ctggccttcc'] ['121','tgcccctcct','gcatcttcac','cccggcctgg','gccttggtgg','gtttggtttt','ggtttgttct'] ['181','ctctgattct','ttatctgtca','ggctctttct','agctctcaca','cactctgatc','cctctctgtt'] ['241','cccttcccat','ctctgtttct','ctctgggtct','ccccctgctc','acctcgggat','ttccctgagt'] ['301','gcctctggtc','cccttctctg','tctggcgccc','cgtctcttgt','ctctcggggt','ggctgtctcc'] ['361','gagggcagga','ggccttcttc','cgcaggtgcc','ccgccccgct','cactgtctct','ctccccccac'] ['421','aggttttccc','catgacacca','cctggacgcc','tctacctccg','gagggtgtgc','agcaccccca'] ['481','tcctcctcct','cctggggctg','ctgctggccc','tgccgcccga','ggcccagggg','ctccctggcg'] ['541','tcggcctccc','accctcagct','gcacagcctg','cccatcagca','ccccccaaag','cacttggcca'] ['601','gaggcaccct','caaacctgcc','gctcacctcg','ttggagaccc','cagcaccccg','gactcactgc'] ['661','gctggagagc','gaacacggat','cgtgccttcc','tccgccatgg','cttcttgctg','agcaacaact'] ['721','ccctgctggt','ccccaccagt','ggcctctact','ttgtctactc','ccaggtcgtc','ttctccgggg'] ['781','aaggctgctt','ccccaaggcc','acccccaccc','ctctctacct','ggcccacgag','gtccagctct'] ['841','tctcctccca','gtaccccttc','cacgtgccgc','tcctcagcgc','tcagaagtcc','gtgtgccccg'] ['901','ggccacaggg','accttgggtg','cgctctgtgt','accagggggc','tgtgttcctg','ctcacccagg'] ['961','gagatcagct','gtccacacac','acagacggca','ccccccacct','gctcctcagc','cccagtagcg'] ['1021','tcttctttgg','agccttcgct','ctatagaaga','atccagaaag','aaaaaaattg','gtttcaaggc'] ['1081','cttctcccct','tttcacctcc','cttatgacca','cttcggaggt','caccgcgcct','ctcctctgac'] ['1141','aatttccaac','agtctcatct','tcccccacgc','tcagcacctg','gagcttctgt','agaaggaatt'] ['1201','ctaggcacct','cgggggaact','ggaaccaccc','cggatgctct','gctgaggatc','tgaatgcccg'] ['1261','cctggagccc','ttcccctgtc','ctgcccgtct','aggggccctc','gtccaggacg','tggaagggaa'] ['1321','gctgacccat','gagggacttt','gaacggatga','ccggagcggt','gtgggggggt','tatttatgaa'] ['1381','ggggaaaatt','aaattattta','tttatggagg','atggagagaa','gggaatcaca','gagggatgtc'] ['1441','agaagagtgt','gacacatgtg','cccaagagat','aaagtgacag','aaggcatggg','ctccagatga'] ['1501','cccggccaga','gagggcaaag','tggctcagga','aggggctgct','tgactggagg','ctcatgagga'] ['1561','gacggctgac','cctcgatgaa','accc'] ['//'] D:docugb2fasta3-LTB-swine-950bp-NM_001185138.fasta D:docugb2fasta3-LTB-swine-950bp-NM_001185138.gb ['1','tcggatgggg','gcaccggggc','tggagggccg','gggtaggagg','ccccagggga','agggatgcct'] ['61','cctgctggcc','gtggcagggg','ccacttccct','ggtgaccctc','ctgctggccg','tgcctatcac'] ['121','ggtcctggct','gtgctggcct','tggtgcccca','ggagcaggga','gaactggtaa','cagggaccgc'] ['181','tgacccaggc','acccaggcgg','aggcccagca','gcgattggag','tccaaggaga','cgccagagga'] ['241','ggaggcagaa','acagatctca','gccccaggct','cccagctgcc','cacctcattg','gcgcttggat'] ['301','cacgggtcag','gggctaggct','gggaggcgaa','gaaagaagag','gcgtttctga','ggagcgggac'] ['361','gcagttctct','ggcgcggagg','gcctggccct','cccgcaggac','ggcctctact','acctctactg'] ['421','tcacgtcggc','taccggggcc','gggcacctcc','tcccggcggg','gaccccctgg','accgctcggt'] ['481','cacgctgctc','agccggctgt','accgggcggg','gggcgcctac','ggaccgggga','ctcccgagct'] ['541','gctgctggag','ggcgcggaga','ctgtgactcc','ggtcttggac','cccagtcgga','ggcacgagta'] ['601','cgggcccctc','tggtacacga','gcgtggggtt','cggtggcctg','gtgcagctcc','ggaggggcga'] ['661','gagggtgtac','gttaatatca','gtcaccccga','tatggtggat','tacaggagag','gaaagacctt'] ['721','cttcggggcg','gtgatggtgg','gctgaggact','gtccgcggcc','cgagaggacc','actgcatggt'] ['781','gggagtgtgt','cgatggatca','agcccagaca','cggggtccca','gacaccaggc','cagacaccat'] ['841','ggccgtgggg','aaaatgcagg','agatcgtgtg','gaaaactgat','tttgagcctg','atgaaaataa'] ['901','agaatgtaaa','agctttaatn','gctgcccatg','ccaaaaaaaa','aaaaaaaaaa'] ['//'] D:docugb2fasta4-TNF-swine-1666bp-NM_214022.fasta D:docugb2fasta4-TNF-swine-1666bp-NM_214022.gb ['1','cccagagtga','ggacaccagg','ggaccagcca','ggagagagac','aagccatctc','caggaccccc'] ['61','tagaaataac','ctctcagaag','acacaccccc','gaacaggcag','ccggacgact','ctctccctct'] ['121','cacacgctgc','cccggggcgc','caccatctcc','cagctggacc','tgagcccctc','tgaaaaagac'] ['181','accatgagca','ctgagagcat','gatccgagac','gtggagctgg','cggaggaggc','gctcgccaag'] ['241','aaggccgggg','gcccccaggg','ctccaggagg','tgcctgtgcc','tcagcctctt','ctccttcctc'] ['301','ctggtcgcag','gagccaccac','gctcttctgc','ctactgcact','tcgaggttat','cggcccccag'] ['361','aaggaagagt','ttccagctgg','ccccttgagc','atcaaccctc','tggcccaagg','actcagatca'] ['421','tcgtctcaaa','cctcagataa','gcccgtcgcc','cacgttgtag','ccaatgtcaa','agccgaggga'] ['481','cagctccaat','ggcagagtgg','gtatgccaat','gccctcctgg','ccaacggcgt','gaagctgaaa'] ['541','gacaaccagc','tggtggtgcc','gacagatggg','ctgtacctca','tctactccca','ggtcctcttc'] ['601','aggggccaag','gctgcccttc','caccaacgtt','ttcctcactc','acaccatcag','ccgcatcgcc'] ['661','gtctcctacc','agaccaaggt','caacctcctc','tctgccatca','agagcccttg','ccagagggag'] ['721','acccccgagg','gggccgaggc','caagccctgg','tacgaaccca','tctacctggg','aggggtcttc'] ['781','cagctggaga','aggatgatcg','actcagtgcc','gagatcaacc','tgcccgacta','tctggacttt'] ['841','gctgaatctg','ggcaggtcta','ttttgggatc','attgccctgt','gagggggcag','gacatccgtt'] ['901','ccctcccctg','tccatccctt','tattatttta','ctccttcaga','ccccctcacg','tccttctggt'] ['961','ttagaaagag','aatgaggggc','tggggactgg','gctccaagct','taaaacttta','aacaacaaca'] ['1021','gcaacactta','gaaatcaggg','attcagggat','gtgtggcctg','gacaaccagg','cactgaccac'] ['1081','caccaagaat','tggaactggg','gcttccagac','tcgctggggt','ccttgggttt','ggattcctgg'] ['1141','atgcaacctg','ggacatctgg','aatgtggctg','ccagggaagc','ttgggttcca','atcggaatac'] ['1201','ttcagaacat','tccttgagaa','gatttcacct','caatcttgat','gactttttag','gcttcccttt'] ['1261','cttccaattt','tccagacttc','cctgggatgg','ggagcccagc','cccaaacccc','acaggccagc'] ['1321','tccctcttat','ttatatttgc','acttggcatt','attatttatt','tatttattta','ttatttattt'] ['1381','actagtgaat','gtatttattc','aggagggcga','ggtgtcctgg','gagacccagc','ataagggctg'] ['1441','ccttggttca','gatgtgtttt','ctgtgaaaac','ggagctgaac','tgtaggttgc','tcccacctgg'] ['1501','cctcctagcc','tctgtgcctc','cttttgctta','tgtttttaaa','aacaaatatt','tatctgatcg'] ['1561','agttgtctaa','ataatgctga','tttggtgact','aacttgtcgc','tacatcgctg','aacctctgct'] ['1621','ccccagggga','gttgtgtctg','taaccgccct','actggtcagt','ggcgag'] ['//'] D:docugb2fasta5-TNFSF4-swine-549bp-NM_001025217.fasta D:docugb2fasta5-TNFSF4-swine-549bp-NM_001025217.gb ['1','atggaagggg','tccaacccct','agatgaaaat','gtgggaaacg','caccaggacg','aagactcttg'] ['61','aggaacaagc','tattgttggt','ggcctccgta','attcagggtc','tggggttgct','cctgtgtctc'] ['121','acctacatct','gcctgcacct','ctatgctcag','gtgccatctc','agtaccctcc','aattcagagt'] ['181','atcaaagtac','aatttaccaa','gtgtgaaaat','gataatggtt','tcatcatcac','accctcaagc'] ['241','aaggatggaa','ccatgaaagt','gcaaaacaac','tcaatcatca','tcaactgtga','tgggttctat'] ['301','ctcatctccc','tgaagggtta','cttttctcag','gagctcagcc','tcatgcttca','gtaccggaag'] ['361','ggtcggaaac','ctctcttctc','cctgaacaag','gtcaagtctg','tggactctgt','cacagtagcc'] ['421','gatctggctt','tcaaggacaa','ggtcttcctg','aacgtgacca','ctcatagtgc','ctcctgtgaa'] ['481','gacattcagg','tgaatggtgg','ggaattgatt','ctcattcatc','aaaatcctgg','tggattctgt'] ['541','gtctactga'] ['//'] D:docugb2fasta6-TNFSF10-swine-1696bp-NM_001024696.fasta D:docugb2fasta6-TNFSF10-swine-1696bp-NM_001024696.gb ['1','agcagtcaga','ccctgcctgg','accatggcgg','tgatgcagac','tccaggaggc','cccagccccg'] ['61','ggcagacctg','tgtgttgatc','ctgatcttca','cagtgctcct','gcaagccctc','tgtgtggcct'] ['121','tgacttacgt','gtacttcacc','aatgaactga','aacagatgca','ggacaagtac','tccaaaagcg'] ['181','gtatagcttg','cttcttaaag','gaagatgaca','gtttctggga','tcccaccgat','gacgagagaa'] ['241','tgctcagccc','ctgctggcag','gtgaagtggc','agctacgtca','gtttgtgaga','aagatgattt'] ['301','tgagaaccta','tgaggaaacc','atttctacag','tttcagaaaa','gcaacaaggc','attcctcacc'] ['361','tagaaagaga','aaaaggtcca','cagagagtgg','ctgctcacat','aactggaacc','agtaggaaaa'] ['421','gaagcacatt','tccatctcta','agctccaaat','atgaaaaagc','tttgggccag','aaaataaact'] ['481','cctgggaatc','atcaagaaaa','ggacattcat','tcttgaataa','ttttcacttg','aggaatggag'] ['541','agctggttat','ccatcaaaca','gggttttact','acatctattc','ccaaacatac','tttcgatttc'] ['601','aggaacctga','ggaaattttg','ggaacggttt','ctacagaagg','gaacagaaag','aaaaacaggc'] ['661','aaatgataca','gtatatttac','aaatggacaa','gctatcctga','ccctatactg','ctgatgaaaa'] ['721','gtgctagaaa','tagttgttgg','tctaaagatt','cagaatatgg','actctattcc','atctatcaag'] ['781','gtggaatatt','tgagcttaag','gaagatgacc','gaatttttgt','ctctgttact','aatgagcaac'] ['841','tgattgacat','ggaccaagaa','gccagttttt','tcggggcctt','tttaattggc','taaatgatct'] ['901','gcagggaaaa','aaaccatgcc','ccagagtgac','tattcagagt','cgtatactgt','gaaaatattc'] ['961','cagcagagcc','aataggttaa','ggcagcctga','gcaaagaggc','ctcaacccaa','aggctcaaca'] ['1021','acacaagctt','tttggaaagt','gaaaagtgac','caattccttc','caggaaaatg','aaactgccaa'] ['1081','gagaccttgt','ggagctctgc','ctgatgtcat','tttgctagta','aacatctaga','agatactctg'] ['1141','tctccaaatt','tgtgtaacaa','ttaacacctc','ctgcctttat','catctaatcc','tgtgaagatt'] ['1201','ctagaagaaa','gagtagtgat','ccatctcagg','tgggaataag','ggacaacatt','cccaaaacta'] ['1261','aagagaaaag','ggcagcactg','aaaggtcaca','gtcaatatat','gcagtttcag','tacaaacata'] ['1321','acaaattaaa','gctacgttta','gtggacaagg','agctacttct','gaatggtttg','tgcttttctc'] ['1381','tactaaaaat','caggctggcc','aaaagcactc','agggtatttt','tgataaagga','ctctaaaata'] ['1441','agtgataaag','tatggcgata','cctcagaaaa','ctaaatacag','aactaccaca','tgacccagca'] ['1501','atcccactcc','tgtgcatata','tctggacaaa','actttccttg','aaaaagatac','attcatctct'] ['1561','atgctcattg','cagcactatt','cacagtagcc','aagacatgga','aacacctata','tgtctatgaa'] ['1621','tggatgaata','gattaagaag','gtgtgttatg','tatacatant','ggaatactgg','gaagccataa'] ['1681','aaaggacaaa','gaggcc'] ['//'] D:docugb2fasta7-TNFSF13b-swine-1013bp-NM_001097498.fasta D:docugb2fasta7-TNFSF13b-swine-1013bp-NM_001097498.gb ['1','tctaggaggg','aaatggatga','ctccacgggg','gagcagtcac','gcctttcttg','ccttagcacg'] ['61','agagaagaaa','tgaaactgaa','ggagacggtc','cccatcctcc','cccagaagga','aagcccctct'] ['121','gtccgcatct','ccaaagatgg','gaagctgctg','gtcgtgaccc','tgctgctggc','cctgctgtcc'] ['181','tgctgcctca','cggggatctt','tgcaccacca','gctccaaggg','agagcagctc','cattcaaagc'] ['241','aacaggagta','agcgcgccgc','gcaggatgcg','gaggagacag','tcactcagga','ctgcttgcaa'] ['301','ttgattgcag','acagtgacat','gcctactata','cgaaaaggag','cttatacatt','tgttccatgg'] ['361','cttctcagct','ttaaaagagg','aagagcccta','gaagaaaaag','aaaataaaat','cgtggtcaaa'] ['421','gaaacgggtt','acttttttat','atacggtcag','gttttataca','ccgataacac','ctttgccatg'] ['481','gggcatctca','tacagaggaa','gaaagtccat','gtctttgggg','atgaactgag','tctggtgact'] ['541','ttgttccgat','gtattcaaaa','tatgcctgaa','acactaccca','ataattcctg','ttattcagct'] ['601','ggcattgcaa','agctggagga','aggagatgaa','ctccaactgg','caataccacg','tgaagacgct'] ['661','aaaatatcac','gggatggaga','cggcacattt','tttggtgcat','tgaaacttct','gtgacctact'] ['721','tacaccttgt','ttgtggctct','tgccctccct','ccctctgtac','ctctaaagag','aaaacactta'] ['781','actggaaata','ccaaaagggg','aaaaaaaagt','agttaccata','gccttttctg','tgagctgttt'] ['841','gttttggttt','gctgaaacta','gaccaaaaca','ggaaatttaa','cagacaacca','cagccaaagg'] ['901','gtatcatgtg','aattacaaga','aatagagccc','atttaagaaa','aaatagaatt','agaaagactt'] ['961','ttcactgtaa','tgccatgttg','aacagcttag','tcatagcttc','ttgtcttgga','gga'] ['//'] D:docugb2fasta8-TNFSF18-swine-5005bp-XM_005667782.fasta D:docugb2fasta8-TNFSF18-swine-5005bp-XM_005667782.gb ['1','gcattccatt','taacaaatga','aaaggctaag','gcataaagaa','ccaggagaga','gaaccggaga'] ['61','tttcctcaat','tttagtgtag','taaatcaaca','gttttagtgc','taggaagttt','tttggaatag'] ['121','agtgtaaact','cagatggtag','gacagggtgc','atgaaaatat','ccttttctat','gataacttta'] ['181','tttctgtctg','atgtcagcat','tgaaatttca','gagtattaaa','atggtgaggt','atgaaaaaac'] ['241','taagcttgtt','gttattgaca','tatttttaaa','aataaaattc','tagtaataca','ttactgtttt'] ['301','ctagagatta','tctcagaatg','gacacattgt','ataccctagc','aattgatgaa','aatatttttc'] ['361','ccaaaccatg','aaccccagca','tccttagctc','cctgacctac','tgcctcccac','aaacatgata'] ['421','tttggagtat','gatagacctt','catcttgaat','ttcattcttt','ttcttacaaa','agtaattttc'] ['481','ttatctggaa','ataatttgta','atgttgaata','gttccatagt','tcctcttgct','tcagaaaata'] ['541','tttatttttc','ctcttaactt','cccttgttgg','gttttttttt','tttaaatcat','ctgtgtttgt'] ['601','gttggcttct','agccctcagt','tccagcacct','ttggtctggt','gccaaatgtt','agtcagcact'] ['661','taggctaaaa','gtatcgtttt','ccaacaccca','gatcagaagg','aaaactccgc','ctcttacacc'] ['721','cactacttag','tgctatacta','caaaactgac','tagttgaatc','atgtgctcat','tacttctgaa'] ['781','tttctgcttt','tcacaactct','cattcctgca','gagaatgagt','ttgagccaca','tggagaatat'] ['841','gcctttaagc','cattcaagtc','ctcacgcagc','acagagacca','tcctggaagc','aatggctact'] ['901','ctactcaaca','atagttattt','tgctattact','ttgctccttc','agtgcactaa','tcttaacttt'] ['961','tctcccactc','aagacctcca','acaggccatg','tgtagcgaag','tttggaccat','taccttcaaa'] ['1021','atggcaaatg','ccatctcctg','agccttcttg','tgtgaataag','acagatgatt','ggaggctgaa'] ['1081','gatacttcag','aatggcttgt','atttaattta','tggccaagtg','gctcccaaca','cagcttacaa'] ['1141','ggggcaagct','ccttttgagg','tgttgctacg','taggaatgaa','gaccccatac','aatctctaac'] ['1201','gaacaattct','acagtccaga','atgtaggagg','ggcttatgaa','tttcatgctg','gagatgtaat'] ['1261','agacttgata','ttcaatgctg','aacatcaggt','tctaaaaaat','aatacatact','gggggatctt'] ['1321','tctgctagca','aatccccaat','tcatctccta','gagactcagt','taggtctcct','catcttcagc'] ['1381','acatgcagag','atgccagtgc','ataggatgga','gaaggaagat','tttcaacaca','tacagttcat'] ['1441','ctgggtatac','aaatcaacat','gaacagatct','cctctgcatg','tgaagcttca','tttctcctgc'] ['1501','ttattgaatg','agactcagaa','agcactgaag','acatttggtt','acccctgatg','ttgggtcagc'] ['1561','aaagacactt','tactagttca','tgataaaatg','aaaatgggtg','gctggaagac','aaaatctttt'] ['1621','caaagtgtct','gtctaatcct','tgaacccctg','agtggaaaaa','tgaggtctat','tcccataata'] ['1681','gccttatata','gcatgcaaaa','aaagaccagg','gcagtagcct','ggtcttgttc','ttatattctt'] ['1741','ggactgtgga','ctgtttcaat','tcattcttcc','catattctca','tcttaggaga','cactcttaat'] ['1801','aaaatgtagt','cagagtgggt','gtgtggccag','caacactcca','ttttggagtt','gatgagatta'] ['1861','ggggatagag','aacactctta','ggaaatattg','ggacagaatt','tcagttggca','ttgaaatgga'] ['1921','atgcacttta','ttcgggaatt','tcacttgatt','tcatcatcaa','gtgcagggtg','ctctataaaa'] ['1981','cctgctggtc','aaaaggctag','ctttcaatct','tcacatagca','gttcatgaga','atttactggt'] ['2041','gtatgtatct','aaccatgtca','atgacaaaga','gtaatcatta','gtagtaagat','ctaacccccc'] ['2101','aaattggtat','taccagtact','gtactttgca','actgtgcaga','gccagctaaa','aatatgaaat'] ['2161','cattacatga','caaagcactt','tcatatacca','catggcaact','cgatagattt','aatggggcag'] ['2221','atatttttgg','cacaatttta','actatgagaa','tacagaggca','gatggaatag','aggtaacttg'] ['2281','gttcagttca','tagaactagt','aattaacagg','cacctgggct','tcccactgta','ttatactata'] ['2341','ttagctttac','gattggtatt','tctgctatca','tgttagaagc','ctataaactt','taacagattt'] ['2401','aaaattttca','gacagtatat','tcccttttag','tccaacagca','attttttcct','ttctcagcaa'] ['2461','atttcttttc','ttttctttgc','ctggagcagg','gtacccaggg','tgttattcaa','gacttactac'] ['2521','aacttaatct','ccttccttac','tttggtcaaa','tgtgttaact','tccaaaaata','atgaataata'] ['2581','ctcaattcag','ggacagtctg','ttaaattttt','ggactctgca','aaattaacta','gctgcttatg'] ['2641','ggttgttatt','aaaaggtatg','taggtaatgt','gattacatga','aaacccaatt','taaaatattt'] ['2701','atggatattt','gtaaaaaatc','tacattatgt','taattaatag','tatcaccatt','aaaaactaat'] ['2761','ttaagaatat','ttgtattgta','tgtaagaaaa','actgcttgga','agcagactaa','gcctgaggcc'] ['2821','aagatgcctc','atagtatgtc','tttttttttt','tttttttaaa','tacatctgct','gagcagctgt'] ['2881','agggacaaag','actggggtac','ctggttcctc','ttgtatttgt','gtatcatctc','aggaaattaa'] ['2941','agttacataa','catacatata','tttatggaaa','cgtggtattg','atgttaactt','ataagcagta'] ['3001','gtgtgctgga','gtgggctagc','actagctcag','gagagctgtt','aaatttttat','taattgtgta'] ['3061','gtctggttat','taaatcatta','tccttgaaat','tggccatggt','aggacaattt','ataccatgtg'] ['3121','aattagcaaa','tgctacaaat','cagggctttt','ctttttggaa','agcccatgca','ccagcacacc'] ['3181','actgtttata','aaactcttct','taatgactcc','tctcagcccc','tgcctcagta','ttacaacagt'] ['3241','caaggcaggc','aaggaaagtg','tcttactctc','agcaaaagcc','ccacagataa','atcatttctc'] ['3301','agggcaggtg','gaggaatcta','cagctgtaac','cagatagata','gctaccaaca','tatgaccttt'] ['3361','gaatttccct','agtgttgaaa','tttcaggctt','tgttttcaat','gtatactctg','ttcccttgtt'] ['3421','tcttcaaaac','agtgtttata','ttttaaactg','acaataaaat','gtttgtacat','gggctgtagc'] ['3481','tgatttatct','atgggttatc&
文章版權歸作者所有,未經允許請勿轉載,若此文章存在違規行為,您可以聯系管理員刪除。
轉載請注明本文地址:http://specialneedsforspecialkids.com/yun/128683.html
此篇文章關鍵給大家介紹了應用Python腳本制作獲取基因組測序指定位置編碼序列的實例詳細說明,感興趣的小伙伴值得借鑒參考一下,也希望能有一定的幫助,祝愿大家多多的發展,盡早漲薪 前言 在基因組分析中,大家常常會有這樣一個要求,便是在一個fasta文件中獲取某些編碼序列出去。有時候這種編碼序列注定是完備的編碼序列,而有時候只是為原fasta文件中某一段編碼序列中的一部分。尤其是當信息量許多時,應...
這篇文章主要為大家介紹了Python腳本提取fasta文件單序列信息實現示例,有需要的朋友可以借鑒參考下,希望能夠有所幫助,祝大家多多進步,早日升職加薪 此篇文章關鍵給大家介紹了Python腳本制作獲得fasta文件單編碼序列信息內容完成實例,感興趣的小伙伴可以參考借鑒一下,希望可以有一定的幫助,祝愿大家多多的發展,盡早漲薪 Python腳本制作編寫 應用Python對fasta格式編碼...
本文主要是給大家介紹了python從gbff文件上直接獲取cds編碼序列實例詳細說明,感興趣的小伙伴可以參考借鑒一下,希望可以有一定的幫助,祝愿大家多多的發展,盡早漲薪。 什么叫GBFF文檔 GenBank純文本文件類型(GenBankflatfile,通稱GBFF) GBFF是GenBank數據庫系統的相關信息企業 GBFF編碼序列文檔由單獨的編碼序列具體內容構成。 編碼序列具體內...
摘要:使用中文替代中文中文編碼中文編碼中有以上兩種聲明字符串變量的方式,它們的主要區別是編碼格式的不同,其中,的編碼格式和文件聲明的編碼格式一致,而的編碼格式則是。 字符串是Python中最常用的數據類型,而且很多時候你會用到一些不屬于標準ASCII字符集的字符,這時候代碼就很可能拋出UnicodeDecodeError: ascii codec cant decode byte 0xc4 ...
摘要:前后端交互過程中涉及的編碼首先,瀏覽器的設置里有設置編碼格式,一般設置為。按照設置的順序檢查檢測文件的編碼。 起因 最近在寫PHP,本身對PHP不太熟練。然后遇到編碼這個問題,困擾了大半天,索性,系統探索解決一番。 前后端交互過程中涉及的編碼 Browser cilent: 首先,瀏覽器的設置里有設置編碼格式,一般設置為UTF-8。 AJAX request: AJAX異步請求的過程...
閱讀 911·2023-01-14 11:38
閱讀 878·2023-01-14 11:04
閱讀 740·2023-01-14 10:48
閱讀 1982·2023-01-14 10:34
閱讀 942·2023-01-14 10:24
閱讀 819·2023-01-14 10:18
閱讀 499·2023-01-14 10:09
閱讀 572·2023-01-14 10:02