Gene Athe_0609 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAthe_0609 
Symbol 
ID7406950 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerocellum thermophilum DSM 6725 
KingdomBacteria 
Replicon accessionNC_012034 
Strand
Start bp688411 
End bp691821 
Gene Length3411 bp 
Protein Length1136 aa 
Translation table11 
GC content38% 
IMG OID643714991 
Productpullulanase, type I 
Protein accessionYP_002572507 
Protein GI222528625 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1523] Type II secretory pathway, pullulanase PulA and related glycosidases 
TIGRFAM ID[TIGR02104] pullulanase, type I 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0177218 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCAAAGT ATTTTAAAAG CAAACGTTTT ATAAGCATTT TGCTAGCGTT CATATTTTTC 
ACTACTCTTG TTTTCCCACT CACCATTCTT GGAGCAGACG AAAAAACAAC TCTCATCATT
CACTACTACA GATACAATGA AGACTATCAA GGCTGGAATT TGTGGATATG GCCTGTTGAG
CCAGTGGGTG CAGAAGGCAA AGCTTATGAG TTTACTTCCA AAGATGACTT TGGTGTAAAA
GCAGTAGTTG AACTTCCTGG AAAGGTGACA AAAGTAGGTA TCATAGTTAG AAAAGGTAAC
TGGGAAGCAA AAGATGTTGC TGTTGACAGG TTCATCTCAG GAATCAGCGG TTCAAAAGAG
GTTTGGCTGA TAGAAGGTGA AGAGCAAATC TATACATCTC AGCCTCAGAA AACTCCAAAA
ATGACAGCTT TTATTGATGG TCTTAATACC ATCGTTGTAA AGCTTGCAAA GAAGGCAGAT
ATCCTTTCAA ATAACAGGAC TCAAGGGTTC AAAGTTACAG CCTTTTATGA AGAAGTTCCT
ATCAAAAAAG TTGAGCCAGT TTTGCCTAAG ATAAATAAGA ATTTTAAACC AGAAGAAGCT
GGGTATGAAC TCATTGATGG TGGCACAAAA GTGAAATTTA TATTAAAACC TGGTGCAGGT
GATTTTAAAT TTACAGACAC CTCTGGCAAG CTAGATGTAT ATGTCTCTGG AACAATGAAC
GACTGGGGCG GAACAGCATC TTCCGAAGGA AAGTACAAAC CGCTTCCTGC ATGGAAAATG
ACATGGAATG CAGAGAAAGG TTACTATGAA CTTGTAAAAG AACTTGGCAA AGACGGTGTT
GTAATTGGTG CTAAGTTCAA GTTTACCTCA TGGGATGGTA CATCTGCAAA GTGGTATCCA
GATGGAATGG GAAATGACAA GGTTATTGAA GAGCTTTATA CAGGTAATGA GAAGATAACA
AAGGTTGACA CTTTTAAAAT CACAACTGAA GATGAATTAG AACCACAGGT TCCTTATGTT
GTGTCAAAAG ATAGCTTCAA ACCAACAGTT GCACAGGCAA GAAATATTTT AGACAATCCA
AAGTACTACT ACAAAGGCAA TGATTTAGGA TGCACATATA CAAAAGCTTA TTCAGCGTTC
AGACTTTGGG CTCCAACAGC AATTGGAGTT ATACTAAGAC TCTACGATGA TTATAAGACT
ACCAAGTATA AAGAGTATGA AATGCAGCAG TCTTTCAACG GAACATGGTA TCTTAAGATA
AATGGAGATT TGAAAGGGAA ATACTACCAG TATGAAGTTT GGCATGCTTC TAACTCTATA
ACTGACGATA CAATCAGAAA ATATGTTGTG CCAGATCCAT ATTCAAGAGC AACATCTGCA
AATTCTGAAA GAACCTTAAT CTTTGATCCA AAAGATACAA ACCCTGTCGG CTGGGAAAAA
GATACTTTTG TGACTCTAAA GAATCAAGAG GATGCGATAA TTTATGAGAC GCATGTAAGA
GACTTTACAA TAGATGCTTC AAGTGGTGTA AGGCCAGAGT TTAGAGGCAA ATACTTAGGC
TTCACCCAAA CAGGGGCAAA AGGACCAAAT GGAGTAAAAA CAGGTATTGA CCATTTGAAA
GAGCTTGGTA TAACCCATGT ACATCTGCTT CCAACATATG ACTTTGGTTC AATAGATGAG
ACAAATCCTG ACAAAGGCTA TAACTGGGGA TATGACCCAG TTTTATATCA AAACGTTGAA
GGTTCCTATG CTACAAATCC AAACACAATT GTAAGGATTA AAGAGTACAA ACAAATGGTC
ATGGCACTGC ACAAGGCTGG CATAGGCATT ATCCAGGATG TTGTTTTTAA TCACACATTT
CAGATAGGTG ATGCAAAATT CTCAATATTC GACAAGATAG TACCAGGATA CTTCTACAGA
AAAGACAAAG ATGGGAACTA CTCCAATGCA TCAGGCTGTG GTAACGAAAT TGCAACAGAA
AAACCTATGG TAAGAAAGTT TATAATTGAT ACCTTGACAT ACCTTACAAA AGAGTATCAC
ATAGATGGTT TCAGGTTTGA CTTAATGGCA GCAATAGATA GAGTTACAAT GGCAAAAGCT
CAAGAAGAAG TAAGAAAGAT AAATCCATCA GCAGTAATCT ATGGAGAGGG CTGGCTTGCA
GGTTCAACAC CGCTTGATAG CTCACTTAGA ATGGAAATAG GCTCATTTAA TCAGGCAGGC
CTTCACATAG GACTTTTTAA CGACAGAATA AGAGAGGCAA TAAGAGGAAA CCTTGACAAT
GAATCTAAGG GATTCATGCA AGGGAATTAC TCGTTTAGGC TTGAAGATCT CAAAAGAGGC
ATTCAAGGTG GTCTTGGTGA TTTTGCTGCA GACCCGGATG AATGCATAAA CTATGTTTCG
GCACATGACA ACCTAACTCT TTGGGATAAA CTTCAAAAGA GTGTGCCAAA TGAACCAGAT
TACATCAAGG ATAAAATGGG CAGACTTGCG AATGCAATTG TTTTGACAGC ACAGGGTGTT
CCATTCTTGC ATGGTGGAGT TGAATTCAAC CGAACAAAAT ATATGAATCA TAACTCATAC
AATGCGGGCG ATAAAATTAA TAAGTACAAC TGGAACCTTA AAGTAAAGTG GTACAACACT
TTCAAGTACT ATCAGGGGCT AATTGCATTA AGAAAAGCTC ATCCAGCTTT CAGGATGACA
ACTGCAGAAG ACATACAGAA ATATCTTACA TTTATCCAAA CACCGAAAGG AACATTAGGT
TTTAGACTCA CATATCCAAA AGATACATGG AATGACATTA TAGTTGTTTA CAACTCAACA
AAGAAAGTAC AAGAGGTCAC ACTGCCAGAA GGAAACTGGG TAGTTGTTGC AAATGGAGAT
GAAGTTGGCA CAACACCAAT TAAGAATCTT ACAAACTTTG TTGCTGGCAA GGCATTGGTT
GCACCAATTT CTATGTTTGT TGCATACAAG AGCAATGAAT TTCCACAAGG TTTTACTAAG
GTAACCGGTA AGGACCCTGT ATCATTAGAA AGCTCAAGTA CAGTAACAGT TCCAAAAGTT
TATGGTAATG GAAATATTGA AGTCACATTT AAAGTAAAGG TACCACATGG GACAGATGAT
GATGTTATCT ATTTGGCAGG TTCGTTTGGG AAAGCTGGAC TTTCTGATTG GAATCCAGGA
GATAAGGATG GAGCAATAGA ACTTGTAAGA TTGCAAGATG GGACATATAC TGTGACTGTT
AAACTTAACG CAGGTGAAAC ATTCGAATAT AAATACACAA GAGGTAGCTG GACTACAGTC
GAGAAAGGTG CAAATAAAGA AGAGATAGAG AATAGGAAAC TAACAGTTAA AGATGAAGGC
GGAGGAAAGA TGATAGTCAG CGATACAGTT TTGAACTGGG CTGATAAATA A
 
Protein sequence
MAKYFKSKRF ISILLAFIFF TTLVFPLTIL GADEKTTLII HYYRYNEDYQ GWNLWIWPVE 
PVGAEGKAYE FTSKDDFGVK AVVELPGKVT KVGIIVRKGN WEAKDVAVDR FISGISGSKE
VWLIEGEEQI YTSQPQKTPK MTAFIDGLNT IVVKLAKKAD ILSNNRTQGF KVTAFYEEVP
IKKVEPVLPK INKNFKPEEA GYELIDGGTK VKFILKPGAG DFKFTDTSGK LDVYVSGTMN
DWGGTASSEG KYKPLPAWKM TWNAEKGYYE LVKELGKDGV VIGAKFKFTS WDGTSAKWYP
DGMGNDKVIE ELYTGNEKIT KVDTFKITTE DELEPQVPYV VSKDSFKPTV AQARNILDNP
KYYYKGNDLG CTYTKAYSAF RLWAPTAIGV ILRLYDDYKT TKYKEYEMQQ SFNGTWYLKI
NGDLKGKYYQ YEVWHASNSI TDDTIRKYVV PDPYSRATSA NSERTLIFDP KDTNPVGWEK
DTFVTLKNQE DAIIYETHVR DFTIDASSGV RPEFRGKYLG FTQTGAKGPN GVKTGIDHLK
ELGITHVHLL PTYDFGSIDE TNPDKGYNWG YDPVLYQNVE GSYATNPNTI VRIKEYKQMV
MALHKAGIGI IQDVVFNHTF QIGDAKFSIF DKIVPGYFYR KDKDGNYSNA SGCGNEIATE
KPMVRKFIID TLTYLTKEYH IDGFRFDLMA AIDRVTMAKA QEEVRKINPS AVIYGEGWLA
GSTPLDSSLR MEIGSFNQAG LHIGLFNDRI REAIRGNLDN ESKGFMQGNY SFRLEDLKRG
IQGGLGDFAA DPDECINYVS AHDNLTLWDK LQKSVPNEPD YIKDKMGRLA NAIVLTAQGV
PFLHGGVEFN RTKYMNHNSY NAGDKINKYN WNLKVKWYNT FKYYQGLIAL RKAHPAFRMT
TAEDIQKYLT FIQTPKGTLG FRLTYPKDTW NDIIVVYNST KKVQEVTLPE GNWVVVANGD
EVGTTPIKNL TNFVAGKALV APISMFVAYK SNEFPQGFTK VTGKDPVSLE SSSTVTVPKV
YGNGNIEVTF KVKVPHGTDD DVIYLAGSFG KAGLSDWNPG DKDGAIELVR LQDGTYTVTV
KLNAGETFEY KYTRGSWTTV EKGANKEEIE NRKLTVKDEG GGKMIVSDTV LNWADK