Gene Athe_2571 details


Gene Information

Locus tag: Athe_2571
Symbol:
ID: 7409522
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anaerocellum thermophilum DSM 6725
Kingdom: Bacteria
Replicon accession: NC_012034
Strand:
Start bp: 2692513
End bp: 2697192
Gene Length: 4680 bp
Protein Length: 1559 aa
Translation table: 11
GC content: 37%
IMG OID: 643716933
Product: transglutaminase domain protein
Protein accession: YP_002574410
Protein GI: 222530528
COG category: [E] Amino acid transport and metabolism
COG ID: [COG1305] Transglutaminase-like enzymes, putative cysteine proteases
TIGRFAM ID: [TIGR01443] intein C-terminal splicing region
            [TIGR01445] intein N-terminal splicing region
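
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon; both are assumptions, chosen because they agree with the listed values. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields.
start_bp = 2692513
end_bp = 2697192
gene_length_bp = 4680
protein_length_aa = 1559

# Assumption: coordinates are 1-based and inclusive, so span = end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not counted in the protein.
codons, remainder = divmod(span, 3)
expected_protein = codons - 1

print(span, remainder, expected_protein)  # prints: 4680 0 1559
```

Under these assumptions the span equals the listed gene length, the length is a whole number of codons, and the codon count minus the stop codon equals the listed protein length.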


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0223142
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGTGCGCA GTAAATTATC AAGACTGATT GCTATTGTAT TATTAGTTGC ATTTATATTT 
AACTTTATAC TTCCAACTCA ACAGTTTGCT TTAGCAAAAC AGGCAAAACA AGTTGAGAGT
ACTTTTAAAA CAACGACGTT TGAGAGAAAA TATGAAATTC CAAAAACCAG AACAAAACTT
GGCTGTTTCA CAGATGAAAT ATATAAACTG TATGAGAAGC TCAAAGCAGA CATTGAAGCC
AATGACATAG ATAAAGTGAA ATCTGATATA AAAAACATAA AAAAGGTTTT AAAAGAGATT
AGAAAATCTA TTGTTACAGA ACTTGACAAA AACAGTAAAA TACTTGACAA GTTGAATGCA
ACTAAAGCCA AGCAAAGACA TAATCGGTTT AAATCTGAAT TGGAAAAGAA ATTGAACAAT
TTTGAAGTCT TATTTGATAA GCTTGATGAA CTTTTGAACC GCGTAAAAGA ATACAAAGGT
AAGGACATGG AACTTCTAAG GAACAAACTT CAGGAGATAG AGAATATATT AAATCCTGAA
CAGCCCCAGC AACCATTGGG TACACTTCCT CATAACAATG CTAGCTTAAC TCCACCTAAA
CCAGCAATTG GTAATGAAGC CGCAGCATCT GGGTATAGTG TTGCACAACA GGAGTCTTTG
GCAAATGTGT TGCCAAAGAC TCCTGTTGAT GCGGATTTGG CTGAGACTGC TGAGACAAAA
TTTACACAGG AAATCAAAAA TTTAGCTGAT TCACTCAAAA CACCAGTGAG AATGTATGAA
TTTGTAAAAA ACAACATAGA TTTTGAGCCT TACTATGGTT CGAGAAAAGG TGCAACTGGT
ACGTTAAACC AGTTAGCCGG CAATGACTAT GACCAAGCGT CGCTCTTGAT AGCAATGTTA
AGATACAAGG GCATTCCTTC ACGATATGTT AAAGGTATTG TTGAAGTGCC TGTGGAGAAG
GTAAAAAATT GGACAGGTGC TCAAACTGCC GAAGCAGCTG TAAAAGTGCT TGGCTCTCTT
GGTATACCTA CCGAATCTGT TGTGTCTGGT GGTGCAATTG TTGCTGTTCG ATTTGAGCAT
GTTTGGGTAG AAGCATATGT ACCCTATGAT TACTATCGCG GCGCAGGGGC TATGAAAGGT
CAAAAAGTGT GGGTTCCTCT AGATCCAAGT TTCAAACAAT ATAGTGTAGA ATCGGGTTTG
GATATTAAAG CAGTCACCAA AGTCACAGAT GAGCAAATTA TGGATGCATT TAAAGTTAGT
GGTGAAAGAG ATGGCGAGAC CATCACGAAA ATAGATATCG AAAAGATGAG CAGCTTCATG
GATGAGATTA AAGGGAAGCT GCAAGCCTAC ATTGAGGGGA ATAATCTTAG TAATGTTGAT
ACTAACAAAT TGATAGGTGG CAAGAAAATA AAGCCTGAAA AGCTGGGGAT GCTTCCTATT
ACATTGCCAT TCAAGATAGT TACAGTACTT GCAAAAACAA ATACGATACC TGATGCAAGT
AGTGAGAAAA TAGGATTTTC AATCAGGGGA AATGAACCGT TCAACTTGAA TTTTTCAGGA
ACGTATGATT TTAGTATTCA GTTCAAGGCA GTTGAACTAT ATGGTAAAAA GATAACTCTT
TCATGGATTC CAGCCACACA TGAGGACGAA GAAATTATCA ATCATTACGG CGGGTTATTT
AAAACACCTG CTTATATGGT TCAGTTGAAA CCGCAGCTGA AAATAGATGG CCGGGTTGTT
GCAGAAGGTA AGGCAGCAGG ATTTGGAAAT AGGCAGGAAT TTACAATTGA GATAGGACAT
GTAGGAAGAA CTGTTGAGAA GATAACCAAC ACTGTCACTG TTGGAGGATT TTATAGCATA
TGTTTTGATT TTGGCAAGAT TGATGTTAAA GAACTTGATG CTATCAAAGA GAGAGTGTCA
GTCATAAAAG ATACAGTAAA TGAGGAAAGC ATTTATACAG ATCAGGTAAT GGGAGAGATT
TTAAATAGTG CAGGTAAGAC ATACTTTGCT CAGCTTGATG CATTTAATTC AATTATGGCA
AGAGCAATGA AGGTAAGTTC TGTTCGCCAA GTTAGTGAAG CAATGACCGG CTACAGCCCA
ACTGTAAAGT ATATATTCAA TACTCCTGTT GAGGTAACAG GCGGTTCGTT TTATATTGAT
GTAGACCATG ATGTAATGGG TGTTGTTAGT CTTGAGGGTA AAAAGCAGAG CGAACGAGGA
TATATGATAA CATCAGGAGT AATAGCTTCT GCGCTAGAGC ATGGGATACA CGAACAAATA
TTTAAATTAC CTTCAGTTTC AGCAGTAAAG ATACTTACAG AAGCAAGCAA TAGATGGATA
CCTATATATA CTATAGGCAA GGATAACATT GACAGAATAA ATGAACTGGA TGTTTCCGAG
CATGTTAAAA CAGATATCAC CAATGCTGTT AACAGCGGGA AGATAGTAAT AATACCGCAA
AAAGAGATAA GGTACTACAA CTGGCAGGGA GCAGGTTATA TAGTTTTAGA TCCGGAAACT
GGCGCAGCAG GATACATGAT AAGCGGAGGG CTTGCAGGTG GTTCTGCAGC TATTGATGTT
ATGGTTGCGC TGGTTAGTTT AACTACACTT GCTTGGGCTA TATTTGATGT ACTGCAAATA
TCTTGTGCTT TAATTGCTGC AACCAATCCG CTTTTAGCAA TTATATTCTA TTCATTGTTT
GTAATCAGCA CAATCAATCT CATTATGACG TTAGAAACAA TAATTTTGTA CTGGGAGACA
AGAGATTATG AATATGCAAG CCAACTGTTT GGCGAGTTGA TACTAAATAT TGCTACATTT
GGAGTATTTA AGGTAATTGA ATATTTAGTA CCGGGTATTA TGACGTTATT CAAGACGGTA
AAAAATCAAC TGGATGAGAT AGCCCAGATA GCTGAACAAT TTGGAGATGA AGTTGCTGAA
GTTGCAGCAA GATACGGTCC TGATGCGATT GAGGCTATAA AGAGGTATGG TCCAGATGCT
GCGAGGGTGA TCAACAATTA TGGTGATAGT GCAGTAAAAG CTATGGCAAG AGGTATTGAC
CCTGCGCTTA TTGAAAAAAT GGACAGTTTA GCTGTTAAGG TAAATAAGCT TGAAAAATTC
AAGATACTAT CTCGTGAAGC AGCTCTCAAA GTTGTTGAGG TAGTGGAAAC AATAAAGGAC
TACTTGAAAA CATCTGTGGG AAGAGTTTTT GAAAAGATTA GGTCTGTGTA CAGGATTGAA
GATGAGTTAG ATTTAACAAC AGTGGATGGA TGTGAATTTG GCTTATCAAG GGCAAAACTG
AAGAAGAAAT TAATAGAAGA GGGAATGAGT GAGGATTCGG CTGAAGAATC ACTAAAATTT
TTAGAAGAAG GTTGCTTTAC CGGGGACACA ATTGTTATTA CAAAAGAGGG TAAAAAGAGG
ATAGATGAGA TAAAGATAGG TGATTTTGTT TTTGCGAAGG ATGTCAATAC AGGTAAGACA
GCTTATAAGA AAGTTAAACA GATTTATGTC AAGAGTGCGG AAGAGATTGT TCATATCAAA
GTTGGAGATG ATGAAGTAAA AACTACGAAA TCGCACTTAT TCTTTACAGA TTCGGGCTGG
TGGGAAGCAG CTGAGGATAT AAAAAGTGGT GACAAAATAG TAACACAAGA TGGTATAATG
AAAGTAGTAT ATGAAGTAGA AGTTGAGAAG TTAAGCGCAC CTGTAAAAAT TTATAATCTC
AACATAGAAG ATTATCATAC TTACTTTGTT GGAAGCTCTG GATTGCTTGT GCATAATGAC
TGCACACCTG AGGAGACTAA ACTTTGGGGA AAATGGCTAG ATGTAGCCGA AAAATTTAAG
ATAATTGAAA AAGAAGGTTT TGGTAAACTT GACGAAGGCA GAAAATCTTT TACCCAGGAA
TTAGATAATG TTCTTATAGG AAATTCACTT TCGAAAGAAG AGTTTTTAGA GTACATTAAG
AGATTTGTTC ATGACGAAAA AGAACAAAAA CTTCCATCTG AAGTAAGGAA AATATTGCTT
AAAATTAGAA GCTCAATACC AAAACCAGAT GAGGATACAG TATTAATAAG AGTTCTAAGC
CCTGATGATG TGATGTTAAA TAAATATTAT ATAAATGCGG AAAATCCATC TGTAAGTGGA
TTTATAGCGT GTGCAGCTGA CCTTGAAGAC ACTAAGACTT GTGAGGAACT TGTAGCACGA
CTAAGGTTAG ATTACAAAGA TTCACCATTT CCGGATCATA ATAAGCTATC TGATATTAAT
TATAATGAAA TATCATGCTA TATATTGGTA TTTAAAACGA AGGACGTAGA CAAGATAAAA
ATTCCAGTTA GCAAAAATAT GTTTGAAGAC ATTTCGGATG AGGAGCTTGC TAATTTGGGT
ATTCGCAAAG AACATTTTAT TTTATTTGAG GAGGATGAAG AGTTAGCGAA AAGGTATCTT
TCCAAACCAT TCACAGGGAC TGGATTTACA GCTGCAGGTG CTGGATACAG TGAGCATATA
AGCAATGATG TAATGTATCA AATGAAAGAC AAGGCAATAC CGGAGTATTT TGTTCAGTTT
GAAAATCCCC TAAGTTTAAA AGATGGTGCA TTTTTGATAG AAATGAGAGG AAATAAGAAT
ATGAAAGTGA TAGCAAGATA TTCTGAAGCA GAAAAAAGAT TTATACATTT TGAGGAGTAA
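
A GC content figure such as the 37% listed under Gene Information can be recomputed from a sequence in this layout. The following is a minimal sketch; the function name `gc_content` is hypothetical, and how the database rounds its value is not stated in this record. Only the first row of the gene sequence is used as sample input, so the printed value describes that row alone, not the whole gene.

```python
def gc_content(seq: str) -> float:
    """Return the percentage of G and C bases in a nucleotide string.

    Whitespace (the 10-base grouping and line breaks used in this record)
    is ignored. Case is normalised to upper case.
    """
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


# First row of the gene sequence, copied as displayed (sample input only).
first_row = "ATGGTGCGCA GTAAATTATC AAGACTGATT GCTATTGTAT TATTAGTTGC ATTTATATTT"
print(f"{gc_content(first_row):.1f}% GC in the first 60 bases")
```

Passing the full gene sequence instead of `first_row` yields a value that can be compared with the reported GC content.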
 
Protein sequence
MVRSKLSRLI AIVLLVAFIF NFILPTQQFA LAKQAKQVES TFKTTTFERK YEIPKTRTKL 
GCFTDEIYKL YEKLKADIEA NDIDKVKSDI KNIKKVLKEI RKSIVTELDK NSKILDKLNA
TKAKQRHNRF KSELEKKLNN FEVLFDKLDE LLNRVKEYKG KDMELLRNKL QEIENILNPE
QPQQPLGTLP HNNASLTPPK PAIGNEAAAS GYSVAQQESL ANVLPKTPVD ADLAETAETK
FTQEIKNLAD SLKTPVRMYE FVKNNIDFEP YYGSRKGATG TLNQLAGNDY DQASLLIAML
RYKGIPSRYV KGIVEVPVEK VKNWTGAQTA EAAVKVLGSL GIPTESVVSG GAIVAVRFEH
VWVEAYVPYD YYRGAGAMKG QKVWVPLDPS FKQYSVESGL DIKAVTKVTD EQIMDAFKVS
GERDGETITK IDIEKMSSFM DEIKGKLQAY IEGNNLSNVD TNKLIGGKKI KPEKLGMLPI
TLPFKIVTVL AKTNTIPDAS SEKIGFSIRG NEPFNLNFSG TYDFSIQFKA VELYGKKITL
SWIPATHEDE EIINHYGGLF KTPAYMVQLK PQLKIDGRVV AEGKAAGFGN RQEFTIEIGH
VGRTVEKITN TVTVGGFYSI CFDFGKIDVK ELDAIKERVS VIKDTVNEES IYTDQVMGEI
LNSAGKTYFA QLDAFNSIMA RAMKVSSVRQ VSEAMTGYSP TVKYIFNTPV EVTGGSFYID
VDHDVMGVVS LEGKKQSERG YMITSGVIAS ALEHGIHEQI FKLPSVSAVK ILTEASNRWI
PIYTIGKDNI DRINELDVSE HVKTDITNAV NSGKIVIIPQ KEIRYYNWQG AGYIVLDPET
GAAGYMISGG LAGGSAAIDV MVALVSLTTL AWAIFDVLQI SCALIAATNP LLAIIFYSLF
VISTINLIMT LETIILYWET RDYEYASQLF GELILNIATF GVFKVIEYLV PGIMTLFKTV
KNQLDEIAQI AEQFGDEVAE VAARYGPDAI EAIKRYGPDA ARVINNYGDS AVKAMARGID
PALIEKMDSL AVKVNKLEKF KILSREAALK VVEVVETIKD YLKTSVGRVF EKIRSVYRIE
DELDLTTVDG CEFGLSRAKL KKKLIEEGMS EDSAEESLKF LEEGCFTGDT IVITKEGKKR
IDEIKIGDFV FAKDVNTGKT AYKKVKQIYV KSAEEIVHIK VGDDEVKTTK SHLFFTDSGW
WEAAEDIKSG DKIVTQDGIM KVVYEVEVEK LSAPVKIYNL NIEDYHTYFV GSSGLLVHND
CTPEETKLWG KWLDVAEKFK IIEKEGFGKL DEGRKSFTQE LDNVLIGNSL SKEEFLEYIK
RFVHDEKEQK LPSEVRKILL KIRSSIPKPD EDTVLIRVLS PDDVMLNKYY INAENPSVSG
FIACAADLED TKTCEELVAR LRLDYKDSPF PDHNKLSDIN YNEISCYILV FKTKDVDKIK
IPVSKNMFED ISDEELANLG IRKEHFILFE EDEELAKRYL SKPFTGTGFT AAGAGYSEHI
SNDVMYQMKD KAIPEYFVQF ENPLSLKDGA FLIEMRGNKN MKVIARYSEA EKRFIHFEE
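
The relationship between the gene sequence and the protein sequence can be illustrated by translating codons in frame. The sketch below builds the codon table from the compact base-order string commonly used for the standard genetic code; translation table 11 assigns the same amino acids to codons as the standard table and differs in which codons may act as starts. The function name `translate` is hypothetical, and only the first and last rows of the gene sequence are used as input.

```python
from itertools import product

# Codon table in T, C, A, G order (standard amino acid assignments,
# which translation table 11 shares); '*' marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(seq: str) -> str:
    """Translate an in-frame nucleotide string, ignoring whitespace."""
    bases = "".join(seq.split()).upper()
    usable = len(bases) - len(bases) % 3  # drop any trailing partial codon
    return "".join(CODON_TABLE[bases[i:i + 3]] for i in range(0, usable, 3))


# First and last rows of the gene sequence as displayed in this record.
first_row = "ATGGTGCGCA GTAAATTATC AAGACTGATT GCTATTGTAT TATTAGTTGC ATTTATATTT"
last_row = "ATGAAAGTGA TAGCAAGATA TTCTGAAGCA GAAAAAAGAT TTATACATTT TGAGGAGTAA"

print(translate(first_row))  # prints: MVRSKLSRLIAIVLLVAFIF
print(translate(last_row))   # prints: MKVIARYSEAEKRFIHFEE*
```

The first output matches the opening 20 residues of the protein sequence, and the second matches its final 19 residues followed by the stop codon, which the protein listing omits. The last row is in frame because it begins at base 4621, and the 4620 bases before it are a whole number of codons.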