Gene Information |
Locus tag | Athe_2571 |
Symbol | |
ID | 7409522 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anaerocellum thermophilum DSM 6725 |
Kingdom | Bacteria |
Replicon accession | NC_012034 |
Strand | - |
Start bp | 2692513 |
End bp | 2697192 |
Gene Length | 4680 bp |
Protein Length | 1559 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 643716933 |
Product | transglutaminase domain protein |
Protein accession | YP_002574410 |
Protein GI | 222530528 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1305] Transglutaminase-like enzymes, putative cysteine proteases |
TIGRFAM ID | [TIGR01443] intein C-terminal splicing region [TIGR01445] intein N-terminal splicing region |
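The coordinate and length fields above can be cross-checked against each other with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that Protein Length excludes the stop codon; neither convention is stated in the record, so treat both as assumptions. The variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 2692513
end_bp = 2697192

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with a stop codon that is not counted in the protein length.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length)     # 4680, matches "Gene Length"
print(protein_length)  # 1559, matches "Protein Length"
```

Under these assumptions, the reported Gene Length (4680 bp) and Protein Length (1559 aa) are consistent with the listed coordinates.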
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.0223142 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTGCGCA GTAAATTATC AAGACTGATT GCTATTGTAT TATTAGTTGC ATTTATATTT AACTTTATAC TTCCAACTCA ACAGTTTGCT TTAGCAAAAC AGGCAAAACA AGTTGAGAGT ACTTTTAAAA CAACGACGTT TGAGAGAAAA TATGAAATTC CAAAAACCAG AACAAAACTT GGCTGTTTCA CAGATGAAAT ATATAAACTG TATGAGAAGC TCAAAGCAGA CATTGAAGCC AATGACATAG ATAAAGTGAA ATCTGATATA AAAAACATAA AAAAGGTTTT AAAAGAGATT AGAAAATCTA TTGTTACAGA ACTTGACAAA AACAGTAAAA TACTTGACAA GTTGAATGCA ACTAAAGCCA AGCAAAGACA TAATCGGTTT AAATCTGAAT TGGAAAAGAA ATTGAACAAT TTTGAAGTCT TATTTGATAA GCTTGATGAA CTTTTGAACC GCGTAAAAGA ATACAAAGGT AAGGACATGG AACTTCTAAG GAACAAACTT CAGGAGATAG AGAATATATT AAATCCTGAA CAGCCCCAGC AACCATTGGG TACACTTCCT CATAACAATG CTAGCTTAAC TCCACCTAAA CCAGCAATTG GTAATGAAGC CGCAGCATCT GGGTATAGTG TTGCACAACA GGAGTCTTTG GCAAATGTGT TGCCAAAGAC TCCTGTTGAT GCGGATTTGG CTGAGACTGC TGAGACAAAA TTTACACAGG AAATCAAAAA TTTAGCTGAT TCACTCAAAA CACCAGTGAG AATGTATGAA TTTGTAAAAA ACAACATAGA TTTTGAGCCT TACTATGGTT CGAGAAAAGG TGCAACTGGT ACGTTAAACC AGTTAGCCGG CAATGACTAT GACCAAGCGT CGCTCTTGAT AGCAATGTTA AGATACAAGG GCATTCCTTC ACGATATGTT AAAGGTATTG TTGAAGTGCC TGTGGAGAAG GTAAAAAATT GGACAGGTGC TCAAACTGCC GAAGCAGCTG TAAAAGTGCT TGGCTCTCTT GGTATACCTA CCGAATCTGT TGTGTCTGGT GGTGCAATTG TTGCTGTTCG ATTTGAGCAT GTTTGGGTAG AAGCATATGT ACCCTATGAT TACTATCGCG GCGCAGGGGC TATGAAAGGT CAAAAAGTGT GGGTTCCTCT AGATCCAAGT TTCAAACAAT ATAGTGTAGA ATCGGGTTTG GATATTAAAG CAGTCACCAA AGTCACAGAT GAGCAAATTA TGGATGCATT TAAAGTTAGT GGTGAAAGAG ATGGCGAGAC CATCACGAAA ATAGATATCG AAAAGATGAG CAGCTTCATG GATGAGATTA AAGGGAAGCT GCAAGCCTAC ATTGAGGGGA ATAATCTTAG TAATGTTGAT ACTAACAAAT TGATAGGTGG CAAGAAAATA AAGCCTGAAA AGCTGGGGAT GCTTCCTATT ACATTGCCAT TCAAGATAGT TACAGTACTT GCAAAAACAA ATACGATACC TGATGCAAGT AGTGAGAAAA TAGGATTTTC AATCAGGGGA AATGAACCGT TCAACTTGAA TTTTTCAGGA ACGTATGATT TTAGTATTCA GTTCAAGGCA GTTGAACTAT ATGGTAAAAA GATAACTCTT TCATGGATTC CAGCCACACA TGAGGACGAA GAAATTATCA ATCATTACGG CGGGTTATTT AAAACACCTG CTTATATGGT TCAGTTGAAA CCGCAGCTGA AAATAGATGG CCGGGTTGTT GCAGAAGGTA AGGCAGCAGG ATTTGGAAAT AGGCAGGAAT TTACAATTGA GATAGGACAT 
GTAGGAAGAA CTGTTGAGAA GATAACCAAC ACTGTCACTG TTGGAGGATT TTATAGCATA TGTTTTGATT TTGGCAAGAT TGATGTTAAA GAACTTGATG CTATCAAAGA GAGAGTGTCA GTCATAAAAG ATACAGTAAA TGAGGAAAGC ATTTATACAG ATCAGGTAAT GGGAGAGATT TTAAATAGTG CAGGTAAGAC ATACTTTGCT CAGCTTGATG CATTTAATTC AATTATGGCA AGAGCAATGA AGGTAAGTTC TGTTCGCCAA GTTAGTGAAG CAATGACCGG CTACAGCCCA ACTGTAAAGT ATATATTCAA TACTCCTGTT GAGGTAACAG GCGGTTCGTT TTATATTGAT GTAGACCATG ATGTAATGGG TGTTGTTAGT CTTGAGGGTA AAAAGCAGAG CGAACGAGGA TATATGATAA CATCAGGAGT AATAGCTTCT GCGCTAGAGC ATGGGATACA CGAACAAATA TTTAAATTAC CTTCAGTTTC AGCAGTAAAG ATACTTACAG AAGCAAGCAA TAGATGGATA CCTATATATA CTATAGGCAA GGATAACATT GACAGAATAA ATGAACTGGA TGTTTCCGAG CATGTTAAAA CAGATATCAC CAATGCTGTT AACAGCGGGA AGATAGTAAT AATACCGCAA AAAGAGATAA GGTACTACAA CTGGCAGGGA GCAGGTTATA TAGTTTTAGA TCCGGAAACT GGCGCAGCAG GATACATGAT AAGCGGAGGG CTTGCAGGTG GTTCTGCAGC TATTGATGTT ATGGTTGCGC TGGTTAGTTT AACTACACTT GCTTGGGCTA TATTTGATGT ACTGCAAATA TCTTGTGCTT TAATTGCTGC AACCAATCCG CTTTTAGCAA TTATATTCTA TTCATTGTTT GTAATCAGCA CAATCAATCT CATTATGACG TTAGAAACAA TAATTTTGTA CTGGGAGACA AGAGATTATG AATATGCAAG CCAACTGTTT GGCGAGTTGA TACTAAATAT TGCTACATTT GGAGTATTTA AGGTAATTGA ATATTTAGTA CCGGGTATTA TGACGTTATT CAAGACGGTA AAAAATCAAC TGGATGAGAT AGCCCAGATA GCTGAACAAT TTGGAGATGA AGTTGCTGAA GTTGCAGCAA GATACGGTCC TGATGCGATT GAGGCTATAA AGAGGTATGG TCCAGATGCT GCGAGGGTGA TCAACAATTA TGGTGATAGT GCAGTAAAAG CTATGGCAAG AGGTATTGAC CCTGCGCTTA TTGAAAAAAT GGACAGTTTA GCTGTTAAGG TAAATAAGCT TGAAAAATTC AAGATACTAT CTCGTGAAGC AGCTCTCAAA GTTGTTGAGG TAGTGGAAAC AATAAAGGAC TACTTGAAAA CATCTGTGGG AAGAGTTTTT GAAAAGATTA GGTCTGTGTA CAGGATTGAA GATGAGTTAG ATTTAACAAC AGTGGATGGA TGTGAATTTG GCTTATCAAG GGCAAAACTG AAGAAGAAAT TAATAGAAGA GGGAATGAGT GAGGATTCGG CTGAAGAATC ACTAAAATTT TTAGAAGAAG GTTGCTTTAC CGGGGACACA ATTGTTATTA CAAAAGAGGG TAAAAAGAGG ATAGATGAGA TAAAGATAGG TGATTTTGTT TTTGCGAAGG ATGTCAATAC AGGTAAGACA GCTTATAAGA AAGTTAAACA GATTTATGTC AAGAGTGCGG AAGAGATTGT TCATATCAAA GTTGGAGATG ATGAAGTAAA AACTACGAAA TCGCACTTAT TCTTTACAGA TTCGGGCTGG TGGGAAGCAG 
CTGAGGATAT AAAAAGTGGT GACAAAATAG TAACACAAGA TGGTATAATG AAAGTAGTAT ATGAAGTAGA AGTTGAGAAG TTAAGCGCAC CTGTAAAAAT TTATAATCTC AACATAGAAG ATTATCATAC TTACTTTGTT GGAAGCTCTG GATTGCTTGT GCATAATGAC TGCACACCTG AGGAGACTAA ACTTTGGGGA AAATGGCTAG ATGTAGCCGA AAAATTTAAG ATAATTGAAA AAGAAGGTTT TGGTAAACTT GACGAAGGCA GAAAATCTTT TACCCAGGAA TTAGATAATG TTCTTATAGG AAATTCACTT TCGAAAGAAG AGTTTTTAGA GTACATTAAG AGATTTGTTC ATGACGAAAA AGAACAAAAA CTTCCATCTG AAGTAAGGAA AATATTGCTT AAAATTAGAA GCTCAATACC AAAACCAGAT GAGGATACAG TATTAATAAG AGTTCTAAGC CCTGATGATG TGATGTTAAA TAAATATTAT ATAAATGCGG AAAATCCATC TGTAAGTGGA TTTATAGCGT GTGCAGCTGA CCTTGAAGAC ACTAAGACTT GTGAGGAACT TGTAGCACGA CTAAGGTTAG ATTACAAAGA TTCACCATTT CCGGATCATA ATAAGCTATC TGATATTAAT TATAATGAAA TATCATGCTA TATATTGGTA TTTAAAACGA AGGACGTAGA CAAGATAAAA ATTCCAGTTA GCAAAAATAT GTTTGAAGAC ATTTCGGATG AGGAGCTTGC TAATTTGGGT ATTCGCAAAG AACATTTTAT TTTATTTGAG GAGGATGAAG AGTTAGCGAA AAGGTATCTT TCCAAACCAT TCACAGGGAC TGGATTTACA GCTGCAGGTG CTGGATACAG TGAGCATATA AGCAATGATG TAATGTATCA AATGAAAGAC AAGGCAATAC CGGAGTATTT TGTTCAGTTT GAAAATCCCC TAAGTTTAAA AGATGGTGCA TTTTTGATAG AAATGAGAGG AAATAAGAAT ATGAAAGTGA TAGCAAGATA TTCTGAAGCA GAAAAAAGAT TTATACATTT TGAGGAGTAA
|
Protein sequence | MVRSKLSRLI AIVLLVAFIF NFILPTQQFA LAKQAKQVES TFKTTTFERK YEIPKTRTKL GCFTDEIYKL YEKLKADIEA NDIDKVKSDI KNIKKVLKEI RKSIVTELDK NSKILDKLNA TKAKQRHNRF KSELEKKLNN FEVLFDKLDE LLNRVKEYKG KDMELLRNKL QEIENILNPE QPQQPLGTLP HNNASLTPPK PAIGNEAAAS GYSVAQQESL ANVLPKTPVD ADLAETAETK FTQEIKNLAD SLKTPVRMYE FVKNNIDFEP YYGSRKGATG TLNQLAGNDY DQASLLIAML RYKGIPSRYV KGIVEVPVEK VKNWTGAQTA EAAVKVLGSL GIPTESVVSG GAIVAVRFEH VWVEAYVPYD YYRGAGAMKG QKVWVPLDPS FKQYSVESGL DIKAVTKVTD EQIMDAFKVS GERDGETITK IDIEKMSSFM DEIKGKLQAY IEGNNLSNVD TNKLIGGKKI KPEKLGMLPI TLPFKIVTVL AKTNTIPDAS SEKIGFSIRG NEPFNLNFSG TYDFSIQFKA VELYGKKITL SWIPATHEDE EIINHYGGLF KTPAYMVQLK PQLKIDGRVV AEGKAAGFGN RQEFTIEIGH VGRTVEKITN TVTVGGFYSI CFDFGKIDVK ELDAIKERVS VIKDTVNEES IYTDQVMGEI LNSAGKTYFA QLDAFNSIMA RAMKVSSVRQ VSEAMTGYSP TVKYIFNTPV EVTGGSFYID VDHDVMGVVS LEGKKQSERG YMITSGVIAS ALEHGIHEQI FKLPSVSAVK ILTEASNRWI PIYTIGKDNI DRINELDVSE HVKTDITNAV NSGKIVIIPQ KEIRYYNWQG AGYIVLDPET GAAGYMISGG LAGGSAAIDV MVALVSLTTL AWAIFDVLQI SCALIAATNP LLAIIFYSLF VISTINLIMT LETIILYWET RDYEYASQLF GELILNIATF GVFKVIEYLV PGIMTLFKTV KNQLDEIAQI AEQFGDEVAE VAARYGPDAI EAIKRYGPDA ARVINNYGDS AVKAMARGID PALIEKMDSL AVKVNKLEKF KILSREAALK VVEVVETIKD YLKTSVGRVF EKIRSVYRIE DELDLTTVDG CEFGLSRAKL KKKLIEEGMS EDSAEESLKF LEEGCFTGDT IVITKEGKKR IDEIKIGDFV FAKDVNTGKT AYKKVKQIYV KSAEEIVHIK VGDDEVKTTK SHLFFTDSGW WEAAEDIKSG DKIVTQDGIM KVVYEVEVEK LSAPVKIYNL NIEDYHTYFV GSSGLLVHND CTPEETKLWG KWLDVAEKFK IIEKEGFGKL DEGRKSFTQE LDNVLIGNSL SKEEFLEYIK RFVHDEKEQK LPSEVRKILL KIRSSIPKPD EDTVLIRVLS PDDVMLNKYY INAENPSVSG FIACAADLED TKTCEELVAR LRLDYKDSPF PDHNKLSDIN YNEISCYILV FKTKDVDKIK IPVSKNMFED ISDEELANLG IRKEHFILFE EDEELAKRYL SKPFTGTGFT AAGAGYSEHI SNDVMYQMKD KAIPEYFVQF ENPLSLKDGA FLIEMRGNKN MKVIARYSEA EKRFIHFEE
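The two sequence fields above are printed in space-separated blocks of ten characters. A minimal sketch of how such a block-formatted gene sequence could be cleaned, checked for GC content, and translated is shown below. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical and not part of the source database. The codon table is the standard amino acid assignment, which translation table 11 shares for all 64 codons (table 11 differs in which codons may act as starts). The sketch does not model alternative start codons and stops at the first stop codon.

```python
# Standard codon assignments in TCAG order; translation table 11 uses the same
# amino acid assignments and differs only in its permitted start codons.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces/newlines between 10-character blocks and uppercase."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for pos in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[pos:pos + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence from the record.
first_blocks = "ATGGTGCGCA GTAAATTATC AAGACTGATT"
print(translate(first_blocks))  # MVRSKLSRLI
```

The translated prefix matches the first block of the Protein sequence field. Applying `gc_fraction` and `translate` to the full Gene sequence field should allow the reported GC content and protein sequence to be checked in the same way.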