Gene Information |
Locus tag | Mthe_0153 |
Symbol | |
ID | 4462827 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methanosaeta thermophila PT |
Kingdom | Archaea |
Replicon accession | NC_008553 |
Strand | + |
Start bp | 143466 |
End bp | 145883 |
Gene Length | 2418 bp |
Protein Length | 805 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 639699162 |
Product | peptidase U32 |
Protein accession | YP_842593 |
Protein GI | 116753475 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0826] Collagenase and related proteases |
TIGRFAM ID | |
|
|
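The start, end, gene length, and protein length listed above are tied together by simple arithmetic. The following is a minimal sketch in Python, assuming the coordinates are 1-based and inclusive (the listed values are consistent with this) and that the last codon of this unspliced CDS is a stop codon not counted in the protein length. The function names are illustrative and do not belong to any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive start and end coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # drop the stop codon


length = gene_length_bp(143466, 145883)
print(length)                     # 2418
print(protein_length_aa(length))  # 805
```

Both results agree with the Gene Length and Protein Length fields of the record.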
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGTAGAA AGGATGAGAT CCCTGAGCTT CTGGCCCCGG CCGGCTCCTG GGATGCGCTG GTTGCGGCTG TTGCAGCCGG AGCGGATGCT GTTTACCTTG GAGGAAAGCG TTTCAGCGCC CGGATGTTCG CCGAGAACTT CCCCAGCCTG GAGGAGGCTG TGGATTACGC TCATGCAAGG AACGTCAGGG TCTATGTAAC CGTGAACACC CTCGTTCGTG ATTGTGAGAT CGACGAGCTG GAGGATTATC TGGTGGAGAT CTGCGAGATT GGCGCAGACG CGATACTTGT GCAGGATACT GGAGTTGTTA GGCTGGCCAG GGATATCGTG CCGGAGCTGG AGCTCCATGC ATCAACCCAG ATGACGATAC ACAGCGCTGA TGGAGTCAGA TGGGCTGCGA GGAACGGCCT AAAGCGGGTG GTGCTCTCGA GAGAGCTTTC TGTTGATGAT ATTAAAAACA TAAAAAATGT TTCCGATGAT CTGGGCGTAG GGCTTGAGGT CTTTGTGCAC GGCGCGCTCT GCTACTCTTA CTCCGGACAG TGTCTTTTAT CATCATCAAT GGGTGGCAGA AGCGGAAACC GCGGGATGTG CGCCCAGCCG TGCAGGAAGC CATACACGCT GCTCAGAGGC ACATCTGATG AATATGGAAG GCTGAAAGAT CTGAGAAGGA GAGGAGGGGA ATGCTATCTG CTCTCCACCC GTGATCTCTG CACGTACCCC AGCCTTGATA AGATCGTATC AGCTGGAGTC GACGCGCTCA AGATCGAGGG GCGGATGAAG TCTGCAGAGT ACGTCGCCAT TGTGACAAGG GTTTACAGAG ATGCGCTTGA TGCCATCGCG AGAGGTGATT GGGCGCAGGA TGATGGAGAG ATCCAGAGGC TCGCGCTCGC CTTCAACAGG GGGTTTACAG AGGGATACAT CCTGGGCGCA GATGATATCA TGGGAAGAGA GATGCCGGAC AACAGGGGCG TCCTTGTGGG AAAGATCCTG AACTGCTCAG GGGGTTTTGC GGTCGTATCT CCCACCGGCG AGATCCTTCC AGAGCCCGGT GACGGGGCAG TGCTTCGCTC AGGAGCGGAG GAGATCGGAT TCGTTGTGAG AGAGAGGGTC GATTTACAGA ATGGCACGTT CGGGCTCAGG GTGCCTGATG GTGCCAGGAC GGGAATGTAT CTTTACATAA CGCGATCCGC CCGCATGAGG GACGATGCTG AGAGGATCAT CAGAAGAGGA AGAGATAGAA TCCCAATCGA CCTGCGGATA TCCTTTGATA ACGGGGTGCC GGTCGCAGAT GTATATCTAG CTGGGCCGTC TGGGCGGATA GAGCTCTCTG TTAAGGGAGA CTTTGTCATG GAGGCTGCGA GAACGCTTCC CCTCAGCCCC TCGCAAATAG AATCTCAGAT GCGGAGAACC GGCGGAACCC AGTTCGTCTT CAGGGAGGTC GTCATCGATT ATCCCGGGGG ACTTTACACC ACGCCAGCGA AGCTCAATCA GCTCAGAAGA GATATACTCA GAGCTGCGGA GAACGCTCTT GTGCACTCGT ACAAAAGAGT GTGCACCAGA GGAAGATCGC CAGCTCTTGA CAGAACTGAG AGAAAGGCTG ACCGGCTCAG GGTATCGGTT TATGCCGATA CTCTTGATGT GATTGATGGA GCGCTTGAGG GCGGCGCTGA GAGGGTGTAC TTCGAGCCAA CTACATATGA GCATGACCTT GCCTCTGCCC TGGAGAAGGC CCGCGATCTC TGTGAAGGGC GCGCAGAGCT GGTCTGGAAG TGGCCCCGGA TAACAAGAGA TCGTTTTCTG 
TATATGGCTG GGGATGTTCT CCGCGATTTC CGTCTCAACA AGATCATGGT CGAGAACCTG GGCGCTCTCG AGGCAGCAGA GCGATACGAA TGCGAGATCT TCGGAGGACA GGGGCTGAAC ATATGGAACT CGCTGAGCGT GTGCATGCTC TCAGGAGCGA GAGCCCTGAC GCTCTCACCG GAGCTCTCCG CAAGCCAGAT CTCATCTATC GCATCTCTCC GTGACAGACC AGATCTTGAG GTGATCGCCC AGGGAAACAT CGTGATCGCA GTGACCGAGG ACAGGCTCAT CTCAGAGGGA GATGTCTGCG CGATTCGGGA CAGGAGACAC ATCTTTCCCG TCAGGAGAGA TGCCGCGGGC GTCACGAATA TACTGAACAG CGTGGAGACC TGCCTCCTGG ATTACCTCCC TCAGATATCT TTGATGGGTG TGGATTCTGT GGCGATCGAT GCCAGGTGGA GGACAAAAAA GTATGCCAGA GAGATGGCCG GGATATACTC GAGGGCTGTG GGGGAGCTTT CAGAGCTCCC GAAGCTCAAA AGCATGGTCA GGCGAATGGC CATGGGCGGG ATCACAGCAG GACATTTTCT GAGGGGTGTT GCAGAGGCAA GCGATTGA
|
Protein sequence | MRRKDEIPEL LAPAGSWDAL VAAVAAGADA VYLGGKRFSA RMFAENFPSL EEAVDYAHAR NVRVYVTVNT LVRDCEIDEL EDYLVEICEI GADAILVQDT GVVRLARDIV PELELHASTQ MTIHSADGVR WAARNGLKRV VLSRELSVDD IKNIKNVSDD LGVGLEVFVH GALCYSYSGQ CLLSSSMGGR SGNRGMCAQP CRKPYTLLRG TSDEYGRLKD LRRRGGECYL LSTRDLCTYP SLDKIVSAGV DALKIEGRMK SAEYVAIVTR VYRDALDAIA RGDWAQDDGE IQRLALAFNR GFTEGYILGA DDIMGREMPD NRGVLVGKIL NCSGGFAVVS PTGEILPEPG DGAVLRSGAE EIGFVVRERV DLQNGTFGLR VPDGARTGMY LYITRSARMR DDAERIIRRG RDRIPIDLRI SFDNGVPVAD VYLAGPSGRI ELSVKGDFVM EAARTLPLSP SQIESQMRRT GGTQFVFREV VIDYPGGLYT TPAKLNQLRR DILRAAENAL VHSYKRVCTR GRSPALDRTE RKADRLRVSV YADTLDVIDG ALEGGAERVY FEPTTYEHDL ASALEKARDL CEGRAELVWK WPRITRDRFL YMAGDVLRDF RLNKIMVENL GALEAAERYE CEIFGGQGLN IWNSLSVCML SGARALTLSP ELSASQISSI ASLRDRPDLE VIAQGNIVIA VTEDRLISEG DVCAIRDRRH IFPVRRDAAG VTNILNSVET CLLDYLPQIS LMGVDSVAID ARWRTKKYAR EMAGIYSRAV GELSELPKLK SMVRRMAMGG ITAGHFLRGV AEASD
|
| |
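The sequences above are displayed in space-separated blocks of 10 characters, so they need cleaning before any computation. The following is a minimal sketch in Python of stripping that layout and computing a GC percentage. The helper names are hypothetical, and the fragment used is only the first three blocks of the gene sequence, not the whole gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace (spaces, line breaks) and upper-case the sequence."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a non-empty nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First three 10-base blocks of the gene sequence, as displayed in the record.
fragment = clean_sequence("ATGCGTAGAA AGGATGAGAT CCCTGAGCTT")
print(len(fragment))                   # 30
print(fragment.startswith("ATG"))      # True
print(round(gc_percent(fragment), 1))  # 46.7
```

Applied to the full gene sequence, the same function gives a value that can be compared with the GC content field of the record.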