Gene Mthe_0153 details


Gene Information

Locus tag: Mthe_0153
Symbol:
ID: 4462827
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methanosaeta thermophila PT
Kingdom: Archaea
Replicon accession: NC_008553
Strand:
Start bp: 143466
End bp: 145883
Gene Length: 2418 bp
Protein Length: 805 aa
Translation table: 11
GC content: 56%
IMG OID: 639699162
Product: peptidase U32
Protein accession: YP_842593
Protein GI: 116753475
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0826] Collagenase and related proteases
TIGRFAM ID:
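
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the coding sequence ends in a single stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 143466
end_bp = 145883

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: one terminal stop codon, excluded from the protein length.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 2418 805
```

Both results agree with the Gene Length and Protein Length fields listed in the record.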


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCGTAGAA AGGATGAGAT CCCTGAGCTT CTGGCCCCGG CCGGCTCCTG GGATGCGCTG 
GTTGCGGCTG TTGCAGCCGG AGCGGATGCT GTTTACCTTG GAGGAAAGCG TTTCAGCGCC
CGGATGTTCG CCGAGAACTT CCCCAGCCTG GAGGAGGCTG TGGATTACGC TCATGCAAGG
AACGTCAGGG TCTATGTAAC CGTGAACACC CTCGTTCGTG ATTGTGAGAT CGACGAGCTG
GAGGATTATC TGGTGGAGAT CTGCGAGATT GGCGCAGACG CGATACTTGT GCAGGATACT
GGAGTTGTTA GGCTGGCCAG GGATATCGTG CCGGAGCTGG AGCTCCATGC ATCAACCCAG
ATGACGATAC ACAGCGCTGA TGGAGTCAGA TGGGCTGCGA GGAACGGCCT AAAGCGGGTG
GTGCTCTCGA GAGAGCTTTC TGTTGATGAT ATTAAAAACA TAAAAAATGT TTCCGATGAT
CTGGGCGTAG GGCTTGAGGT CTTTGTGCAC GGCGCGCTCT GCTACTCTTA CTCCGGACAG
TGTCTTTTAT CATCATCAAT GGGTGGCAGA AGCGGAAACC GCGGGATGTG CGCCCAGCCG
TGCAGGAAGC CATACACGCT GCTCAGAGGC ACATCTGATG AATATGGAAG GCTGAAAGAT
CTGAGAAGGA GAGGAGGGGA ATGCTATCTG CTCTCCACCC GTGATCTCTG CACGTACCCC
AGCCTTGATA AGATCGTATC AGCTGGAGTC GACGCGCTCA AGATCGAGGG GCGGATGAAG
TCTGCAGAGT ACGTCGCCAT TGTGACAAGG GTTTACAGAG ATGCGCTTGA TGCCATCGCG
AGAGGTGATT GGGCGCAGGA TGATGGAGAG ATCCAGAGGC TCGCGCTCGC CTTCAACAGG
GGGTTTACAG AGGGATACAT CCTGGGCGCA GATGATATCA TGGGAAGAGA GATGCCGGAC
AACAGGGGCG TCCTTGTGGG AAAGATCCTG AACTGCTCAG GGGGTTTTGC GGTCGTATCT
CCCACCGGCG AGATCCTTCC AGAGCCCGGT GACGGGGCAG TGCTTCGCTC AGGAGCGGAG
GAGATCGGAT TCGTTGTGAG AGAGAGGGTC GATTTACAGA ATGGCACGTT CGGGCTCAGG
GTGCCTGATG GTGCCAGGAC GGGAATGTAT CTTTACATAA CGCGATCCGC CCGCATGAGG
GACGATGCTG AGAGGATCAT CAGAAGAGGA AGAGATAGAA TCCCAATCGA CCTGCGGATA
TCCTTTGATA ACGGGGTGCC GGTCGCAGAT GTATATCTAG CTGGGCCGTC TGGGCGGATA
GAGCTCTCTG TTAAGGGAGA CTTTGTCATG GAGGCTGCGA GAACGCTTCC CCTCAGCCCC
TCGCAAATAG AATCTCAGAT GCGGAGAACC GGCGGAACCC AGTTCGTCTT CAGGGAGGTC
GTCATCGATT ATCCCGGGGG ACTTTACACC ACGCCAGCGA AGCTCAATCA GCTCAGAAGA
GATATACTCA GAGCTGCGGA GAACGCTCTT GTGCACTCGT ACAAAAGAGT GTGCACCAGA
GGAAGATCGC CAGCTCTTGA CAGAACTGAG AGAAAGGCTG ACCGGCTCAG GGTATCGGTT
TATGCCGATA CTCTTGATGT GATTGATGGA GCGCTTGAGG GCGGCGCTGA GAGGGTGTAC
TTCGAGCCAA CTACATATGA GCATGACCTT GCCTCTGCCC TGGAGAAGGC CCGCGATCTC
TGTGAAGGGC GCGCAGAGCT GGTCTGGAAG TGGCCCCGGA TAACAAGAGA TCGTTTTCTG
TATATGGCTG GGGATGTTCT CCGCGATTTC CGTCTCAACA AGATCATGGT CGAGAACCTG
GGCGCTCTCG AGGCAGCAGA GCGATACGAA TGCGAGATCT TCGGAGGACA GGGGCTGAAC
ATATGGAACT CGCTGAGCGT GTGCATGCTC TCAGGAGCGA GAGCCCTGAC GCTCTCACCG
GAGCTCTCCG CAAGCCAGAT CTCATCTATC GCATCTCTCC GTGACAGACC AGATCTTGAG
GTGATCGCCC AGGGAAACAT CGTGATCGCA GTGACCGAGG ACAGGCTCAT CTCAGAGGGA
GATGTCTGCG CGATTCGGGA CAGGAGACAC ATCTTTCCCG TCAGGAGAGA TGCCGCGGGC
GTCACGAATA TACTGAACAG CGTGGAGACC TGCCTCCTGG ATTACCTCCC TCAGATATCT
TTGATGGGTG TGGATTCTGT GGCGATCGAT GCCAGGTGGA GGACAAAAAA GTATGCCAGA
GAGATGGCCG GGATATACTC GAGGGCTGTG GGGGAGCTTT CAGAGCTCCC GAAGCTCAAA
AGCATGGTCA GGCGAATGGC CATGGGCGGG ATCACAGCAG GACATTTTCT GAGGGGTGTT
GCAGAGGCAA GCGATTGA
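
A GC content figure like the one in the Gene Information section can be recomputed from the nucleotide sequence. The sketch below is one plausible way to do it, assuming GC content means the share of G and C bases among all bases; the helper name `gc_percent` is hypothetical, and how the database rounds its value is not stated in the record. Only the first row of the sequence is used here for brevity; pass the full gene sequence to compare against the listed value.

```python
def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases, ignoring whitespace."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First row of the gene sequence above, spaces kept as displayed.
first_row = "ATGCGTAGAA AGGATGAGAT CCCTGAGCTT CTGGCCCCGG CCGGCTCCTG GGATGCGCTG"
print(round(gc_percent(first_row), 1))
```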
 
Protein sequence
MRRKDEIPEL LAPAGSWDAL VAAVAAGADA VYLGGKRFSA RMFAENFPSL EEAVDYAHAR 
NVRVYVTVNT LVRDCEIDEL EDYLVEICEI GADAILVQDT GVVRLARDIV PELELHASTQ
MTIHSADGVR WAARNGLKRV VLSRELSVDD IKNIKNVSDD LGVGLEVFVH GALCYSYSGQ
CLLSSSMGGR SGNRGMCAQP CRKPYTLLRG TSDEYGRLKD LRRRGGECYL LSTRDLCTYP
SLDKIVSAGV DALKIEGRMK SAEYVAIVTR VYRDALDAIA RGDWAQDDGE IQRLALAFNR
GFTEGYILGA DDIMGREMPD NRGVLVGKIL NCSGGFAVVS PTGEILPEPG DGAVLRSGAE
EIGFVVRERV DLQNGTFGLR VPDGARTGMY LYITRSARMR DDAERIIRRG RDRIPIDLRI
SFDNGVPVAD VYLAGPSGRI ELSVKGDFVM EAARTLPLSP SQIESQMRRT GGTQFVFREV
VIDYPGGLYT TPAKLNQLRR DILRAAENAL VHSYKRVCTR GRSPALDRTE RKADRLRVSV
YADTLDVIDG ALEGGAERVY FEPTTYEHDL ASALEKARDL CEGRAELVWK WPRITRDRFL
YMAGDVLRDF RLNKIMVENL GALEAAERYE CEIFGGQGLN IWNSLSVCML SGARALTLSP
ELSASQISSI ASLRDRPDLE VIAQGNIVIA VTEDRLISEG DVCAIRDRRH IFPVRRDAAG
VTNILNSVET CLLDYLPQIS LMGVDSVAID ARWRTKKYAR EMAGIYSRAV GELSELPKLK
SMVRRMAMGG ITAGHFLRGV AEASD
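
The relationship between the gene sequence and the protein sequence can be illustrated with a small translation sketch. It assumes the codon-to-amino-acid assignments of NCBI translation table 11, which are the same as those of the standard code (the two tables differ in permitted start codons, which does not matter here because the gene begins with ATG). The names `CODON_TABLE` and `translate` are illustrative. Only the first and last rows of the gene sequence are translated.

```python
from itertools import product

# Codons in NCBI order (T, C, A, G at each position) paired with their
# amino acids; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # "X" for unknown codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


first_row = "ATGCGTAGAA AGGATGAGAT CCCTGAGCTT CTGGCCCCGG CCGGCTCCTG GGATGCGCTG"
last_row = "GCAGAGGCAA GCGATTGA"
print(translate(first_row))  # MRRKDEIPELLAPAGSWDAL
print(translate(last_row))   # AEASD
```

The two outputs match the first 20 and last 5 residues of the protein sequence shown above.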