Gene Mthe_0172 details

Gene Information

Locus tag: Mthe_0172
Symbol:
ID: 4462154
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methanosaeta thermophila PT
Kingdom: Archaea
Replicon accession: NC_008553
Strand:
Start bp: 163138
End bp: 164694
Gene Length: 1557 bp
Protein Length: 518 aa
Translation table: 11
GC content: 54%
IMG OID: 639699180
Product: peptidase M50
Protein accession: YP_842611
Protein GI: 116753493
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0750] Predicted membrane-associated Zn-dependent proteases 1
TIGRFAM ID:
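
The length fields above are consistent with the coordinates. As a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the terminal stop codon is counted in the gene length, the arithmetic can be checked as follows (the function names are illustrative and not part of the source database):

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS whose last codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length = gene_length_bp(163138, 164694)
print(length, protein_length_aa(length))  # 1557 518
```

Both values agree with the Gene Length and Protein Length entries in the record.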


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGATTTCA GCAGCGAGCA GTGGATAATT CTGATTGTAC TGGCTCTAGG GCTGCTATCT 
CTCCTTGGTA CAGCGCTCGG CGGTGGAAGG GGGATTAGCA GTTGGGGGCC GGTGATATTT
GTGAGAACGA CCAGGGGTCT GGGCCTCCTT GACAGGCTCG CAGGTGCGAG GAGTCTCTGG
CGTTTAACAA CAACGATCTT CATGCCTATC GTCATGGCAG GGATGATCTA CTTTCTGCTC
CTGCTGCTTC TGATGGTATA CGTGATGTGG AGGGCGCCAC CCGAGCCCGG GAGCTACAGC
GCTCCACGGA ACGTTCTGCT CATACCCGGT GTCAATCAGT ACATACCACT TGTATGGGGA
TGGATCGCTC TATGCGTTAC GATAGTGGTA CATGAGCTCT CACACGGAAT CCTTTGCAGG
GTCGAGGGGA TTCGGGTAAA ATCGATGGGT TTGATATTCC TGCTCTTCCC CATAGGCGCA
TTTGTGGAGC CTGATGATTC CGAGCTGTTC GGGGACGAGA AGAACCCACC AAAAGCAACA
TGTCAGGCGA GGATAAGAAT ACTGTCAGCA GGAGTTATCG CGAACTTCCT GGTGGCTGCG
CTGGCGCTCT CTCTGTTCTT CGGTCCTGTC ATCAGCGCGC TCTCACCGGT CGACAGGGTT
GTCGTTGTTG ATGTCGATCC TGAAAGCAGA TTTGCAGGTG ATGTCCGTGC AGGGATGGTT
GTTTCGGGTG CGAGCAGCCT GGAGGAGCTT TACAGGATCG CATCAGAGGG GAGGTCCCTT
GAGCTCTTTG ATGATACTTC CAGGGCAATA ATCTCCGGAG AGCCTGTCCT CGGGGTGCAG
GTCGTAGACA CGTTCGACGG ATCGCCGGCA AAGGACGCCG GAATGCCGGA GAGGTTCGTC
ATCACAGATA TCAATGATAC AAGAGTCGAT TCACTCGAGA GCTTCAGGGA CTACATGAAC
TCCACATACC CAGGGCAGAT TCTAAAAATA AATACCACAA AGGGATCTTA CAGCGTGAAA
CTTGCGCCAA AAGGCGATGG CACAGGCATG ATAGGGGTTG CGATATCAGG CACAGCCCTA
TACCTCGATG GAGTTACCTT TCAGGAGTTA CAGCCGGAGA GGTTCCTTGC GCTCATGAGA
TCGATCCCGT CAAGCGGATT GAAGGGTTTT AACACGCTCA TGGGACTGCC GTTTACGGGC
GTTGTAGGAT TCACATCAGA TGGATTCCAG GGCTTCAGCG GATCGATGCT CTACCTCTTC
GAGCCAGCCG GATGGGCGGA GCCACTGGGT GGAAAAATAT TCTGGATCGC AAACCTGCTG
CTATGGATCG GCTGGATCAA CATGTATGCG GGGCTCTTCA ACTGCCTGCC GACGATACCG
CTCGACGGCG GCCATATAGC TCGCGATATG ATCAGGATGC TCCTAGATAA AGTGATGAGC
GAGAGGAGCG CCGAGCGGTT CACGAGAGGC ATCGTGGCCG CGCTCTCCTG GCTCGTGATA
TCCTCACTCA TATTCACAGT TCTCGGCCCG TACCTGGCTC ATGGGATACC TCTCTGA
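
The gene sequence is printed in space-separated blocks of ten bases, so it has to be joined before any computation. The following is a minimal sketch of how a GC percentage such as the one in the record could be computed. The helper names are hypothetical, and the exact method and rounding used by the source database are not stated, so this is only one plausible approach.

```python
def clean_sequence(text: str) -> str:
    """Join a sequence printed in spaced blocks into one uppercase string."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence above, used only as a short demonstration.
first_line = "ATGGATTTCA GCAGCGAGCA GTGGATAATT CTGATTGTAC TGGCTCTAGG GCTGCTATCT"
fragment = clean_sequence(first_line)
print(len(fragment), round(gc_percent(fragment), 1))
```

The GC content in the record refers to the full 1557 bp gene, so the value for a single line need not match it. Passing all sequence lines to `clean_sequence` gives the input for the whole-gene figure.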
 
Protein sequence
MDFSSEQWII LIVLALGLLS LLGTALGGGR GISSWGPVIF VRTTRGLGLL DRLAGARSLW 
RLTTTIFMPI VMAGMIYFLL LLLLMVYVMW RAPPEPGSYS APRNVLLIPG VNQYIPLVWG
WIALCVTIVV HELSHGILCR VEGIRVKSMG LIFLLFPIGA FVEPDDSELF GDEKNPPKAT
CQARIRILSA GVIANFLVAA LALSLFFGPV ISALSPVDRV VVVDVDPESR FAGDVRAGMV
VSGASSLEEL YRIASEGRSL ELFDDTSRAI ISGEPVLGVQ VVDTFDGSPA KDAGMPERFV
ITDINDTRVD SLESFRDYMN STYPGQILKI NTTKGSYSVK LAPKGDGTGM IGVAISGTAL
YLDGVTFQEL QPERFLALMR SIPSSGLKGF NTLMGLPFTG VVGFTSDGFQ GFSGSMLYLF
EPAGWAEPLG GKIFWIANLL LWIGWINMYA GLFNCLPTIP LDGGHIARDM IRMLLDKVMS
ERSAERFTRG IVAALSWLVI SSLIFTVLGP YLAHGIPL
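
The protein sequence can be related back to the gene sequence by translating it codon by codon. The sketch below is a simplified illustration, not the database's own pipeline. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code and differs only in the permitted start codons. Because this gene begins with ATG, no special handling of alternative starts is included, which is an assumption of this example.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI order: first base T, C, A, G, then second, then third.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA sequence in frame 1, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence from this record.
first_line = "ATGGATTTCA GCAGCGAGCA GTGGATAATT CTGATTGTAC TGGCTCTAGG GCTGCTATCT"
print(translate(first_line))  # MDFSSEQWIILIVLALGLLS
```

The result matches the first twenty residues of the protein sequence above. Translating the final line of the gene sequence, which ends in the stop codon TGA, gives the last eighteen residues, SSLIFTVLGPYLAHGIPL.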