Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mthe_0172 |
Symbol | |
ID | 4462154 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methanosaeta thermophila PT |
Kingdom | Archaea |
Replicon accession | NC_008553 |
Strand | + |
Start bp | 163138 |
End bp | 164694 |
Gene Length | 1557 bp |
Protein Length | 518 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 639699180 |
Product | peptidase M50 |
Protein accession | YP_842611 |
Protein GI | 116753493 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0750] Predicted membrane-associated Zn-dependent proteases 1 |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGATTTCA GCAGCGAGCA GTGGATAATT CTGATTGTAC TGGCTCTAGG GCTGCTATCT CTCCTTGGTA CAGCGCTCGG CGGTGGAAGG GGGATTAGCA GTTGGGGGCC GGTGATATTT GTGAGAACGA CCAGGGGTCT GGGCCTCCTT GACAGGCTCG CAGGTGCGAG GAGTCTCTGG CGTTTAACAA CAACGATCTT CATGCCTATC GTCATGGCAG GGATGATCTA CTTTCTGCTC CTGCTGCTTC TGATGGTATA CGTGATGTGG AGGGCGCCAC CCGAGCCCGG GAGCTACAGC GCTCCACGGA ACGTTCTGCT CATACCCGGT GTCAATCAGT ACATACCACT TGTATGGGGA TGGATCGCTC TATGCGTTAC GATAGTGGTA CATGAGCTCT CACACGGAAT CCTTTGCAGG GTCGAGGGGA TTCGGGTAAA ATCGATGGGT TTGATATTCC TGCTCTTCCC CATAGGCGCA TTTGTGGAGC CTGATGATTC CGAGCTGTTC GGGGACGAGA AGAACCCACC AAAAGCAACA TGTCAGGCGA GGATAAGAAT ACTGTCAGCA GGAGTTATCG CGAACTTCCT GGTGGCTGCG CTGGCGCTCT CTCTGTTCTT CGGTCCTGTC ATCAGCGCGC TCTCACCGGT CGACAGGGTT GTCGTTGTTG ATGTCGATCC TGAAAGCAGA TTTGCAGGTG ATGTCCGTGC AGGGATGGTT GTTTCGGGTG CGAGCAGCCT GGAGGAGCTT TACAGGATCG CATCAGAGGG GAGGTCCCTT GAGCTCTTTG ATGATACTTC CAGGGCAATA ATCTCCGGAG AGCCTGTCCT CGGGGTGCAG GTCGTAGACA CGTTCGACGG ATCGCCGGCA AAGGACGCCG GAATGCCGGA GAGGTTCGTC ATCACAGATA TCAATGATAC AAGAGTCGAT TCACTCGAGA GCTTCAGGGA CTACATGAAC TCCACATACC CAGGGCAGAT TCTAAAAATA AATACCACAA AGGGATCTTA CAGCGTGAAA CTTGCGCCAA AAGGCGATGG CACAGGCATG ATAGGGGTTG CGATATCAGG CACAGCCCTA TACCTCGATG GAGTTACCTT TCAGGAGTTA CAGCCGGAGA GGTTCCTTGC GCTCATGAGA TCGATCCCGT CAAGCGGATT GAAGGGTTTT AACACGCTCA TGGGACTGCC GTTTACGGGC GTTGTAGGAT TCACATCAGA TGGATTCCAG GGCTTCAGCG GATCGATGCT CTACCTCTTC GAGCCAGCCG GATGGGCGGA GCCACTGGGT GGAAAAATAT TCTGGATCGC AAACCTGCTG CTATGGATCG GCTGGATCAA CATGTATGCG GGGCTCTTCA ACTGCCTGCC GACGATACCG CTCGACGGCG GCCATATAGC TCGCGATATG ATCAGGATGC TCCTAGATAA AGTGATGAGC GAGAGGAGCG CCGAGCGGTT CACGAGAGGC ATCGTGGCCG CGCTCTCCTG GCTCGTGATA TCCTCACTCA TATTCACAGT TCTCGGCCCG TACCTGGCTC ATGGGATACC TCTCTGA
|
Protein sequence | MDFSSEQWII LIVLALGLLS LLGTALGGGR GISSWGPVIF VRTTRGLGLL DRLAGARSLW RLTTTIFMPI VMAGMIYFLL LLLLMVYVMW RAPPEPGSYS APRNVLLIPG VNQYIPLVWG WIALCVTIVV HELSHGILCR VEGIRVKSMG LIFLLFPIGA FVEPDDSELF GDEKNPPKAT CQARIRILSA GVIANFLVAA LALSLFFGPV ISALSPVDRV VVVDVDPESR FAGDVRAGMV VSGASSLEEL YRIASEGRSL ELFDDTSRAI ISGEPVLGVQ VVDTFDGSPA KDAGMPERFV ITDINDTRVD SLESFRDYMN STYPGQILKI NTTKGSYSVK LAPKGDGTGM IGVAISGTAL YLDGVTFQEL QPERFLALMR SIPSSGLKGF NTLMGLPFTG VVGFTSDGFQ GFSGSMLYLF EPAGWAEPLG GKIFWIANLL LWIGWINMYA GLFNCLPTIP LDGGHIARDM IRMLLDKVMS ERSAERFTRG IVAALSWLVI SSLIFTVLGP YLAHGIPL
|
| |