Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mthe_1731 |
Symbol | |
ID | 4462924 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methanosaeta thermophila PT |
Kingdom | Archaea |
Replicon accession | NC_008553 |
Strand | + |
Start bp | 1875250 |
End bp | 1876515 |
Gene Length | 1266 bp |
Protein Length | 421 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 639700750 |
Product | hypothetical protein |
Protein accession | YP_844137 |
Protein GI | 116755019 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG1030] Membrane-bound serine protease (ClpP class) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGGTTGC ACCTGAAGTC GCTATTAATT CTGCTGCTAT TATCATCCCC CTGCCTCGGG GATGTCGTCT CTCTCCGGAT AGATGGGGCC ATAACTCCGG CGAGCGACGA TCTTGTGAAG ACTGCTATAG GCTACGCTGA GAGCTCGAAC GCTGATGCAT TGATTCTGAT GCTCGATACA CCCGGAGGCG GCCTGAGCGA GACACTGGAG ATAATAGCTG TTGTGGAGAG AACCGAGATA CCTGTGGTGG GATACGTCTC CCCATCAGGG GCGAAGGCGT GGTCTGCGGG CACAATGATA CTGATCAGCA CAGATATAGC AGCAATGGCC CCGAACACGA TCATAGGCTC GGCCCAGCCG GTCAGGCTTC TTCCCACAGG TGCGACTGAG CCTGTCAACG ACACGAAGAC GACAAACGCG ATAGTTGCGC TCATCGAGGA GAAGGCCAAG ATCCATGGGC GGAACAGGAC CGCGGCTAGG GAGTTCGTCC TGAGCAACCT CAATCTCAAT GCTGAGGAGG CGCTGGAGTA TGGGGTGATC GAGCACGTCT CCCCCGATAT CAGCAGTCTC CTCAAATCCA TCAACGGCAG CTCTGCGAAG AATAGAACTC TTGTGACAGA GGGCGCTGCG GTAGTGATCT TCGAGCCGGA CCTCAGGCTC AGGGTGCTGA TGCTTCTATC TGATCCAACA ATCGCCGGAT TGCTGCTGCT CGTCGGCCTC TACGCTCTCA TCTTCGGAAT CTCAAATCCG GGCCTCGGAG CGGAGGTGTT TGGCGTTATA GCGATCGCGC TTGGGCTCAT AGGTCAGGGG TTCGATGTCA ACATCGGCGC TCTGTTTCTC ATAATCCTCG GCATGGGCCT GATTCTGGCA GAGCTGCACA CCCACGCCAT GGGAGTTCTG GGCGTTGCCG GACTGATCTG CATCGTCCTG GGCACACTTC TCTTCGCACC CATAGGGTTC CCGGAGTGGT ATCTGCCAGG GGAATACCAG CGGTCTGTTA TCAGGCTCTT CCTGCTTCCG TCCCTGACGA TGGCCGGATT CTTCGCGTTT GCAGTTTATA AGATAGCTGA GGCAAGGCGC AGGCCGACTT TTGAGGAGAC TGCTGGGCAG TACGCAGAGA CGATCGAGAC ACTGGATCCA AAAGGCTATG TCATATTCCG CGGGGAGTAC TGGAAGGCGG AGGCTGATGA GAGAATCGAA AAAGGAGAAA CAGTCGAGGT TGTGGGAATC TCCGGCCAGA CGCTCAAAGT CAGGAAGGTC AGGTGA
|
Protein sequence | MGLHLKSLLI LLLLSSPCLG DVVSLRIDGA ITPASDDLVK TAIGYAESSN ADALILMLDT PGGGLSETLE IIAVVERTEI PVVGYVSPSG AKAWSAGTMI LISTDIAAMA PNTIIGSAQP VRLLPTGATE PVNDTKTTNA IVALIEEKAK IHGRNRTAAR EFVLSNLNLN AEEALEYGVI EHVSPDISSL LKSINGSSAK NRTLVTEGAA VVIFEPDLRL RVLMLLSDPT IAGLLLLVGL YALIFGISNP GLGAEVFGVI AIALGLIGQG FDVNIGALFL IILGMGLILA ELHTHAMGVL GVAGLICIVL GTLLFAPIGF PEWYLPGEYQ RSVIRLFLLP SLTMAGFFAF AVYKIAEARR RPTFEETAGQ YAETIETLDP KGYVIFRGEY WKAEADERIE KGETVEVVGI SGQTLKVRKV R
|
| |