Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mthe_0597 |
Symbol | |
ID | 4461742 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methanosaeta thermophila PT |
Kingdom | Archaea |
Replicon accession | NC_008553 |
Strand | - |
Start bp | 619583 |
End bp | 621490 |
Gene Length | 1908 bp |
Protein Length | 635 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 639699606 |
Product | ATP-dependent protease Lon |
Protein accession | YP_843028 |
Protein GI | 116753910 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG1067] Predicted ATP-dependent protease |
TIGRFAM ID | [TIGR00764] lon-related putative ATP-dependent protease |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTGAGTA TGGAAGAACC GAACGAGTTG CTTGGCAGTA TTCAGTTTGA GGATACAAGC AGCATCACCG TCCCCGAGAG CCTCATAGAC CAGGTCATAG GTCAGGAGGA GGCGGTGGAG GTGATCAAGA AGGCGGCACA TCAGCGCCGC CACGTGATGC TCATAGGCTC GCCCGGTACA GGCAAGTCAA TGCTGGGAAA GGCCATGAGC GAGCTCCTTC CTGTGGAGGA GCTGCAGGAT ATCCTGGTGT ATCACAATCC GGAGGACAAC AACAACCCAC GCATCAGGGT CGTGCCCGCA GGTCGGGGCA GGCAGATCGT GGATGCACAC AAGATGGAGG CCAGGAAGAA GGTCCAGACG AGGAACATGT TCTTCATGCT CATCGTCATG GGGCTGATAG TCTACGCCTA CTATACAGGC CAGCTTCTCT TCGGGATAAT CGCAGCAGCA CTGATATTTC TCTCAATGAG GTATCTCATA CCAAAAGAGG ATGTGTTCAT TCCGAAACTT CTTGTCGACA ACAGCGGCAA GAAGACAGCT CCATTCGTTG ATGCAACAGG TGCACACGCC GGAGCTCTGC TCGGTGATGT CAGGCATGAT CCTTTCCAGT CTGGCGGCCT CGAGACCCCT AGCCATGAGA GGGTTGAGTG CGGCGCCATA CACAGGGCTC ACAAGGGTGT GCTATTCATA GATGAGATCA ACACGCTCCG GCTGGAGTCG CAGCAGAGCC TTCTCACAGC CCTTCAGGAG GGCATGTATC CGATAACAGG TCAGAGCGAG AGGTCCTCAG GAGCGCTCGT GAGGACTGAA CCTGTGCCAT GCAGCTTCAT AATGGTCGTC GCGGGGAACT TGGACGCGGT TCAGGGAATG CACCCGGCGC TCAGGTCCCG CATAAAGGGC TACGGCTATG AGGTCTACAT GAGGGACACC ATGGAGGACA CTGTGGAGAA CAGGGACAAG CTGATAAGGT TCGTGGCCCA GGAGGTTGTG CGTGACGGCA AGATCCCGCA CTTCACCAGG GATGCTGTGG CCGAGATAAT CAGAGAGGCG AAGAGGCGTT CCGGCAGGAA GGGCCATCTG ACGCTGATGC TCAGGGACCT CGGAGGTCTT ATACGTGTCG CCGGCGATAT CGCGAGATCT GAGGGCGCTC CACTGACCGA GGCCAGGCAT GTCATAGAGG CCAAAAAGAT GGCGAGATCG CTGGAGCAGC AGATGGCTGA CAGATATCTC GAGAGGCGAA AGGAGTACAG CATGTACAAG AGCTCTGGCG ATGAGGTGGG GAGGGTGAAC GGGCTTGCTG TCATTGGCGA TTCCGGCATC GTCCTGCCCA TAATGGCAGA GGTCACTCCT GCCCAGTCCA AGGAGGAGGG GCGGGTGATT GCGACCGGCC GGCTTCAGGA GATCGCGAAG GAGGCCGTGA CAAACGTCTC AGCTCTGATC AAGAAGCTCC AGGGAGAGGA CATAACAACA AAGGATGTCC ATATCCAGTT CATCGGCACA TACGAGGGCG TTGAGGGGGA TAGCGCGTCG ATATCGATAG CCACAGCGGT AGTCTCTGCG CTTGAGGGAA TACCAGTAAA GCAGAGCGTC GCGATGACGG GCTCCCTCTC AGTCAGAGGA GATGTGCTAC CCGTCGGAGG GGTAACACAG AAGATAGAGG CGGCTGCCCA GGCCGGGATA AAGACCGTGC TCATACCAAA ATCCAACATG GGCGACGTGC TGGTGGATGA ATCCATACGG GACAAGATAG AGATCATACC CGTGTCCAAT ATCAGCGAGG TCTTTGAGTA CAGTCTTGCC GGGCAGAGGT CCAAGCTCCT CGAGAAGCTC AGGAGATTCG CAACAGAGAA GAAGATAGGT ATCTCAATAC CGGATGGCAT TCCAAGTCCA GCGATGAGCA TACTCTAG
|
Protein sequence | MLSMEEPNEL LGSIQFEDTS SITVPESLID QVIGQEEAVE VIKKAAHQRR HVMLIGSPGT GKSMLGKAMS ELLPVEELQD ILVYHNPEDN NNPRIRVVPA GRGRQIVDAH KMEARKKVQT RNMFFMLIVM GLIVYAYYTG QLLFGIIAAA LIFLSMRYLI PKEDVFIPKL LVDNSGKKTA PFVDATGAHA GALLGDVRHD PFQSGGLETP SHERVECGAI HRAHKGVLFI DEINTLRLES QQSLLTALQE GMYPITGQSE RSSGALVRTE PVPCSFIMVV AGNLDAVQGM HPALRSRIKG YGYEVYMRDT MEDTVENRDK LIRFVAQEVV RDGKIPHFTR DAVAEIIREA KRRSGRKGHL TLMLRDLGGL IRVAGDIARS EGAPLTEARH VIEAKKMARS LEQQMADRYL ERRKEYSMYK SSGDEVGRVN GLAVIGDSGI VLPIMAEVTP AQSKEEGRVI ATGRLQEIAK EAVTNVSALI KKLQGEDITT KDVHIQFIGT YEGVEGDSAS ISIATAVVSA LEGIPVKQSV AMTGSLSVRG DVLPVGGVTQ KIEAAAQAGI KTVLIPKSNM GDVLVDESIR DKIEIIPVSN ISEVFEYSLA GQRSKLLEKL RRFATEKKIG ISIPDGIPSP AMSIL
|
| |