Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mpal_2694 |
Symbol | |
ID | 7272516 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methanosphaerula palustris E1-9c |
Kingdom | Archaea |
Replicon accession | NC_011832 |
Strand | - |
Start bp | 2823917 |
End bp | 2825833 |
Gene Length | 1917 bp |
Protein Length | 638 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 643571285 |
Product | ATP-dependent protease Lon |
Protein accession | YP_002467681 |
Protein GI | 219853249 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG1067] Predicted ATP-dependent protease |
TIGRFAM ID | [TIGR00764] lon-related putative ATP-dependent protease |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGACAGTA CTGTAAATGA GGAGCCAACG CACGAGGATA CCATTTTAGA GAACGTGCAG GTGGAGACCT CATCCGAGAT TGAGGTACCG GCGCATCTGA TCGATCAGGT GATCGGTCAG GAGCATGCGG TTGAGGTGAT CAGGAAGGCG GCGATACAAC GCCGTCATGT GATGATGATC GGTAGCCCGG GGACCGGGAA GTCGATGCTT GCAAAAGCAA TGGCTGAACT GCTCCCCAAA GAGGAGCTGC AGGACCTGCT GGTCTACCCC AATGCCGAGG ACTCAAACAA TCCTATCATC AGGACCGTCC CTGCAGGAAA GGCAAAGCAG ATCGTCGGTG CTCATAAAGC TGAGGCCAAG AAGCGGGCGC AGTTCCGGAA CACGTTGATG ATGTTGCTGA TGGTCGGGAT CATCGGCTAC TCATTCATCA CGATGCAGTG GCTGATGGGG ATCATCGCTG CAGCCTTCGT CTTCATGGCT CTCCGGTACA GCACTCCCAA GGATGAGGCG ATGATCCCAA AGTTGCTCGT CTCCAATGAT ACCACCGCGA CAGCTCCGTT CATCGATGCG ACCGGTTCCC AGGCCGGCGC CTTGCTCGGG GATGTTCGGC ATGACCCGTT CCAGAGCGGC GGGCTTGAGA CCCCTGCCCA TGACCGTGTG GAGTCTGGGG CGATCCACCG TGCTAACGGA GGTGTGCTCT TCATCGATGA GATCAATACC CTCTCGCCGG GTTCACAACA GAACCTGCTG ACAGCACTGC AAGAGGGTGA ATTTCCCATC ACCGGGCAAA GTGAACGCTC AAGCGGTGCG ATGGTCAGAA CCGAACCGGT CCCGTGCCGA TTCGTGATGA TCGCAGCCGG CAACCTGGAC GCGGTCCAGG GGATGCATCC GGCCCTCCGG TCCCGTATCA GGGGGTACGG TTACGAGGTT TTCATGTCCG AGTCGATGGA GGAGACCCCT GAGAACCGTG AGAAGTTCAT CAGGTTCATT GCCCAGGAGA TCAAGAACGA CGGCAAGATC CCACACTTCG ACCAGGGTGC AATGGCAGAG GTGCTCAGAG AGGGCCGCCG CCGGTCAGGG CGCAAAGGGC ACCTGACCTT GAAACTGCGT GACATGGGTG GATTGATCCG GGTGGCCGGG GACCTGGCCA GGCAGGATGG GGTCGAACTG ACGACCGCTG CCCACGTGCT TGCAGCCAAG GAGACCGCTC GTTCGATCGA GGATCAGATC TCTGATGAGA ACAGCCGGCG GTTGAAGGAC TATGATCTCT CGGTGGTGAA GGGGACAAGC ATCGGTCGGG TGAACGGGCT TGCCGTGACC GGGGCCGACT CGGGCTCGGT GCTCCCGATC ATGGCCGAGG TCACCCTCAG CCAGAGCCAG TTCGGCCAGG TGATCGCCAC CGGGCTGCTC AAGGAGATCG CCCAGGAGTC GATCACCAAT GTCTCAGCGA TCATCAAGAA GTTCACCGGG CAGGACATCC AGAAGCTTGA CATCCATATC CAGTTCATCG GCACCTACGG TGGTGTGGAC GGCGACTCGG CATCGGTTAG TGTGGCCACG GCTGTGATCA GTGCTATCGA GGGGATCCCG GTCAGGCAGG ATCTCGCGAT GACCGGGTCG CTCTCGGTTC GTGGGGACGT CCTTCCGATT GGGGGGGTCA CCTACAAGAT CGAGGCTGCG GCCAAGGCAG GGATCAAGAA GGTACTGATC CCGGCCTCGA ACATGAACGA TGTGATGATC GAGGAGCGGT ACCGCTCAAT GATCGAGATC GTTCCGGTCT ATCATATCGA GGACGTGCTG AAGGAGGCCC TGGTCCCGGA GAACGAGGCG GGTTTCCTTT CCAAGATTAA GAACATGGCC TCGCGGCCGG CCGCCAACCT CCTCGACAAG ACCGGAATCC GTCCAACGGT GATCTGA
|
Protein sequence | MDSTVNEEPT HEDTILENVQ VETSSEIEVP AHLIDQVIGQ EHAVEVIRKA AIQRRHVMMI GSPGTGKSML AKAMAELLPK EELQDLLVYP NAEDSNNPII RTVPAGKAKQ IVGAHKAEAK KRAQFRNTLM MLLMVGIIGY SFITMQWLMG IIAAAFVFMA LRYSTPKDEA MIPKLLVSND TTATAPFIDA TGSQAGALLG DVRHDPFQSG GLETPAHDRV ESGAIHRANG GVLFIDEINT LSPGSQQNLL TALQEGEFPI TGQSERSSGA MVRTEPVPCR FVMIAAGNLD AVQGMHPALR SRIRGYGYEV FMSESMEETP ENREKFIRFI AQEIKNDGKI PHFDQGAMAE VLREGRRRSG RKGHLTLKLR DMGGLIRVAG DLARQDGVEL TTAAHVLAAK ETARSIEDQI SDENSRRLKD YDLSVVKGTS IGRVNGLAVT GADSGSVLPI MAEVTLSQSQ FGQVIATGLL KEIAQESITN VSAIIKKFTG QDIQKLDIHI QFIGTYGGVD GDSASVSVAT AVISAIEGIP VRQDLAMTGS LSVRGDVLPI GGVTYKIEAA AKAGIKKVLI PASNMNDVMI EERYRSMIEI VPVYHIEDVL KEALVPENEA GFLSKIKNMA SRPAANLLDK TGIRPTVI
|
| |