Gene Information |
Locus tag | Mlg_1517 |
Symbol | |
ID | 4269073 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Alkalilimnicola ehrlichii MLHE-1 |
Kingdom | Bacteria |
Replicon accession | NC_008340 |
Strand | - |
Start bp | 1728708 |
End bp | 1731149 |
Gene Length | 2442 bp |
Protein Length | 813 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 638126275 |
Product | peptidase S16, lon domain-containing protein |
Protein accession | YP_742356 |
Protein GI | 114320673 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG1067] Predicted ATP-dependent protease |
TIGRFAM ID | [TIGR00764] lon-related putative ATP-dependent protease |
|
|
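The Start bp, End bp, Gene Length, and Protein Length fields above can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the gene length counts one stop codon; the function names are illustrative and are not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming its length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the record for Mlg_1517.
length = gene_length_bp(1728708, 1731149)
print(length, protein_length_aa(length))  # prints: 2442 813
```

Under these assumptions the computed values agree with the reported 2442 bp and 813 aa.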
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.30513 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 34 |
Fosmid unclonability p-value | 0.334171 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCCCCA TTCGGCCATT GTCCGCCGAC CAACTCTACC GCCCCTGCCG CGCCGACCAG CTCGGCTTTC GCACCACCGA GGAGCTGGAG CGCCTGGAGC TGCCCTGTGG GCAGACGCGG GCGCTGGAGG CGTTGGACTT TGCCACCGGC ATCCGCAACG ACGGCTTCAA CCTCTTCGTG CTCGGGCCGG CTGGCGCGGG CAAGCGGGAG TGGGTCCAGC GCTTTCTGGA GCGCAAGGCC GAGGCGCAGC AGCGGCCGTC GGACTGGGCC TATATCTACA ATTTCGACGC CCCCGATCAG CCCCGGGCCC TGGCGCTGCC CGCCGGGGCC GGCCGGCGCC TGAAGCGGGA CGTGGACGAA CTCACCGATG AGCTGCGCAA CTCCATCCCC GCCACCTTCG AGAGCGACGA GTACCAGTCC CGGATCCAGG AGCTGCAACA GGCGGCGAAT CGTCGCCACC GAGAAGCAAT CGAACAGATC CAACACGAGG CCGAGGAGCA GGGCATTGCC CTGCTGACCA CCCCCTCGGG GTTCACCTTC GCCCCCAAGA AGGAGGGCGA GGTGCTGAGC GCCGAGGAGT TCCAGAAACT GCCCGAGGAG GAGCGCAACG CCATCGAGCA GCGGGTGGAG CAGCTACAGG AGAAGCTCCA GCAGTCCATC CAGCAGTTGC CCCAGATCCA GCGTGAGCTG CGCCAGCAGG TCCGGGAGCT CAATGAGGAG ATGGTGCTGG TGGCCGCCGG GACCCCCATC CGCAATCTCA AGGACGCCTA CAGCCATATC GAGGGGGTGG TCGCCCACCT GGAGGCAGTG CGCAAGGACA TCATCGAGAA CGTGGACGCC CTGCAGGGGG ACAAGCATGG CCGCCACTCG GCGATGGAGG CGGTGCTGGA GCGCTACCGC ATCAACCTCA TCGTCGATCA GTCGGCGCAG ACCGGCGCCC CGGTGGTGTA CGAGGACCTG CCGCTGCACC AGCACCTGGT GGGGCGGATC GAGCACTACG TGCACCAGGG TGCGCTGATG ACCGACTTCA CCCTGATCCG GGGTGGTGCC CTGCACCGGG CCAACGGTGG TTATCTGATC CTGGATGCCC TGCGGGTGTT GCAACAGCCG ATGGCGTGGG AGAGCCTGAA GCGGGCGCTG AGCGCCCACA CCGTGCGCAT CCAGTCGCTG GAGCGGCTCT ACGGCCTGGC CAGCACCGTC AGCCTGGAGC CGGAGCCGAT TCCGCTGCAG CTCAAGGTGG CGCTGGTGGG CGACCGGTTC CTGTACTACC TGCTGGCGGC CTACGACCCC GACTTTCTCG ATCTCTTCAA GGTGCAGGCC GACTTTGAGG ACGACCTGCC CCGGACCGAC GAGAACCAGC AGGATTACGC CCGCATGCTG GCCACCATGG CGCACCAGGA TAAGCTGCGC CCGCTCACCG CCGAGGCGGT GGCCCTGATC ATCGAACAGG GCGGCCGGCT GGCCGATGAC CAGGAGAAGC TCACCGCCCA GGCACGGATG CTGCGCGACC TGCTGGTGGA GGCCGACCAC TGGGCGGCCC GCGACGAGGC CGGGGCGATC GATGCCGCCC ACGTGGAGCG GACTATCGAG CAGCAGCGCT ACCGGGCCGG GCGGGTGCGG GACCGGACGC TGGAGCTGAT CCAGCGCGGT ACGGTCATGA TCGCCACTGA GGGCGAGGCC ATCGCCCAGG TCAACGGCCT GTCGGTGCTG CAGCTCGGCG ACCAGGCCTT CGGCCGACCG ACCCGTATCA CGGCCACGGC CCGGGCCGGC CGCGGCCAAG TGCTGGATAT CGAACGCGAG 
GCCAAACTGG GCGGCAACAT CCACTCCAAG GGCGTGATGA TCCTGTCCCG CTACCTGGCA ACGCGCTATG CCCGGGAGGG GGCGCTCTCG CTCTCGGCCA GCCTCGCCTT CGAGCAGTCC TACGGCGGGG TGGAGGGCGA CAGCGCCTCG GTGGCCGAAC TCTGCGCCCT GGTCTCCGCC ATCGGCCGGG CGCCGATCAG GCAGTCGCTG GCGGTGACCG GCTCGGTCAA CCAGCACGGC GAGGTGCAGG CGGTCGGCGG CGTCAATGAG AAGATCGAGG GCTTTTTCGA GGTCTGCCGC GGGGCCGGGA CCTTGGACGG GCAGGGCGTG CTCCTGCCCG AGGCCAATGT GCCCCACCTG ATGCTGCGCC GGGAGGTGCG CGAGACGGTG GCCGCCGGGC AGTTCCATGT CTATCCCATC CGCCATGTGG ACCAGGCCCT GGAGTTGCTG ACCGGGCTGC CGGTGGGCGA GGCGGACGCC GAGGGGGGCT ATCCGGAGGG CAGCTTGAAC CGCCGGGTGG CGGACCGGTT GGAGGCCTTC GGCCGATCGG TGCGCCGGCA GAGTCAGGAC GACAACGGCG AGGGGGGCGG CCCCCGGACG GAGGAGGGTG ACACCTCGCC GCGTGGGGGT GACGATGAGT GA
|
Protein sequence | MTPIRPLSAD QLYRPCRADQ LGFRTTEELE RLELPCGQTR ALEALDFATG IRNDGFNLFV LGPAGAGKRE WVQRFLERKA EAQQRPSDWA YIYNFDAPDQ PRALALPAGA GRRLKRDVDE LTDELRNSIP ATFESDEYQS RIQELQQAAN RRHREAIEQI QHEAEEQGIA LLTTPSGFTF APKKEGEVLS AEEFQKLPEE ERNAIEQRVE QLQEKLQQSI QQLPQIQREL RQQVRELNEE MVLVAAGTPI RNLKDAYSHI EGVVAHLEAV RKDIIENVDA LQGDKHGRHS AMEAVLERYR INLIVDQSAQ TGAPVVYEDL PLHQHLVGRI EHYVHQGALM TDFTLIRGGA LHRANGGYLI LDALRVLQQP MAWESLKRAL SAHTVRIQSL ERLYGLASTV SLEPEPIPLQ LKVALVGDRF LYYLLAAYDP DFLDLFKVQA DFEDDLPRTD ENQQDYARML ATMAHQDKLR PLTAEAVALI IEQGGRLADD QEKLTAQARM LRDLLVEADH WAARDEAGAI DAAHVERTIE QQRYRAGRVR DRTLELIQRG TVMIATEGEA IAQVNGLSVL QLGDQAFGRP TRITATARAG RGQVLDIERE AKLGGNIHSK GVMILSRYLA TRYAREGALS LSASLAFEQS YGGVEGDSAS VAELCALVSA IGRAPIRQSL AVTGSVNQHG EVQAVGGVNE KIEGFFEVCR GAGTLDGQGV LLPEANVPHL MLRREVRETV AAGQFHVYPI RHVDQALELL TGLPVGEADA EGGYPEGSLN RRVADRLEAF GRSVRRQSQD DNGEGGGPRT EEGDTSPRGG DDE
|
| |
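The gene and protein sequences above are displayed in space-separated blocks of ten characters. A minimal sketch of how such a sequence might be cleaned, checked for GC content, and translated is shown below. It assumes the gene sequence is given on the coding strand (it starts with ATG and ends with TGA), uses the standard codon assignments that translation table 11 shares with table 1, and ignores table 11's alternative start codons. All helper names are hypothetical.

```python
# Codon table in NCBI order (bases T, C, A, G); amino-acid assignments of
# translation table 11 match the standard code, only start codons differ.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block-separating whitespace and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon.

    Raises KeyError on codons containing characters other than T, C, A, G.
    """
    s = clean(seq)
    protein = []
    for pos in range(0, len(s) - 2, 3):
        amino_acid = CODON_TABLE[s[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence from the record.
sample = "ATGACCCCCA TTCGGCCATT GTCCGCCGAC"
print(translate(sample))  # prints: MTPIRPLSAD
print(gc_fraction(sample))
```

The translated sample matches the first block of the protein sequence in the record. Passing the full gene sequence to `gc_fraction` gives a value to compare against the reported GC content field.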