Gene Mlg_1517 details

Gene Information

Locus tag: Mlg_1517
Symbol:
ID: 4269073
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Alkalilimnicola ehrlichii MLHE-1
Kingdom: Bacteria
Replicon accession: NC_008340
Strand:
Start bp: 1728708
End bp: 1731149
Gene Length: 2442 bp
Protein Length: 813 aa
Translation table: 11
GC content: 69%
IMG OID: 638126275
Product: peptidase S16, lon domain-containing protein
Protein accession: YP_742356
Protein GI: 114320673
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG1067] Predicted ATP-dependent protease
TIGRFAM ID: [TIGR00764] lon-related putative ATP-dependent protease
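
The start, end, and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the coordinates are 1-based and inclusive and that the CDS ends with a single untranslated stop codon. The variable names are hypothetical and not part of the database.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1728708
end_bp = 1731149

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not translated,
# so the protein has one residue fewer than the number of codons.
protein_length = gene_length // 3 - 1

print(gene_length)     # 2442, matching "Gene Length"
print(protein_length)  # 813, matching "Protein Length"
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields.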


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.30513
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 34
Fosmid unclonability p-value: 0.334171
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCCCCA TTCGGCCATT GTCCGCCGAC CAACTCTACC GCCCCTGCCG CGCCGACCAG 
CTCGGCTTTC GCACCACCGA GGAGCTGGAG CGCCTGGAGC TGCCCTGTGG GCAGACGCGG
GCGCTGGAGG CGTTGGACTT TGCCACCGGC ATCCGCAACG ACGGCTTCAA CCTCTTCGTG
CTCGGGCCGG CTGGCGCGGG CAAGCGGGAG TGGGTCCAGC GCTTTCTGGA GCGCAAGGCC
GAGGCGCAGC AGCGGCCGTC GGACTGGGCC TATATCTACA ATTTCGACGC CCCCGATCAG
CCCCGGGCCC TGGCGCTGCC CGCCGGGGCC GGCCGGCGCC TGAAGCGGGA CGTGGACGAA
CTCACCGATG AGCTGCGCAA CTCCATCCCC GCCACCTTCG AGAGCGACGA GTACCAGTCC
CGGATCCAGG AGCTGCAACA GGCGGCGAAT CGTCGCCACC GAGAAGCAAT CGAACAGATC
CAACACGAGG CCGAGGAGCA GGGCATTGCC CTGCTGACCA CCCCCTCGGG GTTCACCTTC
GCCCCCAAGA AGGAGGGCGA GGTGCTGAGC GCCGAGGAGT TCCAGAAACT GCCCGAGGAG
GAGCGCAACG CCATCGAGCA GCGGGTGGAG CAGCTACAGG AGAAGCTCCA GCAGTCCATC
CAGCAGTTGC CCCAGATCCA GCGTGAGCTG CGCCAGCAGG TCCGGGAGCT CAATGAGGAG
ATGGTGCTGG TGGCCGCCGG GACCCCCATC CGCAATCTCA AGGACGCCTA CAGCCATATC
GAGGGGGTGG TCGCCCACCT GGAGGCAGTG CGCAAGGACA TCATCGAGAA CGTGGACGCC
CTGCAGGGGG ACAAGCATGG CCGCCACTCG GCGATGGAGG CGGTGCTGGA GCGCTACCGC
ATCAACCTCA TCGTCGATCA GTCGGCGCAG ACCGGCGCCC CGGTGGTGTA CGAGGACCTG
CCGCTGCACC AGCACCTGGT GGGGCGGATC GAGCACTACG TGCACCAGGG TGCGCTGATG
ACCGACTTCA CCCTGATCCG GGGTGGTGCC CTGCACCGGG CCAACGGTGG TTATCTGATC
CTGGATGCCC TGCGGGTGTT GCAACAGCCG ATGGCGTGGG AGAGCCTGAA GCGGGCGCTG
AGCGCCCACA CCGTGCGCAT CCAGTCGCTG GAGCGGCTCT ACGGCCTGGC CAGCACCGTC
AGCCTGGAGC CGGAGCCGAT TCCGCTGCAG CTCAAGGTGG CGCTGGTGGG CGACCGGTTC
CTGTACTACC TGCTGGCGGC CTACGACCCC GACTTTCTCG ATCTCTTCAA GGTGCAGGCC
GACTTTGAGG ACGACCTGCC CCGGACCGAC GAGAACCAGC AGGATTACGC CCGCATGCTG
GCCACCATGG CGCACCAGGA TAAGCTGCGC CCGCTCACCG CCGAGGCGGT GGCCCTGATC
ATCGAACAGG GCGGCCGGCT GGCCGATGAC CAGGAGAAGC TCACCGCCCA GGCACGGATG
CTGCGCGACC TGCTGGTGGA GGCCGACCAC TGGGCGGCCC GCGACGAGGC CGGGGCGATC
GATGCCGCCC ACGTGGAGCG GACTATCGAG CAGCAGCGCT ACCGGGCCGG GCGGGTGCGG
GACCGGACGC TGGAGCTGAT CCAGCGCGGT ACGGTCATGA TCGCCACTGA GGGCGAGGCC
ATCGCCCAGG TCAACGGCCT GTCGGTGCTG CAGCTCGGCG ACCAGGCCTT CGGCCGACCG
ACCCGTATCA CGGCCACGGC CCGGGCCGGC CGCGGCCAAG TGCTGGATAT CGAACGCGAG
GCCAAACTGG GCGGCAACAT CCACTCCAAG GGCGTGATGA TCCTGTCCCG CTACCTGGCA
ACGCGCTATG CCCGGGAGGG GGCGCTCTCG CTCTCGGCCA GCCTCGCCTT CGAGCAGTCC
TACGGCGGGG TGGAGGGCGA CAGCGCCTCG GTGGCCGAAC TCTGCGCCCT GGTCTCCGCC
ATCGGCCGGG CGCCGATCAG GCAGTCGCTG GCGGTGACCG GCTCGGTCAA CCAGCACGGC
GAGGTGCAGG CGGTCGGCGG CGTCAATGAG AAGATCGAGG GCTTTTTCGA GGTCTGCCGC
GGGGCCGGGA CCTTGGACGG GCAGGGCGTG CTCCTGCCCG AGGCCAATGT GCCCCACCTG
ATGCTGCGCC GGGAGGTGCG CGAGACGGTG GCCGCCGGGC AGTTCCATGT CTATCCCATC
CGCCATGTGG ACCAGGCCCT GGAGTTGCTG ACCGGGCTGC CGGTGGGCGA GGCGGACGCC
GAGGGGGGCT ATCCGGAGGG CAGCTTGAAC CGCCGGGTGG CGGACCGGTT GGAGGCCTTC
GGCCGATCGG TGCGCCGGCA GAGTCAGGAC GACAACGGCG AGGGGGGCGG CCCCCGGACG
GAGGAGGGTG ACACCTCGCC GCGTGGGGGT GACGATGAGT GA
 
Protein sequence
MTPIRPLSAD QLYRPCRADQ LGFRTTEELE RLELPCGQTR ALEALDFATG IRNDGFNLFV 
LGPAGAGKRE WVQRFLERKA EAQQRPSDWA YIYNFDAPDQ PRALALPAGA GRRLKRDVDE
LTDELRNSIP ATFESDEYQS RIQELQQAAN RRHREAIEQI QHEAEEQGIA LLTTPSGFTF
APKKEGEVLS AEEFQKLPEE ERNAIEQRVE QLQEKLQQSI QQLPQIQREL RQQVRELNEE
MVLVAAGTPI RNLKDAYSHI EGVVAHLEAV RKDIIENVDA LQGDKHGRHS AMEAVLERYR
INLIVDQSAQ TGAPVVYEDL PLHQHLVGRI EHYVHQGALM TDFTLIRGGA LHRANGGYLI
LDALRVLQQP MAWESLKRAL SAHTVRIQSL ERLYGLASTV SLEPEPIPLQ LKVALVGDRF
LYYLLAAYDP DFLDLFKVQA DFEDDLPRTD ENQQDYARML ATMAHQDKLR PLTAEAVALI
IEQGGRLADD QEKLTAQARM LRDLLVEADH WAARDEAGAI DAAHVERTIE QQRYRAGRVR
DRTLELIQRG TVMIATEGEA IAQVNGLSVL QLGDQAFGRP TRITATARAG RGQVLDIERE
AKLGGNIHSK GVMILSRYLA TRYAREGALS LSASLAFEQS YGGVEGDSAS VAELCALVSA
IGRAPIRQSL AVTGSVNQHG EVQAVGGVNE KIEGFFEVCR GAGTLDGQGV LLPEANVPHL
MLRREVRETV AAGQFHVYPI RHVDQALELL TGLPVGEADA EGGYPEGSLN RRVADRLEAF
GRSVRRQSQD DNGEGGGPRT EEGDTSPRGG DDE
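
Both sequences above are printed in space-separated blocks of 10 characters, so the whitespace has to be removed before any analysis. The following is a minimal sketch of how that might be done, together with a GC-percentage helper of the kind that could be used to check the GC content field. The function names `clean_sequence` and `gc_percent` are hypothetical, and only the first and last lines of the gene sequence are included here as sample input.

```python
def clean_sequence(block: str) -> str:
    """Remove all whitespace from a blocked sequence and uppercase it."""
    return "".join(block.split()).upper()


def gc_percent(dna: str) -> float:
    """Return the percentage of G and C bases in a DNA sequence."""
    seq = clean_sequence(dna)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First and last lines of the gene sequence, copied from the record.
first_line = "ATGACCCCCA TTCGGCCATT GTCCGCCGAC CAACTCTACC GCCCCTGCCG CGCCGACCAG"
last_line = "GAGGAGGGTG ACACCTCGCC GCGTGGGGGT GACGATGAGT GA"

# The gene starts with an ATG codon and ends with a TGA stop codon.
print(clean_sequence(first_line)[:3])
print(clean_sequence(last_line)[-3:])
```

Passing the full gene sequence block to `gc_percent` would give a value to compare against the reported GC content.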