Gene Mlg_0820 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_0820 
Symbol 
ID4270651 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp924246 
End bp926264 
Gene Length2019 bp 
Protein Length672 aa 
Translation table11 
GC content72% 
IMG OID638125571 
Productpeptidase M48, Ste24p 
Protein accessionYP_741664 
Protein GI114319981 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0501] Zn-dependent protease with chaperone function 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones43 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACTTTT TCGAGCATCA GGACCGCGCC CGGCGTACCA CCCTCTGGTT GATCCTCTTC 
TTCGTGCTCG GGGTGATCGC CATCGCCGTG GTGGTCAACG CCCTGGCCCT CTTCTTCCTC
GGCGAGCCCC CGCCGGCCGG CGCACCGCCG GAGCACTGGC TAAGCCAGAA CCTGGAGCTG
CTGATCACCA CCACGGTGCT GGTGGTGGCC GGCATCGGGC TGGCCAGCGC CTTCCGGGTG
GCGAGCCTCT CCGGCGGCGG CAGCAAGGTG GCGGAGATGC TGGGCGGCAC CCGCGTCACC
CCCGACACCC GGGACCCGAA ACGGCGCCAG CTGCTCAACG TGGTGGAGGA GGTGGCCCTG
GCCTCCGGCA CGCCGGTGCC GGATGTCTAC GTACTGGAGG AGGAGGCCGC CATCAACGCC
TTTGCCGCCG GCTACAGCCA GAGTGATGCG GCGGTGGCCG TCACCCGCGG CACCCTGGAG
AAACTCAACC GCGAGGAACT GCAGGGCGTG GTGGCCCACG AGTTCGCCCA CATCGTCAAC
GGTGATATGC GCCTGAACAT CCGGCTGATG GGGGTGGTGT TCGGGCTGTT GGTGCTCACC
GTGGTGGGCC GGTTCATGAC CCGCGCCATC TTTGTCGGCG GCGGCAGCCG GGAGGGCAAA
CAGGCCGCCA TGGGTATCGC CGCCCTGGGA CTGGCGCTGA TCCTGGTGGG GGCGCTGGGG
GTGTTCTTCG GCCGGCTGAT CAAGGCCGCG GTCTCGCGCC AGCGCGAGTT CCTGGCCGAC
GCCTCGGCGG TGCAGTACAC CCGTAACCCG GACAGCATCG GCGGCGCACT GAAGAAGATC
GCCGTGCACA GCCGCGGCTC CGGGCTGGAG TCGCCGGAGA CCGAGGAGGT CAGCCACATG
CTCTTCGCCT CGGGGTTTGC CTCCATGAGC GGCCTGCTGG CCACGCACCC CCCGCTGGAG
GATCGCATCC GCGCCATCGA GCCACAATTC GACCCGGAGC GCGACCTGCC AGCCCTCGCC
GAGCGGGAGC AGCGCCGCCG GGCACGGGAA GAGGCCGAGG CGGAGCGCCG GCGCGAGGCC
GAACGCGCCG CCGCCGAGGG CCAGGGGCAA GGCGTGCGCA TTCCCGGCGC CATACCCCTG
CCCGGTACCG ACGCCCTGCC CCAGGGCGCG ATCCTGGGCG CCATCCTGGC CGACGTGGAC
CAGCCGGACA CCCGCCGCCA CCAGGCCGCC GCCCAATTGC TCCACGCCCT GCCCGAACCC
CTGCGGGACG CCGTCCACGG TGAGGACGCC GGGCTGGCGG TGCTGTACAC CGTCATCAGC
GAAGACCCGG AGGTGCGCCG CCAGCAGCTG GCGCGGATCC GGGAGGACTG GGGCGAGGAC
GCCGAGGCCC GGGTCCGCGA GTGGCTGGAG GACGACCAGT CGTTGGCGCC CGGGCAGCGG
CTGCCGCTGG TGGAACTGGC CCTCCCCGCC CTGCGCCACC AGCCGCGGGA GCGCCTGGGT
GAGCTGCGCG AGACCCTGGG CGCGCTGATC CGCGCCGACG GCGGGGTTTC GGTCTTCGAG
TTCGCCTTGG CACGCATGTT TGACGCCCAC CTGCGCGATA TCCTCAACCC CGGCGCGGCC
GACCGGGGGC ACGCCCGGGT CAACGCCGAG AAGGCGGCCG CCATCCAGCA GTTGCTCTCG
GTACTCGCCT GGGCGGGGGC CGAGGGTGAC GAGGACGCCG CACGGCAGGC CTACGCCGCG
GGGATGGCCC TGCTCTACCG GGAAAACCGG CCGCCGGCCT ACGGGGTGCC GGCGGACTGG
CCCAACACCC TTACCGAGGG CTTGGAGCGG CTGGACCGCC TGGGCGCTGG GCCCAAACGG
CGGCTGCTGG AGGCCATGGT GGCCACCGTG GGGCACAACG GCCAGGTCAA CGTGGCCGAG
GCGGAGTTGC TGCGGGCACT GGCGGCGGCC TTGCACGTAC CCATACCGCT GATTCTGCCC
ACCACCGAAG AAAGCGGCGG CGACGGCGTG TCGCCGTAA
 
Protein sequence
MDFFEHQDRA RRTTLWLILF FVLGVIAIAV VVNALALFFL GEPPPAGAPP EHWLSQNLEL 
LITTTVLVVA GIGLASAFRV ASLSGGGSKV AEMLGGTRVT PDTRDPKRRQ LLNVVEEVAL
ASGTPVPDVY VLEEEAAINA FAAGYSQSDA AVAVTRGTLE KLNREELQGV VAHEFAHIVN
GDMRLNIRLM GVVFGLLVLT VVGRFMTRAI FVGGGSREGK QAAMGIAALG LALILVGALG
VFFGRLIKAA VSRQREFLAD ASAVQYTRNP DSIGGALKKI AVHSRGSGLE SPETEEVSHM
LFASGFASMS GLLATHPPLE DRIRAIEPQF DPERDLPALA EREQRRRARE EAEAERRREA
ERAAAEGQGQ GVRIPGAIPL PGTDALPQGA ILGAILADVD QPDTRRHQAA AQLLHALPEP
LRDAVHGEDA GLAVLYTVIS EDPEVRRQQL ARIREDWGED AEARVREWLE DDQSLAPGQR
LPLVELALPA LRHQPRERLG ELRETLGALI RADGGVSVFE FALARMFDAH LRDILNPGAA
DRGHARVNAE KAAAIQQLLS VLAWAGAEGD EDAARQAYAA GMALLYRENR PPAYGVPADW
PNTLTEGLER LDRLGAGPKR RLLEAMVATV GHNGQVNVAE AELLRALAAA LHVPIPLILP
TTEESGGDGV SP