Gene Mlg_2545 details

Gene Information

Locus tag: Mlg_2545
Symbol:
ID: 4270933
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Alkalilimnicola ehrlichii MLHE-1
Kingdom: Bacteria
Replicon accession: NC_008340
Strand:
Start bp: 2887649
End bp: 2889319
Gene Length: 1671 bp
Protein Length: 556 aa
Translation table: 11
GC content: 71%
IMG OID: 638127304
Product: periplasmic sensor signal transduction histidine kinase
Protein accession: YP_743375
Protein GI: 114321692
COG category: [T] Signal transduction mechanisms
COG ID: [COG0642] Signal transduction histidine kinase
TIGRFAM ID:
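
The start, end, gene length, and protein length fields can be cross-checked with simple arithmetic. The sketch below assumes that the coordinates are inclusive and that the final codon of the CDS is a stop codon not counted in the protein length; neither assumption is stated in the record, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with inclusive start/end coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino acids encoded by a CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(2887649, 2889319)
print(length, protein_length(length))  # 1671 556
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields of the record.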


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.668362
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 29
Fosmid unclonability p-value: 0.0765765
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGATGACCA CCGCGCGGGC GAACGATTGG GCGGCGCTGC AGGTCTTCTG CGGCTACCGC 
CTGGTGGTGG TGCTGGCGCT GCTGTTGCTC TTCATCTGGG CCCGGGGCGA ACCCTTACTG
CTGGCCGTGC GCTGGACGGA CGTCTTCCTG GCCACCCTGC TGGCCTACCT CGCCTGGTCG
GTGGTGGCCT TATGGCTGCA GCAGCGACGG GTTCCGGCCT TCTCCCTGCA GCTTTACGCC
CAGTTGGGCG TGGATGTCCT GGCACTGAGT CTGCTGGTAG CGGCCACCGG TCGGATGGAC
GGCGGGCTTG CCCTGCTGGT GTTGATCGTG GTGGCCGGTG GCAGCCTGAT GCTGGCCAAT
CTGCGCCTGG CGCTGGGGTT GGCGGCCATG GCCACGCTCG CCCTGCTGGC GGTCCAGGGC
TTCGTGGCCC TCTACGCCGA CGGCGCCGCT GAGGGCTACA CCCTGGTGGG GATGTACGGC
ATGGGCCTGT TTCTGCTCGG CGCCGGCGGT AGCCTGCTGG CCATCCGCGT GCGTACGGCG
CAGGCCCTGG CCGAACGCCG CGGGGTGGAC CTGGCCAACA TGCAGGCGCT AAACGAGCAC
ATCGTCCAGC ACATGGAACC GGGGGTGGTG GTGGTGGACG GGGCCGGCAT CATCCGGCTG
CTCAACCATT CCGCCATGGG CTGGCTGGCC AGCGGCCGCG GCGCGGCGCT GGAGCATGTG
GCGCCGACCC TGGATCTCGC AGTGCGGCGC TGGCGGCGGG GCCGGGTGGG TTCCGGCCTG
GTGGTGCCGG TGCAGGCGCG TGGCGCCGAG GTGCGGGTGA ACATCTCCGC GCTGGGCGCG
GACCCGGAGG GGCCGTTGTT GCTGTTGCTG GAGGACCAAG CGGAACTGCG TGCCCGGGTG
CAGCAGGCCA AACTGGCCGC CCTCGGCCGG CTGACGGCCA GTATCGCCCA CGAGATCCGC
AATCCGCTCA GTGCCATTCT GCATGCCGGG CAACTGCTGG CCGAGTCGCC GGATCTGAGC
GAGGATGACC GGCGGTTGCT GGACATCGTT CGCCGCCACG GCCGGCGGCT CAACACCATC
GTCGAGGACG TGCAGCAACT CTCCCGGCGG GGACGGGCGC GGCGGGAGGC GGTGGCGCTG
GACGCCTTTC TTCAGGAGTT CCTGCAGCGC TGGGGCGAGC AGCACGGCCG GGAGGGGGCC
CGCATCCGCT GTCGGGTGAC GCCGGCGGGG CTGTTGGTGC TGTTCGACCC CAACCACCTC
CACCAAGTGT TGACCAACCT GGTGGAGAAC GCCGTTCGCC ACGCCTCGGA CGGCCGCCCG
AGGGTGACGG TCACCCTGAG CGGACGGCAG CCGCAAGCCG GGGAGGCGTG GCTGGAGATC
TGCGACGATG GCCCCGGTGT CGGGCGGGAT ATCGCCAACA GTGTGTTCGA GCCCTTTTTC
ACCAGCCGGC CGTCCGGGTC TGGGCTGGGG CTCTTCATCT GCCGGGAGCT CTGCGAGAGC
AATCGGGCCG ATCTCCGCCT GAGCAACCCG GGCGAGGCGG GGGCCTGCTT CCGGCTGACG
TTGCAGATGG CGCCAGCGGG GGTACCGGCG GGCTGGCAGG AGCCGGAGAC CGAGGTCCGG
CTCAGCCGGT CTGCGGATGG GGACGCGCCA GCGCCGGCAG CTCGCCGCTG A
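
The GC content field in the gene information is the fraction of G and C bases in a nucleotide sequence such as the one above. A minimal sketch of that calculation follows; `gc_content` is a hypothetical helper, and it assumes the spaces and line breaks in the listing are only visual grouping.

```python
def gc_content(seq: str) -> float:
    """Fraction of G/C bases in a sequence, ignoring whitespace grouping."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return gc / len(bases)


# First two 10-base groups of the gene sequence above.
print(gc_content("GTGATGACCA CCGCGCGGGC"))
```

Passing the full gene sequence, line breaks included, gives the value to compare against the GC content field.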
 
Protein sequence
MMTTARANDW AALQVFCGYR LVVVLALLLL FIWARGEPLL LAVRWTDVFL ATLLAYLAWS 
VVALWLQQRR VPAFSLQLYA QLGVDVLALS LLVAATGRMD GGLALLVLIV VAGGSLMLAN
LRLALGLAAM ATLALLAVQG FVALYADGAA EGYTLVGMYG MGLFLLGAGG SLLAIRVRTA
QALAERRGVD LANMQALNEH IVQHMEPGVV VVDGAGIIRL LNHSAMGWLA SGRGAALEHV
APTLDLAVRR WRRGRVGSGL VVPVQARGAE VRVNISALGA DPEGPLLLLL EDQAELRARV
QQAKLAALGR LTASIAHEIR NPLSAILHAG QLLAESPDLS EDDRRLLDIV RRHGRRLNTI
VEDVQQLSRR GRARREAVAL DAFLQEFLQR WGEQHGREGA RIRCRVTPAG LLVLFDPNHL
HQVLTNLVEN AVRHASDGRP RVTVTLSGRQ PQAGEAWLEI CDDGPGVGRD IANSVFEPFF
TSRPSGSGLG LFICRELCES NRADLRLSNP GEAGACFRLT LQMAPAGVPA GWQEPETEVR
LSRSADGDAP APAARR
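
The protein sequence corresponds to the gene sequence read with translation table 11. The gene begins with GTG, yet the protein begins with M, which is consistent with GTG acting as an initiation codon. The sketch below is an illustration, not the database's own method: it assumes the NCBI base order TCAG with the standard amino acid assignments, and the start codon set is my reading of NCBI table 11. All names are hypothetical.

```python
BASES = "TCAG"
# Amino acid assignments in NCBI order (assumed identical to the standard code).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
# Initiation codons assumed for table 11; each is read as Met at position 1.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(seq: str) -> str:
    """Translate a CDS, reading a valid first codon as M and stopping at '*'."""
    dna = "".join(seq.split()).upper()
    if len(dna) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for pos in range(0, len(dna), 3):
        codon = dna[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            protein.append("M")
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate_cds("GTGATGACCA CCGCGCGGGC GAACGATTGG"))  # MMTTARANDW
```

The output matches the first ten residues of the protein sequence listed above.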