Gene Mlg_1802 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_1802 
Symbol 
ID4268721 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp2060677 
End bp2062047 
Gene Length1371 bp 
Protein Length456 aa 
Translation table11 
GC content67% 
IMG OID638126558 
ProductGAF sensor signal transduction histidine kinase 
Protein accessionYP_742636 
Protein GI114320953 
COG category[T] Signal transduction mechanisms 
COG ID[COG2205] Osmosensitive K+ channel histidine kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones47 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATACATA AGAGGTTGTC ACCCGAGATC ATGCCAAACA ATACCGGCCA TCCCGCGCCC 
CAGGACCCCC GTGAACTGCA ACAGGCGCTG GTGGAGGTCA GCCTGCGGCT GGAGCGCCGG
GTACGGGAAC TGGAGGCGCT GGTCAGCGTC ACCGAACGAA TCAACCGCGG GCTGTTGCTG
GACGACGTGC TGGACGAGAT CTACGACTCC TTCCGGCCGA TCATCCCCTA TGACCGGATC
GGTGTCGCCC TGCTCGAGGA GAACGACCAG GGCGAGCAAC TGGTCCGCTC GCGCTGGGCA
CGCTCCGACC ACCGCACCGC CCACATCGGC CCGGGCTACG CCGCGCCGCT GGCCGGCAGT
AGCCTGGAGC GCATCCTCAA GACCGGCGAG CCGCGCATCA TCAACGACCT GCAGGAACAC
CTGCTGCACA ACCCGCGCTC CAAGTCCACC CAGCAGATCC TCAAAGACGG CGTGCGCTCC
AGCCTCACCT GCCCGCTGGT GGCCATGGGC CGTCCGGTGG GCTTCATCTT CTTCTCCAGC
AACGCGCCCA ACACCTACCG CAACGCCCAC ATCGCCACCT TCCAGCAGAT CGCCGGGCAG
CTCGCCACCA TCCTGGAAAA GAGCCGCCTT TACGAGCGGC TGATGGAGCT CAACGACCTC
AAAAACCGCT TCCTGGGCGT GGCGGCCCAC GACCTGCGCA ACCCGTTGGG GGTGCTCAAC
GGCTACATCG ATCTGCTGCG CCAGGAGGCC CTCGGCCCAT TGAACGAGGC CCAACAGGAG
GTGATGGGGG TGATGGCGGA CGTCGCCGAG CGCATGAGCG CGCTGGTGGA GGACCTGCTC
GATGTCAGCG CCATCGAATC GGGGCAGTTG GAGCTGGAGC GCGAACCGCT GGACCTCAAC
CGCTTCCTGC AGGGGCAGGC CCACGCCCAA GGGCTGATCG CCCAGGGCAA GTCGATCCGC
ATCGTGCTGG ACATCCCGGA ACCGCTGCCC ACGGTGGCGG TGGACAGCCG CCGCCTGGGC
CAGGTGCTGG ACAACTTGAT CGCCAACGCG GTCAAGTTCT CGCCCAGGGA CAGCACCATC
ACCCTCGGCG GCCGCGCCGA CGATCAGTCG GTTCGGATCA GCGTCAGTGA CCAGGGCCCG
GGCATCCCGG CCGAGGAACG GGCGCAGCTC TTCCAGCCCT TCCGGCGCGG CAGTAACGCC
CCCACCGCCG ATGAGAAGAG CACCGGGCTG GGTCTGTCCA TTGTCCAGAA ACTGGTTCAG
GCCCACGGCG GACAGGTGGC GGTGGACGCG GCCCCCGGCG GCGGCGCCCG CTTCACCGTG
ACCCTGCCCC GGCAGCCCGA CCCAGAACAG AAAGGCGATG CCCCGGCATG A
 
Protein sequence
MIHKRLSPEI MPNNTGHPAP QDPRELQQAL VEVSLRLERR VRELEALVSV TERINRGLLL 
DDVLDEIYDS FRPIIPYDRI GVALLEENDQ GEQLVRSRWA RSDHRTAHIG PGYAAPLAGS
SLERILKTGE PRIINDLQEH LLHNPRSKST QQILKDGVRS SLTCPLVAMG RPVGFIFFSS
NAPNTYRNAH IATFQQIAGQ LATILEKSRL YERLMELNDL KNRFLGVAAH DLRNPLGVLN
GYIDLLRQEA LGPLNEAQQE VMGVMADVAE RMSALVEDLL DVSAIESGQL ELEREPLDLN
RFLQGQAHAQ GLIAQGKSIR IVLDIPEPLP TVAVDSRRLG QVLDNLIANA VKFSPRDSTI
TLGGRADDQS VRISVSDQGP GIPAEERAQL FQPFRRGSNA PTADEKSTGL GLSIVQKLVQ
AHGGQVAVDA APGGGARFTV TLPRQPDPEQ KGDAPA