Gene Mlg_1620 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_1620 
Symbol 
ID4269352 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp1849446 
End bp1851125 
Gene Length1680 bp 
Protein Length559 aa 
Translation table11 
GC content69% 
IMG OID638126377 
Productperiplasmic sensor signal transduction histidine kinase 
Protein accessionYP_742456 
Protein GI114320773 
COG category[T] Signal transduction mechanisms 
COG ID[COG4191] Signal transduction histidine kinase regulating C4-dicarboxylate transport system 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.96029 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones50 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGCGCGTC CACTGAACCA CACCATCACC GCGCGCCCCG GCCGGCCGTC CACCGCCGGC 
TGGGCCCGCC TGATGCTGCT CAGCGGCTCA TTGGTCGGCC TCTACCTCCT GGTTGCCGTC
TGGGGGTTTG CCTTCACCCC CGACCAATAC GGAATCCCGC TGGTCTGGCC GGCCACCGGC
GTGGGGCTGG CCTTTGTCTT CCTCTACGGC TATCGCCTGG TGCCCGCCGT CGGGCTGGCG
GCGGCGCTGG TCGGTGTCTG GTTCACCGGC GATCTCGTCG CCCCGCTGGA ACAGGCCTTC
ACGGTGGTAG TCACGATGCT GGCCACGGCC TTGGGCACGG CAATCCTGCG CTACTGCCGC
TTCGATCCCA GGATGGAACG GCTGCGCGAC CTGGGCCTGC TGCTAGCGGC CGGCGGCGGC
GTCAGCTCCG GGCTCGCCGC CACCGTCGGG GCCTACAGCC TGGCGGCCAG TCCCGGCCCC
CTGAGCCTCG CCCAAACCTG GTGGGTCTGC TGGAGCGCGG ACCTGATGGG GCTGGTACTG
ATCTCGCCGT TCCTGTTCAC CCTGCTGGGC GGACGCCTGG TCATGCCCTC AGGCGCGGAC
CTGCGCATTG GAATGATGCT GGTCACCGCC ACCCTGGCCA CCGGCGCGGT GGCTTACCTG
TTGGCGCTCG ACCTGACCCT GGCGTTGCCC CTCTCCTACG CCGTCTTCCC GCTCGTCATG
TATGCAGCAT TGCGCTGCCC GGCGCCCATC ACCAGTGGCC TCATCCTGGT GTTCGGGGGG
CTGGCGCTGA CCGCCACCGG GCTGGGTCAC GGCCCCTACG CCGCGCTGGG GCTGGAGCGG
AGCCTGTTGG CACTCAATGC CCAGTTGGGG TTGCTGGTCC TCACCGGCCT TTCGCTGACC
GCGATCAGCG CGGAACGGGA GGCCGCTGAA AGTCGGGCGC GCCAACACCT GGAGGATCTG
GCACGCGCCG GGCGCATCAA CCTGATGGGT CAGCTCTCGA CCACTCTCGC CCACGAACTC
AACCAACCGC TGTGCGCGCT GAGCACCTAC GCCCAGGCCA GCCGCCGGCT GTTGGCCCGG
GGTGATACCG AAGGCCTCGG CACCGCCCTG GAGCGACTGG AACAGAACGC CCACCGAGCC
GCCCATACGG TCCGGCAAAT CCGCGACTTC GCCGCGCGGC AGGACCTGGC GCATCAGGTG
GTGGCCCCCG CCCACCTGAT TGCCGGGGTG GAACGCCTGA TGGCGCCCGA GTTCACGCGC
CGCGGGATCC ATCTGAAGGT CAACGTCCAG CCCGGCCTGC CCCCGATACG CATCGCCCCC
ATGCAGATCG AGCAGGTGCT GGTCAACCTG CTGCGCAACG CGACGGAAGC CCTCGCCGGC
CGCGCCGACG GGCGGGTGCG CCTGGCGGTC TACCGCCGCC GGCGGGAGTT GGTGCTGGAG
GTCATCGACA ACGGCCCAGG CATCCCGCCG GAGCGGCTGT CGCTGCTGTT CGAGCCCTTT
ACCAGCTGGA AGCGGGGCGG CATGGGACTG GGCCTTGCCC TTTCCCGTTC CTTGGTCGAA
TCTCATGGGG GCACATTCAC CGCCCGCAAC GGCAACGCCG GCGGGGCGGA ATTCCGTTTC
ACTCTGCCAT TGGAGGACGA CCATGGCGAC ACCGCGACCT ACGGTACATC TGGTGGATGA
 
Protein sequence
MARPLNHTIT ARPGRPSTAG WARLMLLSGS LVGLYLLVAV WGFAFTPDQY GIPLVWPATG 
VGLAFVFLYG YRLVPAVGLA AALVGVWFTG DLVAPLEQAF TVVVTMLATA LGTAILRYCR
FDPRMERLRD LGLLLAAGGG VSSGLAATVG AYSLAASPGP LSLAQTWWVC WSADLMGLVL
ISPFLFTLLG GRLVMPSGAD LRIGMMLVTA TLATGAVAYL LALDLTLALP LSYAVFPLVM
YAALRCPAPI TSGLILVFGG LALTATGLGH GPYAALGLER SLLALNAQLG LLVLTGLSLT
AISAEREAAE SRARQHLEDL ARAGRINLMG QLSTTLAHEL NQPLCALSTY AQASRRLLAR
GDTEGLGTAL ERLEQNAHRA AHTVRQIRDF AARQDLAHQV VAPAHLIAGV ERLMAPEFTR
RGIHLKVNVQ PGLPPIRIAP MQIEQVLVNL LRNATEALAG RADGRVRLAV YRRRRELVLE
VIDNGPGIPP ERLSLLFEPF TSWKRGGMGL GLALSRSLVE SHGGTFTARN GNAGGAEFRF
TLPLEDDHGD TATYGTSGG