Gene Mlg_2009 details


Gene Information

Locus tag: Mlg_2009
Symbol:
ID: 4269609
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Alkalilimnicola ehrlichii MLHE-1
Kingdom: Bacteria
Replicon accession: NC_008340
Strand:
Start bp: 2280445
End bp: 2281857
Gene Length: 1413 bp
Protein Length: 470 aa
Translation table: 11
GC content: 60%
IMG OID: 638126765
Product: PAS/PAC sensor signal transduction histidine kinase
Protein accession: YP_742841
Protein GI: 114321158
COG category: [T] Signal transduction mechanisms
COG ID: [COG4191] Signal transduction histidine kinase regulating C4-dicarboxylate transport system
TIGRFAM ID: [TIGR00229] PAS domain S-box
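
The coordinate and length fields in this record can be cross-checked against each other. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the gene length includes one terminal stop codon. The function names `span_length` and `coding_codons` are hypothetical helpers, not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2280445
end_bp = 2281857
gene_length_bp = 1413
protein_length_aa = 470


def span_length(start, end):
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def coding_codons(length_bp):
    """Number of amino-acid codons, assuming one terminal stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


# Both comparisons hold for the values listed in this record.
print(span_length(start_bp, end_bp) == gene_length_bp)
print(coding_codons(gene_length_bp) == protein_length_aa)
```

Under those assumptions, 2281857 - 2280445 + 1 gives 1413 bp, and 1413 / 3 - 1 gives 470 aa, which agrees with the listed lengths.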


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 42
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGTGAAC GTTCAAGACA TCGACGTGGG GCGGAGACCC TGCTCAATCA GGCCATGTCG 
CCCGGGGCAG GGCAGACCGA TTCCGAGGGC AGCTTCGACG AGCAAACCTG GCTGGAAGTC
ATCCATCGGA TGGATGAGGT CTACAGCGAA CTGCTGGAGA ACGAGACGGA GCTCGAGCGC
AAGAATGCAG AACTGGAGCG CTCCCAGCAG TTTCTCTTTA GTGTGCTCTC CGCCATGTCG
GATGTCCTGG TGGTGTGCGA TCACCAGGGG CGGGTCCAGC GGGTGAACCC GGCTATGGAA
CAGCTGGTAG GAGAGGGTAG CGATGTACTG GTGGGGCGGC CGCTGCGGGA TCTCCTTGCC
GATCAGGCAT CCCGTGATCG CCTGTTGACC AGGGGCGTTT TTGAGCATGC CGAGGCTGTG
GCGGTTAATG ATTGCGAGGT CAACTTCGTT GATCACCGGG GCGGCCATAA CCCGGTGGCG
GTGAACTGTG CGCCTCTGCG TGACCGGATC GGACGGATTC AGGGATTGGT TTTGATCGGG
CGGCCGGTCG GCGAACTGCG ACAGGCTTAC CATGACTTAA GTGAGGCCCA TGAGAGCTTG
AAGCGGACCC AACAGCAACT CATTCAAAGC GAAAAGCTGG CATCGATCGG TCAGCTGGTT
GCCGGTGTGG CCCATGAGCT GAACAATCCC ATTAGCTTTG TCGTGGGTAA TGCCTTCGCC
CTGAAGCGTT ATCTCGGGCG TCTTGAGGAG TATCTGGGCG CCATCCACCG GGGAGTGAGC
GCGGAAGAGC AGCAACGCCT GAGGAGTAAA CTCCGCATCG ACTACGTCCT GGAGGATATT
GCGCCGCTAC TCGAGGGGAC AGTCGAAGGG GCTGAGCGCA CGCACGATAT CGTGGATGCC
TTGAAACGTT TTGCAGCAGT AGACCGGGAT GCGGATCAGG TGTTTGATCT GCGGGAGGTG
GTGGAGCGAT CAGTGCATTG GGTGGAGAAG GCGGTATCCC ACCCCGTTTC CGTTGTCCAT
GCCTTGCAGC AGCCGTACTG GGTCCGTGGC TCACAGGGGC AGATGCAGCA GGTGGTCGTT
AACCTCGTGA GCAACGCGGT TGATGCCTTG CAGGGGTGCC CGGATCCAGT GTTGCGAATC
TCCGGTCATG TGGTGGATGG AATGGTAGAG CTCTGGTTCC ACGATAACGG GCCAGGGATT
GATGAGGAGG CATTGGGGCG GATATTTGAT CCCTTTTTTT CCACCAAACC CGTGGGGAAG
GGAACCGGAC TGGGGCTCTC GGTCAGCTTT GGGATAGTGG AGCGTCATGG GGGCCAGCTA
AGCGGCGACA ACCATGTCCA AGGCGGGGCG TTGTTCCGCC TGCGGCTGCC GTTGGAGACA
GATGAGAGCA GCGTGGGGTG TCCGGATGAG TGA
 
Protein sequence
MGERSRHRRG AETLLNQAMS PGAGQTDSEG SFDEQTWLEV IHRMDEVYSE LLENETELER 
KNAELERSQQ FLFSVLSAMS DVLVVCDHQG RVQRVNPAME QLVGEGSDVL VGRPLRDLLA
DQASRDRLLT RGVFEHAEAV AVNDCEVNFV DHRGGHNPVA VNCAPLRDRI GRIQGLVLIG
RPVGELRQAY HDLSEAHESL KRTQQQLIQS EKLASIGQLV AGVAHELNNP ISFVVGNAFA
LKRYLGRLEE YLGAIHRGVS AEEQQRLRSK LRIDYVLEDI APLLEGTVEG AERTHDIVDA
LKRFAAVDRD ADQVFDLREV VERSVHWVEK AVSHPVSVVH ALQQPYWVRG SQGQMQQVVV
NLVSNAVDAL QGCPDPVLRI SGHVVDGMVE LWFHDNGPGI DEEALGRIFD PFFSTKPVGK
GTGLGLSVSF GIVERHGGQL SGDNHVQGGA LFRLRLPLET DESSVGCPDE
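
The gene and protein sequences above can be compared by translating the gene codon by codon. The sketch below is a minimal, hedged example: it builds the codon-to-amino-acid map of the standard genetic code, which translation table 11 shares for internal codons (the two tables differ only in permitted start codons). The names `CODON_TABLE` and `translate` are hypothetical helpers, and only the first and last sequence blocks of the record are used as input.

```python
from itertools import product

# Standard codon-to-amino-acid assignments, listed in T, C, A, G order.
# Translation table 11 uses the same assignments for internal codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate an in-frame DNA string, stopping at the first stop codon.

    Whitespace (the 10-base grouping used in the record) is ignored.
    """
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence, as grouped in the record.
print(translate("ATGGGTGAAC GTTCAAGACA TCGACGTGGG"))  # MGERSRHRRG

# Last 33 bases of the gene sequence, ending in the TGA stop codon.
print(translate("GATGAGAGCA GCGTGGGGTG TCCGGATGAG TGA"))  # DESSVGCPDE
```

The two outputs match the first and last ten residues of the listed protein sequence. The final block is in frame because it begins at base 1381, a multiple of 3 past the start.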