Gene Mlg_1937 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_1937 
Symbol 
ID4270138 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp2202254 
End bp2203879 
Gene Length1626 bp 
Protein Length541 aa 
Translation table11 
GC content66% 
IMG OID638126691 
Productmethyl-accepting chemotaxis sensory transducer 
Protein accessionYP_742769 
Protein GI114321086 
COG category[N] Cell motility
[T] Signal transduction mechanisms 
COG ID[COG0840] Methyl-accepting chemotaxis protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.158854 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones39 
Fosmid unclonability p-value0.960341 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATCTTG TCAATCGCTT CAACAACTGG CCAATCTGGG TGCGGCTGCT GCTGGCCATC 
TGGCTCATGC TGGTGGTCGC CTGGAGTGCG CTCATCGCCT GGTCGGTCTA CGAACAGCGC
AACACCGCGC TCACCCAGGC GGTCACCTTC AGTGAGACCA TGAATGAAAT GACCATGGCC
GGTCTCACCA CGCTGATGAT CCTCTGGAAG ATGGATGACC GGGACGAGTT CCTGGACCAG
ATCCTCGCCC TCCACAACAT GGAGGACCTG CGCGTGCTGC GCTCCGAGGC CGTCAGCCGG
CAGTTCGGCG AGGGGTTGGC GGTGAGCCAG CCGGCCAATG CCGTCGAGGA GCAGGTGCTG
GCCAGCGGTG AGGCGCATAT CGAGGTGGAG CCGGGGGGTG ATCACCTTTA CGCGGTCATC
CCGAACGTCA ACGCGCGGGA CTACCTGGGT AAAAACTGCA TGGCCTGTCA CGCCATGGCC
GAGGAGGACG AGGTGTTGGG GGCGGTCAGC ATGCGCATCG GCCTGCAGGA GGTGAACCAG
GCCGTGTTCC GCTTTGGCAC CCTGGTCTTC GGCCTGGCGG TGTTGCTCAG CATCCCCTTG
CTGGGCGTGG TCTATCTGTT CATCAAACGC TTCGTCTCCG CGCCGCTCAG CGATATGACC
GAGCGGCTGG AGGACATCGC CAGCGGTGAC GGGGATCTGA CCCGCCGGCT GCCCGACCGG
GGCACGGATG AGATCGGCAA GGCCTCGCTG GCCTTCAACC ACACCATGGA CAAGTTCCAC
GACCTGGTGA AACGGGTGGT CAACACCGCC AGCCGGCTGA CCGATGCCGC AGACCGGGTC
TCGTCCGTGA CCGTGCAGAC CAACCAGGGG GTGGAGTCCC AGCGCGAGCA GATCGAACAG
GTGGCGACGG CCATGAATGA GATGACCGCC ACCGCCCAGG AGGTGGCCCG CAACGCGCAG
GACGCCGCCC AGGCCACCCG CGCCGGTGCC GAGGCCTCGG AGCGGGGCCA GGATGTGGTG
GAGCGCACCA TCGCCAGCAT CGACCGGTTG GCCGAGGAGG TGCAGAAGGC CTCTGAGGTG
ATCCGCAAGC TGGCAGTGGA CAGCGAGCGC ATCGGTGAGG TCTCCGACCT GATTCGGGAG
ATCGCCGAAC AGACCAACCT GCTGGCGCTC AACGCCGCCA TCGAGGCGGC CCGCGCCGGC
GACGCCGGTC GCGGCTTTGC CGTGGTCGCC GACGAGGTCC GCTCGCTGGC CAGCCGGACC
CACGAGTCCA CCCAGAGCAT CCAGGAGATG ATCAGCGGCC TGCAGCAGGA GACCCAGACC
GCGGTCCAGG TCATGGAGGC CGGTTACGGC CAGGCGCAGC AGACCGTGGG GCAGGCGGGC
GATGCCGGCG AGGCTTTGAA GGAGATCGCC TCCTCGGTAC AGACCATCAG CAGCTCCAAC
GAGCAGATCG CCAGTGCGGC TGAGGAGCAG AGCGCGGTGG CCGAGGAGAT CAACCGCAAT
ATCACCAGCA TCACCGATGT GGCCGAGCAG ACGGCCAACG GGTCCCGCGA GACCGCCACT
GCTGGCGATG AGCTGGCGAA ACTGGCCCGG GAGCTGAAGG GGCTGGTGGG GCAGTTCAAG
GTCTGA
 
Protein sequence
MNLVNRFNNW PIWVRLLLAI WLMLVVAWSA LIAWSVYEQR NTALTQAVTF SETMNEMTMA 
GLTTLMILWK MDDRDEFLDQ ILALHNMEDL RVLRSEAVSR QFGEGLAVSQ PANAVEEQVL
ASGEAHIEVE PGGDHLYAVI PNVNARDYLG KNCMACHAMA EEDEVLGAVS MRIGLQEVNQ
AVFRFGTLVF GLAVLLSIPL LGVVYLFIKR FVSAPLSDMT ERLEDIASGD GDLTRRLPDR
GTDEIGKASL AFNHTMDKFH DLVKRVVNTA SRLTDAADRV SSVTVQTNQG VESQREQIEQ
VATAMNEMTA TAQEVARNAQ DAAQATRAGA EASERGQDVV ERTIASIDRL AEEVQKASEV
IRKLAVDSER IGEVSDLIRE IAEQTNLLAL NAAIEAARAG DAGRGFAVVA DEVRSLASRT
HESTQSIQEM ISGLQQETQT AVQVMEAGYG QAQQTVGQAG DAGEALKEIA SSVQTISSSN
EQIASAAEEQ SAVAEEINRN ITSITDVAEQ TANGSRETAT AGDELAKLAR ELKGLVGQFK
V