Gene Mlg_1670 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_1670 
Symbol 
ID4268902 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp1911115 
End bp1912455 
Gene Length1341 bp 
Protein Length446 aa 
Translation table11 
GC content69% 
IMG OID638126428 
Productsignal transduction histidine kinase, nitrate/nitrite-specific, NarQ 
Protein accessionYP_742506 
Protein GI114320823 
COG category[T] Signal transduction mechanisms 
COG ID[COG3850] Signal transduction histidine kinase, nitrate/nitrite-specific 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones46 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGTAACG TGAATGCGCC CGCATGGTTC CATTCCGCGG CTCGGTCGAT GGTGCCTATG 
GGAAAACGCC TGCCCGTCTC GCTCGTGGAG GTGCCGGCGG AGAGCAACGC GGTTCCGCCG
CATGACCGGG CGGGCCCTGA GAACGACGAG ATCGTGGCCC TGCGCGAGCG TGCGCGGGCC
ATGGAAGTGC TGTATACGCT GAGCACCTCC AGCGAACGCT GTGATGATCT GGAAACCCTG
CTCAGTGGAG CGCTGGAGCA ATTGATGCGC GCCACCGGGG CCACTGCCGG CGTGGTCCGC
CGTATGGACG CAGAGGGCAC GCTGCGCCTG GTGGCCGCCC GTGGCGTCGG CGACGACTAC
CCCGAAAACG ACTGTGGGGT CCGGCCGGAG GACTGCGCCT GTGGTGAGGC CGCCCTCCGC
GGATGTCTGA TCTCCAACCC CGGTATGAAG GGTTGCCGCC ATAAGCGCTA CCAGCGCCCC
ATCGGGCACG AGGACCTGCA CCTGCTGGCG GTGCCGTTGA CCGCTGACGA CCGCCGCCTG
GGCGTGTTCA GCCTGTTCCT CGAGCCGGTC ATGCTGCAGC AGTGGCAGCA GCTCAATCTG
CACCGCATGC TGCAGATGGC CGGCGAGCAC CTCGGGCTGG CGATGGAGCG CATCCGGCGC
GAGAGCGAGG CCCGCCTGCA ATCGCTGGAG CAGGAGCGCA GCCAGATCAC CCACGAACTC
CACGATTGCC TCGCCCAGCG GTTGGCGGCG TTGGGCCTGG AGGTGCGCAA CCTGGAGGCC
AGCCACGGCG GTGGACGGGC GCCGGGCGGG CTGCGTGCCG GCCTGCGCCA TGTGCGGCGC
GGCCTGGACC AGGCCTATGG CGAACTGCGT CAACTCATGG GGCAATTCCG TATCGCCCTG
GAGGGGGGTG GCCTGGAGCC CGCGCTCAAA CGGCTGGTGC ACCGTTTCCA GCGCGATAGC
GGCATCCGGG TGCTGCTCAG CCATGACTGG CCGCGGGGAC GGCTCAGTCC CGACCAAGAG
TTCCAGGTCC TGCGTATCGT GCAGGAGGCG CTGAACAATG TCAGGAACCA CAGCGGCGCC
CGCCATGTTC AGGTGGCCTT GCACCGGATT GGCTGCGACA TGGAGCTGGT GGTGGAGGAC
GACGGGCGCG GTTTCGCCGA TTCGCCGCCG GATCACCGTG ACGGTGACGA CGGCCACCAC
TTGGGCCTGG GCGTGATGCG CGACCGCGCC GACGCCATCG GCGGCCATCT GGAGATCCAA
AGCGAACTGG GGGAGGGGAC CCGGATCGCC GTCTATCTGC CTTCCTGCGG CCGTTGCCCC
GGTCGGGAGG AACGGGGCTG A
 
Protein sequence
MGNVNAPAWF HSAARSMVPM GKRLPVSLVE VPAESNAVPP HDRAGPENDE IVALRERARA 
MEVLYTLSTS SERCDDLETL LSGALEQLMR ATGATAGVVR RMDAEGTLRL VAARGVGDDY
PENDCGVRPE DCACGEAALR GCLISNPGMK GCRHKRYQRP IGHEDLHLLA VPLTADDRRL
GVFSLFLEPV MLQQWQQLNL HRMLQMAGEH LGLAMERIRR ESEARLQSLE QERSQITHEL
HDCLAQRLAA LGLEVRNLEA SHGGGRAPGG LRAGLRHVRR GLDQAYGELR QLMGQFRIAL
EGGGLEPALK RLVHRFQRDS GIRVLLSHDW PRGRLSPDQE FQVLRIVQEA LNNVRNHSGA
RHVQVALHRI GCDMELVVED DGRGFADSPP DHRDGDDGHH LGLGVMRDRA DAIGGHLEIQ
SELGEGTRIA VYLPSCGRCP GREERG