Gene Mlg_1005 details


Gene Information

Locus tag: Mlg_1005
Symbol:
ID: 4268373
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Alkalilimnicola ehrlichii MLHE-1
Kingdom: Bacteria
Replicon accession: NC_008340
Strand:
Start bp: 1144073
End bp: 1146007
Gene Length: 1935 bp
Protein Length: 644 aa
Translation table: 11
GC content: 66%
IMG OID: 638125756
Product: signal transduction histidine kinase, nitrate/nitrite-specific, NarQ
Protein accession: YP_741848
Protein GI: 114320165
COG category: [T] Signal transduction mechanisms
COG ID: [COG3850] Signal transduction histidine kinase, nitrate/nitrite-specific
TIGRFAM ID:
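
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the last codon of this unspliced CDS is a stop codon that is not counted in the protein length. The variable names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 1144073
end_bp = 1146007

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS is unspliced and ends with a stop codon,
# which is excluded from the protein length.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1935 644
```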


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 47
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCTATCA GCCGTCTCTA CAATCGCCTG CTCCACCCCA CCCTGGCCGG GTCGCTCTTC 
GCGTTGCTGG TGGGAGTGGC CACCATCGGC TTCCTGGGCA TGCTGCATGG GGCGTTCGTC
GCCAAAACGG CGCAGCATGA TGCGGCGGCG ATCAACGTGG CCGGGTCCCT GCGCATGCAG
TCCTACCGCA TTGCCACTAC CCTGCTGGAC GACCAGGGCC CGCAGGCGGG TGCCAAGACG
TTGGATGCGG AGGTTGCTGC CTTCGAGCGG CGGCTGCACA GCCCGGAGCT GCTGCGACCC
ATACCGGCAA ACCCGGACGA CCCGGTTCGA GAGGCCTGGG AGGAGGTCCA CCGGCAGTGG
ACCACTGAGT TAAAGCCGCT GCTCGTCGAT GGGAGCAGCG CCGGTTCGCC GGTCCGCTAT
CTGGACCAAG TGGACCGATT CGTGGGCACG GTGGACAACC TGGTCACGGC GGCTCAGTCG
GCCGCTGAGG CGCGCATTCA CAACTCCCAG TCGGTGCGGG TGCTGGCCAT GGCGTTGCTC
AGTGGCCTGG TGCTGTACGG GCTCTACCGG CTTCACATGC GCCTGGTGCT CCCCGTGCAG
CAGTTGAACT ACGTGGCCGG TCGCCTGATG AAGGGGGATT TCGAGGCCCG CGCCCGCTAC
CTGCCGAACG ATGAGCTGGG CATGTACGCC CGGACCTTCA ACAGGATGGC GGACACCCTC
ACCGATATGC AGCGCAGTCT CGCCCACCGG GTGAGCGAGA AGACCGCAGA GCTTCGGCGG
CGCAACAGTG CCTTGGAGCT GGTCTACAAT GCCAGTCGTG ATCTCAGTGC CGGCGCACTG
GAGTCCAGTA ACCTCAAGGC GATGCTCGAC ACCCTGGAGC GGGTGACGGG GCTGGGGCCG
CTGAGCATCT GCCTGGCCCA GCCCGGTGCG GACAAGTCCT TCGAGACCCT CTCCACCTCC
GCTGAGAGTC GCCCGGCCCG CTGCAGCGCT CCGGACTGCA AGGGTTGTAT GAATGTCGTA
CCCCGTCACG GGGGCGGCGT GGTTTGCCCG GGTATGCTGG CTCTGCCCCT GGAGGACCGG
AATGTCCGCT ACGGTCTGCT GCAGGTGGAG TACACCCCCG GCGACCAGCC CAGGGGGTGG
CAGCTGCACC TGGCGGAGAC GGTGGCCAGC CATATCGCCG CCGCCTGTGC CCGCGAGCGC
GAATTGGATG CCGAGCACCG CATTGTGTTG ATGGAGGAGC GGGCGGTGAT TGCCAGGGAA
CTGCACGACT CGCTGGCCCA GGCACTGTCC TACATGAAGA TCCAGGTGGC ACGGCTGCAG
GGGTTGATGC GCAAGGAGCA CGAACCCGCC CAGCTCGAGG CCATCCTCTC TGAGTTGCGT
GATGGGCTCA ATTCCGCCTA TGAGCAGTTG CGCGAGCTCC TGACCACGTT CCGCCTGGGG
ATGGACTCCC CCGGGCTGGA GTCTGCGCTG CGCAAGGCCG TCAAGGAGTT CTCCGAGCGG
GGTGATATCC CCATCCGTCT GGACTACCGT CTGAGCCACT GGCCCTTGAA CGCGAACGAG
GAGATCCATC TCCTGCAAAT CGCGCGTGAG GCGCTGGCGA ACGTGGTTCG GCACAGTCAG
GCCACCCGGG CGGACGTGGT GGTGGCCCCG GGGTCGAACC ACGACGTGGT GCTGGAAGTC
CTCGACAACG GCGTCGGGCT GGCGGCTGTT TCCGAGGAAC CGGGGCACCA CTATGGCACC
GCCATCATGC AGGAACGGGC GGAGGGCCTT GGAGGGATGC TGCGACTCGG TAACCGGGCC
GGGGGGGGGA TGCGGGTGAT GCTGACCTTC CGGCCGGAGT CCATGGCGCG CGCTGTCGCG
TTCAGCGCCA GCCAGAACAA CAATCAAGGG GTTCAGGATG AGGACATTCG GGGAGCCTAT
GCAGACTACC GCTGA
 
Protein sequence
MAISRLYNRL LHPTLAGSLF ALLVGVATIG FLGMLHGAFV AKTAQHDAAA INVAGSLRMQ 
SYRIATTLLD DQGPQAGAKT LDAEVAAFER RLHSPELLRP IPANPDDPVR EAWEEVHRQW
TTELKPLLVD GSSAGSPVRY LDQVDRFVGT VDNLVTAAQS AAEARIHNSQ SVRVLAMALL
SGLVLYGLYR LHMRLVLPVQ QLNYVAGRLM KGDFEARARY LPNDELGMYA RTFNRMADTL
TDMQRSLAHR VSEKTAELRR RNSALELVYN ASRDLSAGAL ESSNLKAMLD TLERVTGLGP
LSICLAQPGA DKSFETLSTS AESRPARCSA PDCKGCMNVV PRHGGGVVCP GMLALPLEDR
NVRYGLLQVE YTPGDQPRGW QLHLAETVAS HIAAACARER ELDAEHRIVL MEERAVIARE
LHDSLAQALS YMKIQVARLQ GLMRKEHEPA QLEAILSELR DGLNSAYEQL RELLTTFRLG
MDSPGLESAL RKAVKEFSER GDIPIRLDYR LSHWPLNANE EIHLLQIARE ALANVVRHSQ
ATRADVVVAP GSNHDVVLEV LDNGVGLAAV SEEPGHHYGT AIMQERAEGL GGMLRLGNRA
GGGMRVMLTF RPESMARAVA FSASQNNNQG VQDEDIRGAY ADYR
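
The relationship between the gene sequence and the protein sequence can be spot-checked with a short script. This is a minimal sketch rather than the tool that produced this record: the helper names (`clean`, `translate`, `gc_percent`) are hypothetical, and the codon table is the standard one, whose sense-codon assignments translation table 11 shares (table 11 differs in the start codons it permits, which does not matter here because the gene begins with ATG). Spaces and line breaks in the 10-base blocks are stripped before processing.

```python
from itertools import product

# Standard codon table in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and newlines used to block the sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate codon by codon in frame 1, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First 30 bases and last 15 bases of the gene sequence above.
first_block = "ATGGCTATCA GCCGTCTCTA CAATCGCCTG"
last_block = "GCAGACTACC GCTGA"

print(translate(first_block))  # MAISRLYNRL
print(translate(last_block))   # ADYR
```

Passing the full gene sequence to `translate` and `gc_percent` in the same way would let the complete protein sequence and the reported GC content be compared against the values in this record.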