Gene EcSMS35_1918 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_1918 
SymbolnarX 
ID6145880 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp1940010 
End bp1941806 
Gene Length1797 bp 
Protein Length598 aa 
Translation table11 
GC content53% 
IMG OID641616794 
Productnitrate/nitrite sensor protein NarX 
Protein accessionYP_001743970 
Protein GI170682187 
COG category[T] Signal transduction mechanisms 
COG ID[COG3850] Signal transduction histidine kinase, nitrate/nitrite-specific 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0184147 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones40 
Fosmid unclonability p-value0.168413 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTTAAAC GTTGTCTCTC TCCGCTCACC CTGGTTAATC AGGTTGCGCT TATTGTGTTG 
CTTTCTACTG CTATTGGACT GGCAGGGATG GCAGTTTCTG GCTGGCTGGT GCAAGGCGTT
CAGGGCAGCG CCCATGCGAT CAACAAAGCG GGATCGCTGC GCATGCAAAG TTACCGTCTG
TTGGCGGCAG TGCCATTAAG CGAGAAAGAC AAGCCCTTAA TTAAAGAAAT GGAACAAACG
GCATTTAGCG CCGAGTTGAC TCGAGCAGCG GAACGAGACG GACAACTGGC GCAATTACAG
GGTTTACAAG ATTACTGGCG TAATGAACTG ATCCCTGCGC TGATGCGTGC ACAAAACCGC
GAAACGGTGT CAGCGGATGT CAGCCAGTTT GTTGCCGGGC TTGATCAGCT GGTATCTGGT
TTTGACCGCA CCACGGAAAT GCGCATTGAG ACAGTGGTAC TGGTCCATCG GGTAATGGCG
GTATTTATGG CACTTTTACT GGTGTTCACT ATTATCTGGT TGCGGGCGCG ACTGCTACAA
CCGTGGCGGC AACTGCTGGC AATGGCGAGT GCCGTCAGCC ATCGCGATTT TACCCAACGC
GCAAACATCA GCGGGCGCAA CGAAATGGCG ATGCTTGGAA CTGCATTGAA CAATATGTCT
GCAGAACTGG CCGAAAGTTA TGCCGTACTT GAGCAGCGGG TTCAGGAGAA AACTGCCGGG
CTGGAGCATA AAAATCAGAT CCTCTCTTTT TTATGGCAGG CTAACCGCCG TTTGCATTCC
CGCGCCCCGC TGTGTGAACG CCTGTCACCG GTACTCAACG GCTTACAGAA TTTAACCCTG
CTACGTGATA TCGAACTGCG GGTGTATGAC ACTGATGATG AAGAGAATCA TCAGGAGTTT
ACCTGCCAGC CAGATATGAC TTGTGATGAT AAAGGCTGCC AGCTCTGCCC GCGCGGCATA
TTACCCGTTG GCGATCGCGG CACAACCCTG AAGTGGCGGC TGGCTGACTC TCATACGCAG
TACGGTATTT TGCTGGCGAC CCTGCCGCAG GGGCGTCATC TTAGCCATGA TCAACAACAA
CTGGTGGATA CCCTGGTTGA ACAACTCACC GCCACGCTGG CGCTGGATCG CCATCAGGAA
CGTCAGCAAC AGTTGATCGT GATGGAAGAG CGTGCCACCA TTGCGCGCGA ACTGCATGAT
TCTATTGCCC AATCTCTCTC TTGCATGAAG ATGCAGGTGA GTTGTTTACA GATGCAGGGC
GATGCGCTGC CAGAAAGCAG CCGCGAACTG TTAAGTCAGA TCCGTAACGA ACTGAATGCA
TCCTGGGCGC AGTTGCGTGA ATTGCTCACC ACATTCCGTT TGCAGCTCAC CGAGCCTGGA
TTACGTCCGG CGCTGGAGGC GAGTTGCGAA GAGTACAGCG CCAAATTTGG CTTCCCGGTG
AAGCTGGATT ATCAATTGCC GCCTCGTCTG GTGCCTTCGC ATCAGGCAAT CCACTTGTTG
CAAATTGCCC GTGAGGCATT AAGTAACGCC CTCAAACATT CGCAAGCGAG TGAGGTCGTG
GTGACGGTGG CGCAAAACGA TAATCAGGTC AAACTGACCG TCCAGGATAA CGGCTGCGGC
GTGCCTGAAA ATGCCATCCG CAGCAATCAC TACGGCATGA TAATAATGCG CGACCGTGCG
CAAAGTTTAC GAGGCGATTG CCGCGTCCGC CGTCGTGAAT CAGGTGGCAC CGAAGTGGTT
GTCACCTTTA TTCCCGAAAA AACTTTCACA GACGTCCAAG GAGATACCCA TGAGTAA
 
Protein sequence
MLKRCLSPLT LVNQVALIVL LSTAIGLAGM AVSGWLVQGV QGSAHAINKA GSLRMQSYRL 
LAAVPLSEKD KPLIKEMEQT AFSAELTRAA ERDGQLAQLQ GLQDYWRNEL IPALMRAQNR
ETVSADVSQF VAGLDQLVSG FDRTTEMRIE TVVLVHRVMA VFMALLLVFT IIWLRARLLQ
PWRQLLAMAS AVSHRDFTQR ANISGRNEMA MLGTALNNMS AELAESYAVL EQRVQEKTAG
LEHKNQILSF LWQANRRLHS RAPLCERLSP VLNGLQNLTL LRDIELRVYD TDDEENHQEF
TCQPDMTCDD KGCQLCPRGI LPVGDRGTTL KWRLADSHTQ YGILLATLPQ GRHLSHDQQQ
LVDTLVEQLT ATLALDRHQE RQQQLIVMEE RATIARELHD SIAQSLSCMK MQVSCLQMQG
DALPESSREL LSQIRNELNA SWAQLRELLT TFRLQLTEPG LRPALEASCE EYSAKFGFPV
KLDYQLPPRL VPSHQAIHLL QIAREALSNA LKHSQASEVV VTVAQNDNQV KLTVQDNGCG
VPENAIRSNH YGMIIMRDRA QSLRGDCRVR RRESGGTEVV VTFIPEKTFT DVQGDTHE