Gene EcHS_A1331 details


Gene Information

Locus tag: EcHS_A1331
Symbol: narX
ID: 5593632
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli HS
Kingdom: Bacteria
Replicon accession: NC_009800
Strand:
Start bp: 1324261
End bp: 1326057
Gene Length: 1797 bp
Protein Length: 598 aa
Translation table: 11
GC content: 53%
IMG OID: 640920488
Product: nitrate/nitrite sensor protein NarX
Protein accession: YP_001458049
Protein GI: 157160731
COG category: [T] Signal transduction mechanisms
COG ID: [COG3850] Signal transduction histidine kinase, nitrate/nitrite-specific
TIGRFAM ID:
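
The coordinate and length fields above are consistent with each other, which the following minimal sketch checks. It assumes the start and end positions are 1-based and inclusive and that the gene length includes one terminal stop codon; both are assumptions about this record's conventions, and the function names are illustrative only.

```python
# Values copied from the record above.
START_BP = 1324261
END_BP = 1326057


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming the CDS ends with exactly one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1797 598
```

Under these assumptions the computed values match the listed Gene Length (1797 bp) and Protein Length (598 aa).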


Plasmid Coverage information

Num covering plasmid clones: 33
Plasmid unclonability p-value: 0.150699
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCTTAAAC GTTGTCTCTC TCCGCTCACC CTGGTTAATC AGGTTGCGCT TATTGTGTTG 
CTTTCTACTG CTATTGGACT GGCAGGGATG GCGGTTTCTG GCTGGCTGGT GCAAGGCGTT
CAGGGCAGCG CCCATGCGAT CAACAAAGCG GGATCGCTGC GCATGCAAAG TTACCGTCTG
TTGGCAGCAT TGCCATTAAG CGAGAAAGAC AAGCCCTTAA TTAAAGAGAT GGAACAAACG
GCATTTAGCG CCGAGTTGAC TCGAGCAGCA GAACGAGACG GACAACTGGC GCAATTACAG
GGTTTACAAG ATTACTGGCG TAATGAACTG ATCCCTGCAC TGATGCGTGC ACAAAACCGC
GAAACGGTGT CAGCGGATGT CAGCCAGTTT GTTGCCGGGC TTGATCAGCT GGTATCTGGT
TTTGACCGCA CCACGGAAAT GCGCATTGAG ACAGTGGTAC TGGTCCATCG GGTAATGGCG
GTATTTATGG CACTTTTACT GGTGTTCACT ATTATCTGGT TGCGGGCGCG ACTGCTACAA
CCGTGGCGGC AACTGCTGGC AATGGCGAGT GCCGTCAGTC ATCGCGATTT TACCCAACGC
GCGAACATCA GCGGGCGCAA CGAAATGGCG ATGCTTGGAA CTGCATTGAA CAATATGTCT
GCAGAACTGG CCGAAAGTTA TGCCGTACTT GAGCAGCGGG TTCAGGAGAA AACCGCCGGG
CTGGAGCATA AAAATCAGAT CCTCTCTTTT TTATGGCAGG CTAACCGCCG TTTGCATTCC
CGCGCCCCGC TGTGTGAACG CCTGTCACCT GTACTCAACG GCTTACAGAA TTTAACCCTG
CTACGTGATA TCGAATTGCG GGTGTATGAC ACTGATGATG AAGAGAATCA TCAGGAGTTT
ACCTGCCAGC CAGATATGAC TTGTGATGAT AAAGGCTGCC AGCTCTGCCC GCGCGGCGTA
TTACCCGTTG GCGATCGCGG CACAACCCTG AAGTGGCGGC TGGCTGACTC ACATACGCAG
TACGGTATTT TGCTGGCGAC CCTACCGCAG GGGCGTCATC TTAGCCATGA TCAACAACAA
CTGGTGGATA CCCTGGTTGA ACAACTCACC GCCACGCTGG CGCTGGATCG GCATCAGGAA
CGTCAGCAAC AGTTGATCGT GATGGAAGAG CGTGCCACCA TTGCGCGCGA ACTGCATGAT
TCTATTGCCC AATCTCTCTC TTGCATGAAG ATGCAGGTGA GTTGTTTACA GATGCAGGGC
GATGCGCTGC CAGAAAGCAG CCGCGAACTG TTAAGTCAGA TCCGTAACGA ACTGAATGCA
TCCTGGGCGC AGTTGCGTGA ATTGCTCACC ACATTCCGCT TGCAGCTCAC CGAGCCTGGA
TTACGTCCGG CGCTGGAAGC GAGTTGCGAA GAGTACAGCG CCAAATTTGG CTTCCCGGTG
AAGCTGGATT ATCAATTGCC GCCTCGTCTG GTGCCTTCAC ATCAGGCAAT CCACTTGTTG
CAAATTGCCC GTGAGGCATT AAGTAACGCC CTCAAACATT CGCAAGCGAG TGAGGTCGTG
GTGACGGTGG CGCAAAACGA TAATCAGGTC AAACTGACCG TCCAGGATAA CGGCTGCGGC
GTGCCTGAAA ATGCCATCCG CAGCAATCAC TACGGCATGA TAATAATGCG CGACCGTGCG
CAAAGTTTAC GAGGCGATTG CCGCGTCCGC CGTCGTGAAT CAGGTGGCAC CGAAGTGGTG
GTCACCTTTA TTCCCGAAAA AACTTTCACA GACGTCCAAG GAGATACCCA TGAGTAA
 
Protein sequence
MLKRCLSPLT LVNQVALIVL LSTAIGLAGM AVSGWLVQGV QGSAHAINKA GSLRMQSYRL 
LAALPLSEKD KPLIKEMEQT AFSAELTRAA ERDGQLAQLQ GLQDYWRNEL IPALMRAQNR
ETVSADVSQF VAGLDQLVSG FDRTTEMRIE TVVLVHRVMA VFMALLLVFT IIWLRARLLQ
PWRQLLAMAS AVSHRDFTQR ANISGRNEMA MLGTALNNMS AELAESYAVL EQRVQEKTAG
LEHKNQILSF LWQANRRLHS RAPLCERLSP VLNGLQNLTL LRDIELRVYD TDDEENHQEF
TCQPDMTCDD KGCQLCPRGV LPVGDRGTTL KWRLADSHTQ YGILLATLPQ GRHLSHDQQQ
LVDTLVEQLT ATLALDRHQE RQQQLIVMEE RATIARELHD SIAQSLSCMK MQVSCLQMQG
DALPESSREL LSQIRNELNA SWAQLRELLT TFRLQLTEPG LRPALEASCE EYSAKFGFPV
KLDYQLPPRL VPSHQAIHLL QIAREALSNA LKHSQASEVV VTVAQNDNQV KLTVQDNGCG
VPENAIRSNH YGMIIMRDRA QSLRGDCRVR RRESGGTEVV VTFIPEKTFT DVQGDTHE
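
The relationship between the gene sequence and the protein sequence can be sketched as follows. This is a minimal illustration, not the database's own tooling: it strips the 10-character block spacing, computes a GC fraction, and translates codons using the amino acid assignments of the standard genetic code, which translation table 11 shares (table 11 differs in its permitted start codons, which this sketch does not handle). The function names are hypothetical.

```python
from itertools import product

# Codon order T, C, A, G with the first base varying slowest, paired with
# the standard amino acid string; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the display format."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(seq)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence shown above.
print(translate("ATGCTTAAAC GTTGTCTCTC TCCGCTCACC"))  # MLKRCLSPLT
```

The ten residues produced from the first 30 bases match the start of the listed protein sequence. Passing the full gene sequence to `gc_fraction` would give the value behind the rounded GC content field.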