Gene ECH74115_3690 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_3690 
SymbolnarQ 
ID6967799 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp3405972 
End bp3407672 
Gene Length1701 bp 
Protein Length566 aa 
Translation table11 
GC content52% 
IMG OID643387484 
Productnitrate/nitrite sensor protein NarQ 
Protein accessionYP_002271937 
Protein GI209396261 
COG category[T] Signal transduction mechanisms 
COG ID[COG3850] Signal transduction histidine kinase, nitrate/nitrite-specific 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones93 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGATTGTTA AACGACCCGT CTCGGCCAGT CTGGCCCGGG CCTTTTTTTA CATTGTGCTG 
CTGTCGATTC TTTCCACGGG TATCGCTCTG CTAACTCTGG CGAGCAGTTT GCGCGACGCT
GAGGCTATCA ATATTGCCGG ATCGCTGCGT ATGCAGAGTT ACCGCCTGGG CTACGACTTG
CAAAGTGGCA CTCCACAACT CAATGCACAT CGCCAGCTGT TTCAGCAGGC ACTGCATTCA
CCGGTATTAA CCAACCTCAA CGTCTGGTAT GTGCCAGAAG CAGTAAAAAC TCGCTATGCG
CATCTGAATG CCAACTGGCT GGAGATGAAT AATCGGCTCA GCAAGGGCGA TTTGCCGTGG
TATCAGGCCA ATATTAATAA TTATGTTAAT CAGATAGACC TGTTCGTGCT GGCTTTACAG
CACTACGCTG AACGCAAAAT GCTGCTGGTG GTGGCGATTT CCCTGGCTGG CGGTATCGGT
ATTTTCACGC TGGTCTTTTT TACTTTGGGC CGCATACGCC ATCAGGTGGT TGCCCCGCTG
AATCAGCTGG TTACCGCCAG TCAGCGTATT GAACACGGAC AGTTCGACTC GCCACCGCTG
GATACCAGCC TGCCGAATGA GCTTGGTCTA CTTGCAAAAA CCTTTAACCA GATGTCGAGC
GAGCTGCATA AATTGTACCG TTCGCTGGAG GCGTCAGTAG AAGAAAAGAC CCGCGATCTC
CACGAGGCCA AGCGTCGTCT GGAGGTGTTG TATCAATGTT CGCAGGCGCT GAACACCAGC
CAGATTGATG TGCATTGTTT CCGCCATATT TTGCAGATTG TTCGCGACAA TGAAGCGGCT
GAATATCTGG AGTTAAATGT CGGTGAAAAC TGGCGGATTA GCGAAGGGCA ACCAAACCCG
GAATTGCCGA TGCAGATTTT ACCGGTGACA ATGCAAGAGA CGGTTTACGG CGAACTGCAC
TGGCAAAATA GTCACGTTTC ATCATCAGAA CCGCTGCTTA ACAGCGTTTC GTCGATGCTG
GGACGCGGTT TGTACTTTAA TCAGGCGCAG AAGCATTTTC AGCAGTTATT GTTGATGGAA
GAACGTGCAA CCATCGCCCG CGAATTGCAC GACTCGCTGG CTCAGGTACT TTCTTACTTA
CGTATCCAGT TGACGTTACT TAAGCGTTCG ATACCGGAAG ATAACGCCAC CGCACAAAGT
ATCATGGCCG ATTTTTCCCA GGCGTTGAAT GATGCTTATC GGCAGTTACG CGAGCTGTTG
ACTACTTTTC GCCTGACGCT GCAGCAGGCG GATCTCCCCT CCGCGTTGAG GGAAATGCTG
GATACGTTAC AAAATCAAAC CAGCGCCAAA CTGACCCTCG ACTGCCGTCT GCCAACCCTG
GCACTGGATG CGCAAATGCA GGTGCATTTG TTGCAAATTA TTCGCGAAGC GGTGCTGAAT
GCGATGAAGC ACGCCAACGC CAGCGAAATC GCCGTCAGTT GCGTCACCGC GCCGGACGGC
AATCACACGG TTTATATCCG TGATAACGGG ATTGGTATCG GTGAACCGAA AGAACCCGAA
GGTCATTATG GTCTGAATAT CATGCGCGAA CGCGCGGAAC GGCTAGGTGG GACGCTGACT
TTTTCACAAC CTTCCGGCGG CGGCACGTTA GTGAGTATTA GCTTTCGCTC TGCGGAGGGT
GAGGAAAGTC AGTTGATGTA A
 
Protein sequence
MIVKRPVSAS LARAFFYIVL LSILSTGIAL LTLASSLRDA EAINIAGSLR MQSYRLGYDL 
QSGTPQLNAH RQLFQQALHS PVLTNLNVWY VPEAVKTRYA HLNANWLEMN NRLSKGDLPW
YQANINNYVN QIDLFVLALQ HYAERKMLLV VAISLAGGIG IFTLVFFTLG RIRHQVVAPL
NQLVTASQRI EHGQFDSPPL DTSLPNELGL LAKTFNQMSS ELHKLYRSLE ASVEEKTRDL
HEAKRRLEVL YQCSQALNTS QIDVHCFRHI LQIVRDNEAA EYLELNVGEN WRISEGQPNP
ELPMQILPVT MQETVYGELH WQNSHVSSSE PLLNSVSSML GRGLYFNQAQ KHFQQLLLME
ERATIARELH DSLAQVLSYL RIQLTLLKRS IPEDNATAQS IMADFSQALN DAYRQLRELL
TTFRLTLQQA DLPSALREML DTLQNQTSAK LTLDCRLPTL ALDAQMQVHL LQIIREAVLN
AMKHANASEI AVSCVTAPDG NHTVYIRDNG IGIGEPKEPE GHYGLNIMRE RAERLGGTLT
FSQPSGGGTL VSISFRSAEG EESQLM