Gene EcSMS35_2615 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_2615 
SymbolnarQ 
ID6145645 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp2670040 
End bp2671740 
Gene Length1701 bp 
Protein Length566 aa 
Translation table11 
GC content52% 
IMG OID641617486 
Productnitrate/nitrite sensor protein NarQ 
Protein accessionYP_001744651 
Protein GI170682253 
COG category[T] Signal transduction mechanisms 
COG ID[COG3850] Signal transduction histidine kinase, nitrate/nitrite-specific 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.538567 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones54 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGATTGTTA AACGACCCGT CTCGGCCAGT CTGGCCCGGG CCTTTTTTTA CATTGTGCTG 
CTGTCGATTC TTTCCACGGG TATCGCTCTG CTAACTCTGG CGAGCAGTTT GCGCGACGCT
GAGGCTATCA ATATTGCCGG ATCGCTGCGT ATGCAGAGTT ACCGCCTGGG CTACGATTTG
CAAAGTGGCA GTCCACAACT CAATGCACAT CGCCAGTTAT TTCAGCAGGC ACTGCATTCA
CCGGTATTAA CCAATCTCAA CGTCTGGTAT GTGCCAGAGG CAGTAAAAAC CCGCTATGCG
CATCTGAATG CCAACTGGCT GGAGATGAAT AATCGGCTCA GCAAGGGCGA CTTGCCGTGG
TATCAGGCCA ATATTAATAA TTATGTTAAT CAGATAGACC TGTTCGTGCT GGCTTTACAG
CACTACGCTG AACGCAAAAT GCTGCTGGTG GTGGCGATTT CCCTGGCTGG CGGCATCGGT
ATTTTCACGC TGGTCTTTTT TACTTTGCGC CGCATACGCC ATCAGGTGGT TGCCCCGCTG
AATCAGCTGG TTACCGCCAG TCAGCGTATT GAACACGGGC AGTTCGACTC GCCGCCGCTG
GATACCAGCC TGCCGAATGA GCTTGGCCTG CTTGCAAAAA CCTTTAACCA GATGTCGAGC
GAGCTGCATA AATTGTACCG TTCGCTGGAG GCGTCAGTAG AAGAAAAGAC CCGCGATCTC
CACGAGGCCA AGCGTCGTCT GGAGGTGTTG TATCAGTGTT CGCAGGCGCT AAACACCAGC
CAGATTGATG TGCATTGTTT CCGCCATATT TTGCAGATTG TTCGCGACAA TGAAGCGGCT
GAATATCTGG AGTTAAATGT CGGTGACAAC TGGCGGATTA GCGAAGGGCA GCCAAACCCG
GAATTGCCGA TGCAGATTTT ACCGGTGACA ATGCAAGAGA CGGTTTACGG CGAACTGCAC
TGGCAAAATA GTCACGTTTC ATCATCAGAA CCGCTGCTTA ACAGCGTTTC GTCGATGCTG
GGACGCGGTT TGTACTTTAA TCAGGCGCAG AAGCATTTTC AGCAGTTATT GTTGATGGAA
GAACGTGCGA CCATCGCCCG CGAATTGCAC GACTCGCTGG CTCAGGTACT TTCTTACTTA
CGTATCCAGT TGACGTTACT GAAGCGTTCG ATACCGGAAG ATAATGCCAC CGCACAAAGT
ATCATGGCCG ATTTTTCCCA GGCGTTGAAT GATGCTTATC GGCAGTTACG CGAGCTGTTA
ACTACTTTCC GCCTGACGCT GCAGCAGGCG GATCTCCCCT CCGCGTTGAG GGAAATGCTG
GATACGTTAC AAAATCAAAC CAGCGCCAAA CTGACCCTCG ACTGCCGTCT GCCAACCCTG
GCGCTGGATG CGCAAATGCA GGTGCATTTG TTGCAAATTA TTCGCGAAGC GGTGCTGAAT
GCGATGAAGC ACGCCAACGC CAGCGAAATC GCCGTTAGCT GCGTCACCGC GCCGGATGGC
AATCACACAG TCTATATTCG CGACAACGGG ATTGGCATCG GTGAACCGAA AGAACCCGAA
GGCCATTATG GTCTGAATAT CATGCGCGAA CGCGCGGAAA GATTGGGTGG GACGCTGACT
TTTTCGCAAC CTTCCGGCGG CGGCACGTTA GTGAGTATTA GCTTTCGCTC TGCGGAGGGT
GAGGAAAGTC AGCTGATGTA A
 
Protein sequence
MIVKRPVSAS LARAFFYIVL LSILSTGIAL LTLASSLRDA EAINIAGSLR MQSYRLGYDL 
QSGSPQLNAH RQLFQQALHS PVLTNLNVWY VPEAVKTRYA HLNANWLEMN NRLSKGDLPW
YQANINNYVN QIDLFVLALQ HYAERKMLLV VAISLAGGIG IFTLVFFTLR RIRHQVVAPL
NQLVTASQRI EHGQFDSPPL DTSLPNELGL LAKTFNQMSS ELHKLYRSLE ASVEEKTRDL
HEAKRRLEVL YQCSQALNTS QIDVHCFRHI LQIVRDNEAA EYLELNVGDN WRISEGQPNP
ELPMQILPVT MQETVYGELH WQNSHVSSSE PLLNSVSSML GRGLYFNQAQ KHFQQLLLME
ERATIARELH DSLAQVLSYL RIQLTLLKRS IPEDNATAQS IMADFSQALN DAYRQLRELL
TTFRLTLQQA DLPSALREML DTLQNQTSAK LTLDCRLPTL ALDAQMQVHL LQIIREAVLN
AMKHANASEI AVSCVTAPDG NHTVYIRDNG IGIGEPKEPE GHYGLNIMRE RAERLGGTLT
FSQPSGGGTL VSISFRSAEG EESQLM