Gene EcSMS35_2834 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_2834 
SymbolnorW 
ID6143799 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp2909139 
End bp2910272 
Gene Length1134 bp 
Protein Length377 aa 
Translation table11 
GC content54% 
IMG OID641617703 
Productnitric oxide reductase 
Protein accessionYP_001744858 
Protein GI170681646 
COG category[C] Energy production and conversion 
COG ID[COG1251] NAD(P)H-nitrite reductase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones32 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones33 
Fosmid unclonability p-value0.0243221 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCAACA GCATTGTGAT CATTGGTTCG GGCTTCGCCG CCCGCCAACT GGTGAAAAAT 
ATTCGCAAAC AGGACGCCAG TATTCCATTA ACTCTGATTG CCGCCGACAG CATGGATGAG
TACAACAAAC CTGACCTCAG CCATGTTATC AGTCAGGGGC AACGTGCCGA CGACCTTACC
CGCCAGACGG CAGGTGAATT TGCCGAGCAG TTTAATCTGC GCCTGTTTCC GCACACCTGG
GTAACGGATA TCGATGCCGA AGCCCATGTG GTGAAAAGTC AGAATAATCA GTGGCAATAC
GACAAGTTAG TGCTGGCAAC CGGTGCCAGC GCCTTTGTCC CGCCAGTGCC CGGGCGTGAG
TTAATGCTGA CGTTAAATAG TCAGCAAGAG TATCGCGCCT GTGAAACGCA ACTGCGGGAT
GCCCGACGCG TGTTGATTGT TGGCGGTGGC TTGATTGGTA GCGAGCTGGC GATGGATTTT
TGTCGGGCAG GCAAAGCGGT CACGCTGATC GACAACGCTG CCAGTATTCT GGCGTCGTTA
ATGCCACCGG AAGTAAGCAG CCGCTTGCAG CATCGGTTGA CGGAGATGGG CGTTCATCTG
CTGTTAAAAT CTCAGTTGCA GGGACTGGAA AAAACGGATT CTGGCATTCT GGCAACGCTG
GAATGCCAGC GCTGCATTGA AGTGGATGCG GTAATTGCCG CCACCGGACT GCGCCCGGAA
ACCGCCCTGG CACGACGCGC CGGGCTGACG ATTAATCGTG GCGTTTGCGT CGATAGTTAT
CTGCAAACCA GTAATGCCGA TATTTATGCG CTGGGCGATT GCGCGGAAAT TAACGGTCAG
GTATTGCCGT TCCTCCAGCC GATTCAACTT AGCGCAATGG TGCTGGCAAA AAATCTTCTC
GGCAATAACA CGCCGCTGAA ACTCCCGGCG ATGCTGGTGA AAATCAAAAC GCCAGAATTA
CCGCTGCATC TGGCAGGCGA AACCCAGCGT CAGGATTTAC GCTGGCAAAT TAATACCGAA
CGCCAGGGAA TGGTTGCGCG CGGTGTTGAC GATGCTGACC AGCTTCGCGC CTTTGTGGTC
AGTGAGGATC GGATGAAAGA GGCGTTTGGA TTGTTGAAAA CGTTGCCGAT GTAG
 
Protein sequence
MSNSIVIIGS GFAARQLVKN IRKQDASIPL TLIAADSMDE YNKPDLSHVI SQGQRADDLT 
RQTAGEFAEQ FNLRLFPHTW VTDIDAEAHV VKSQNNQWQY DKLVLATGAS AFVPPVPGRE
LMLTLNSQQE YRACETQLRD ARRVLIVGGG LIGSELAMDF CRAGKAVTLI DNAASILASL
MPPEVSSRLQ HRLTEMGVHL LLKSQLQGLE KTDSGILATL ECQRCIEVDA VIAATGLRPE
TALARRAGLT INRGVCVDSY LQTSNADIYA LGDCAEINGQ VLPFLQPIQL SAMVLAKNLL
GNNTPLKLPA MLVKIKTPEL PLHLAGETQR QDLRWQINTE RQGMVARGVD DADQLRAFVV
SEDRMKEAFG LLKTLPM