Gene EcSMS35_1916 details


Gene Information

Locus tag: EcSMS35_1916
Symbol: narK
ID: 6145199
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 1938280
End bp: 1939671
Gene Length: 1392 bp
Protein Length: 463 aa
Translation table: 11
GC content: 54%
IMG OID: 641616792
Product: nitrite extrusion protein 1
Protein accession: YP_001743968
Protein GI: 170682669
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG2223] Nitrate/nitrite transporter
TIGRFAM ID: [TIGR00886] nitrite extrusion protein (nitrite facilitator)
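
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
# Values copied from the Gene Information record.
START_BP = 1938280
END_BP = 1939671
GENE_LENGTH_BP = 1392
PROTEIN_LENGTH_AA = 463


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons in the CDS minus one terminal stop codon (assumed)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))            # 1392
print(expected_protein_length(GENE_LENGTH_BP))  # 463
```

Under these assumptions both values agree with the Gene Length and Protein Length fields.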


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0184588
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 41
Fosmid unclonability p-value: 0.196505
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGTCACT CATCCGCCCC CGAAAGGGCT ACTGGAGCTG TCATTACAGA TTGGCGACCG 
GAAGATCCTG CGTTCTGGCA ACAACGCGGT CAACGTATTG CCAGCCGCAA CCTGTGGATT
TCCGTTCCCT GTCTGCTGCT GGCGTTTTGC GTATGGATGT TGTTCAGCGC TGTTGCGGTG
AACTTACCGA AAGTCGGTTT TAATTTTACG ACCGATCAGC TATTTATGTT GACTGCGCTG
CCTTCGGTTT CTGGCGCGTT ATTACGTGTT CCATACTCCT TTATGGTTCC TATCTTCGGT
GGTCGTCGCT GGACGGCGTT CAGCACCGGT ATTCTGATTA TTCCTTGCGT CTGGCTGGGT
TTTGCCGTGC AGGATACCTC CACGCCTTAT AGCGTCTTCA TCATCATCTC TCTGCTGTGC
GGCTTTGCTG GCGCGAACTT CGCATCCAGT ATGGCAAACA TCAGCTTCTT CTTTCCGAAA
CAGAAGCAGG GTGGCGCGCT GGGTCTGAAT GGTGGTCTGG GCAACATGGG CGTCAGCGTC
ATGCAGTTGG TTGCTCCGCT GGTGGTATCA CTGTCGATTT TCGCAGTATT TGGTAGCCAG
GGTGTCAAAC AGCCGGATGG GACTGAGCTG TATCTGGCGA ATGCGTCCTG GGTATGGGTG
CCGTTCCTTG CCATCTTCAC CATTGCGGCG TGGTTTGGCA TGAACGATCT TGCTACCTCG
AAAGCCTCCA TCAAGGAGCA GTTGCCGGTA CTCAAACGGG GTCATCTGTG GATTATGAGC
CTGCTGTATC TGGCAACCTT CGGCTCCTTC ATCGGCTTCT CCGCGGGCTT TGCGATGCTG
TCAAAAACGC AGTTCCCGGA TGTTCAGATT CTGCAATACG CTTTCTTCGG GCCGTTTATT
GGTGCGCTGG CGCGTTCTGC AGGTGGTGCA TTATCTGACC GTCTGGGCGG AACTCGTGTC
ACGCTGGTGA ACTTTATCCT GATGGCGATT TTCAGCGGCC TGCTGTTCCT GACCTTACCG
ACTGACGGAC AGGGCGGAAG CTTCATGGCG TTCTTCGCAG TCTTCCTGGC GCTGTTCCTG
ACAGCTGGGC TGGGTAGTGG TTCCACTTTC CAGATGATTT CCGTGATCTT CCGTAAACTG
ACAATGGATC GCGTGAAAGC AGAAGGGGGT TCTGACGAAC GTGCGATGCG TGAAGCGGCA
ACCGACACGG CGGCGGCGCT GGGTTTCATC TCTGCGATTG GCGCGATTGG TGGCTTCTTT
ATCCCGAAAG CGTTTGGTAG CTCGCTGGCA TTAACGGGTT CGCCAGTCGG CGCAATGAAA
GTATTTTTGA TTTTCTATAT CGCCTGCGTG GTGATTACCT GGGCGGTATA TGGTCGGCAT
TCTAAAAAAT AA
 
Protein sequence
MSHSSAPERA TGAVITDWRP EDPAFWQQRG QRIASRNLWI SVPCLLLAFC VWMLFSAVAV 
NLPKVGFNFT TDQLFMLTAL PSVSGALLRV PYSFMVPIFG GRRWTAFSTG ILIIPCVWLG
FAVQDTSTPY SVFIIISLLC GFAGANFASS MANISFFFPK QKQGGALGLN GGLGNMGVSV
MQLVAPLVVS LSIFAVFGSQ GVKQPDGTEL YLANASWVWV PFLAIFTIAA WFGMNDLATS
KASIKEQLPV LKRGHLWIMS LLYLATFGSF IGFSAGFAML SKTQFPDVQI LQYAFFGPFI
GALARSAGGA LSDRLGGTRV TLVNFILMAI FSGLLFLTLP TDGQGGSFMA FFAVFLALFL
TAGLGSGSTF QMISVIFRKL TMDRVKAEGG SDERAMREAA TDTAAALGFI SAIGAIGGFF
IPKAFGSSLA LTGSPVGAMK VFLIFYIACV VITWAVYGRH SKK
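
The gene and protein sequences above are printed in blocks of ten characters, so the whitespace has to be removed before the two can be compared. The following is a minimal sketch of that clean-up and of a codon-by-codon translation. It assumes the standard amino-acid assignments, which translation table 11 shares with the standard code (the two differ only in permitted start codons), and it stops at the first stop codon. The helper names `clean` and `translate` are hypothetical.

```python
# Codon table built from the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display; upper-case the rest."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate complete codons from position 0, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three and last two blocks of the gene sequence shown above.
first_blocks = "ATGAGTCACT CATCCGCCCC CGAAAGGGCT"
last_blocks = "TCTAAAAAAT AA"

print(translate(first_blocks))  # MSHSSAPERA
print(translate(last_blocks))   # SKK
```

The two printed fragments match the first ten and last three residues of the protein sequence listed above. Running `translate` on the full gene sequence is the natural next step.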