Gene EcHS_A1333 details


Gene Information

Locus tag: EcHS_A1333
Symbol: narK
ID: 5593634
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli HS
Kingdom: Bacteria
Replicon accession: NC_009800
Strand:
Start bp: 1326396
End bp: 1327787
Gene Length: 1392 bp
Protein Length: 463 aa
Translation table: 11
GC content: 54%
IMG OID: 640920490
Product: nitrite extrusion protein 1
Protein accession: YP_001458051
Protein GI: 157160733
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG2223] Nitrate/nitrite transporter
TIGRFAM ID: [TIGR00886] nitrite extrusion protein (nitrite facilitator)
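
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon. Both are assumptions that happen to agree with the numbers in this record, and the function names are illustrative only.

```python
# Values copied from the record above.
start_bp = 1326396
end_bp = 1327787
gene_length_bp = 1392
protein_length_aa = 463


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residue count, assuming the CDS ends in exactly one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


print(span_length(start_bp, end_bp))            # 1392
print(expected_protein_length(gene_length_bp))  # 463
```

Both results match the Gene Length and Protein Length fields, so the record is internally consistent under these assumptions.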


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value: 0.000558307
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a

Sequence

Gene sequence
ATGAGTCACT CATCCGCCCC CGAAAGGGCT ACTGGAGCTG TCATTACAGA TTGGCGACCG 
GAAGATCCTG CGTTCTGGCA ACAACGCGGT CAACGTATTG CCAGCCGCAA CCTGTGGATT
TCCGTTCCCT GTCTGCTGCT GGCGTTTTGC GTATGGATGT TGTTCAGCGC TGTTGCGGTG
AACTTACCGA AAGTCGGTTT TAATTTTACG ACCGATCAGC TATTTATGTT GACTGCGCTG
CCTTCGGTTT CTGGCGCGTT ATTACGTGTT CCATACTCCT TTATGGTTCC TATCTTCGGT
GGTCGTCGCT GGACGGCGTT CAGCACCGGT ATTCTGATTA TTCCTTGCGT CTGGCTGGGT
TTTGCCGTGC AGGATACCTC CACGCCTTAT AGCGTCTTCA TCATCATCTC TCTGCTATGC
GGCTTTGCTG GCGCGAACTT CGCATCCAGT ATGGCAAACA TCAGCTTCTT CTTTCCGAAA
CAGAAGCAGG GTGGCGCGCT GGGTCTGAAT GGTGGTCTGG GAAACATGGG CGTCAGCGTC
ATGCAGTTGG TTGCTCCGCT GGTGGTATCA CTGTCGATTT TCGCAGTATT TGGTAGCCAG
GGCGTCAAAC AGCCGGATGG GACTGAGCTG TATCTGGCGA ATGCGTCCTG GATATGGGTG
CCGTTCCTTG CCATCTTCAC CATTGCGGCG TGGTTTGGCA TGAACGATCT TGCTACCTCG
AAAGCCTCCA TCAAGGAGCA GTTGCCGGTA CTCAAACGGG GTCATCTGTG GATTATGAGC
CTGCTGTATC TGGCAACCTT CGGCTCCTTC ATCGGCTTCT CCGCGGGCTT TGCAATGCTG
TCAAAAACGC AGTTCCCGGA TGTTCAGATT CTGCAATACG CTTTCTTCGG GCCGTTTATT
GGTGCGCTGG CGCGTTCTGC AGGTGGTGCA TTATCTGACC GTCTGGGCGG AACTCGTGTC
ACGCTGGTGA ACTTTATTCT GATGGCGATT TTCAGCGGCC TGCTGTTCCT GACCTTACCG
ACTGACGGGC AGGGCGGAAG CTTCATGGCG TTCTTCGCGG TCTTCCTGGC GCTGTTCCTG
ACAGCTGGGC TGGGTAGTGG TTCCACTTTC CAGATGATTT CAGTGATCTT CCGTAAACTG
ACAATGGATC GCGTGAAAGC AGAAGGGGGT TCTGACGAAC GTGCGATGCG TGAAGCGGCA
ACCGACACGG CGGCGGCGCT GGGTTTCATC TCTGCGATTG GCGCGATTGG TGGCTTCTTT
ATCCCGAAAG CGTTTGGTAG CTCGCTGGCA TTAACGGGTT CGCCAGTCGG CGCAATGAAG
GTATTTTTGA TTTTCTATAT CGCCTGCGTA GTGATTACCT GGGCGGTATA TGGTCGGCAT
TCTAAAAAAT AA
 
Protein sequence
MSHSSAPERA TGAVITDWRP EDPAFWQQRG QRIASRNLWI SVPCLLLAFC VWMLFSAVAV 
NLPKVGFNFT TDQLFMLTAL PSVSGALLRV PYSFMVPIFG GRRWTAFSTG ILIIPCVWLG
FAVQDTSTPY SVFIIISLLC GFAGANFASS MANISFFFPK QKQGGALGLN GGLGNMGVSV
MQLVAPLVVS LSIFAVFGSQ GVKQPDGTEL YLANASWIWV PFLAIFTIAA WFGMNDLATS
KASIKEQLPV LKRGHLWIMS LLYLATFGSF IGFSAGFAML SKTQFPDVQI LQYAFFGPFI
GALARSAGGA LSDRLGGTRV TLVNFILMAI FSGLLFLTLP TDGQGGSFMA FFAVFLALFL
TAGLGSGSTF QMISVIFRKL TMDRVKAEGG SDERAMREAA TDTAAALGFI SAIGAIGGFF
IPKAFGSSLA LTGSPVGAMK VFLIFYIACV VITWAVYGRH SKK
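
The sequence blocks above are printed in groups of ten characters, so the whitespace has to be stripped before any analysis. The sketch below shows one possible way to clean such a block, compute its GC percentage, and translate it. The helper names are hypothetical. The codon assignments are those of the standard genetic code, which translation table 11 shares; table 11 differs in its set of permitted start codons, which this sketch does not model (the gene here starts with ATG).

```python
# Standard codon assignments in NCBI order (first, second, third base over T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(block: str) -> str:
    """Remove spaces and line breaks from a grouped sequence block."""
    return "".join(block.split()).upper()


def gc_percent(block: str) -> float:
    """Percentage of G and C bases in the cleaned sequence."""
    seq = clean(block)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(block: str) -> str:
    """Translate codon by codon, stopping at the first stop codon.

    Raises KeyError on ambiguous bases such as N; trailing bases that
    do not form a full codon are ignored.
    """
    seq = clean(block)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First and last lines of the gene sequence shown above.
print(translate("ATGAGTCACT CATCCGCCCC CGAAAGGGCT"))  # MSHSSAPERA
print(translate("TCTAAAAAAT AA"))                     # SKK
```

The two printed fragments correspond to the first ten and last three residues of the protein sequence listed above. Passing the full gene sequence to `gc_percent` would give a value to compare against the GC content field.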