Gene ECH74115_3343 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_3343 
SymbolnapA 
ID6968316 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp3075182 
End bp3077668 
Gene Length2487 bp 
Protein Length828 aa 
Translation table11 
GC content56% 
IMG OID643387155 
Productnitrate reductase catalytic subunit 
Protein accessionYP_002271618 
Protein GI209399788 
COG category[C] Energy production and conversion 
COG ID[COG0243] Anaerobic dehydrogenases, typically selenocysteine-containing 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence
[TIGR01706] periplasmic nitrate reductase, large subunit 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.139665 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones75 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAACTCA GTCGTCGTAG CTTTATGAAA GCTAACGCCG TTGCGGCCGC TGCGGCGGCT 
GCCGGTCTCA GCGTGCCGGG CGTTGCCCGC GCCGTTGTTG GTCAGCAGGA AGCCATTAAA
TGGGATAAAG CGCCGTGCCG TTTCTGCGGT ACTGGTTGCG GCGTTCTGGT CGGAACGCAG
CAGGGGCGTG TGGTGGCCTG TCAGGGCGAC CCGGACGCAC CGGTTAACCG TGGCCTGAAC
TGCATTAAGG GCTATTTCCT GCCCAAGATC ATGTACGGTA AAGACCGTTT GACGCAGCCG
CTGCTGCGTA TGAAAAACGG TAAATATGAC AAAGAAGGCG AATTTACCCC AATCACCTGG
GATCAGGCCT TCGATGTGAT GGAAGAGAAA TTCAAAACCG CCCTGAAAGA AAAAGGGCCG
GAATCGATCG GTATGTTCGG TTCTGGTCAG TGGACTATCT GGGAAGGTTA TGCCGCGTCC
AAGCTGTTTA AAGCGGGCTT CCGTTCGAAC AACATCGACC CGAATGCGCG TCACTGTATG
GCGTCGGCAG TAGTTGGCTT TATGCGAACC TTTGGTATGG ATGAGCCGAT GGGCTGCTAT
GACGACATCG AGCAGGCTGA CGCGTTTGTG CTGTGGGGCG CAAACATGGC GGAGATGCAC
CCGATCCTCT GGTCACGCAT CACTAACCGT CGTCTCTCTA ACCAGGACGT CACCGTGGCG
GTGCTTTCAA CCTACCAGCA TCGTAGCTTC GAGCTGGCGG ATAACGGCAT CATCTTTACG
CCGCAATCTG ACCTGGTGAT CCTGAACTAC ATCGCCAACT ATATCATTCA AAACAATGCG
ATAAATCAGG ACTTCTTCAG CAAGCACGTT AACCTGCGCA AAGGGGCGAC GGACATCGGC
TACGGTTTAC GTCCGACCCA TCCGCTGGAA AAAGCAGCGA AGAATCCGGG TTCTGACGCC
TCCGAACCGA TGAGCTTTGA AGATTACAAA GCCTTCGTTG CTGAGTATAC GCTGGAAAAA
ACCGCTGAAA TGACTGGCGT ACCAAAAGAT CAGCTGGAAC AACTGGCGCA GCTGTATGCC
GATCCGAACA AGAAAGTCAT CTCCTACTGG ACGATGGGCT TCAACCAGCA TACTCGTGGC
GTGTGGGCCA ACAACCTGGT CTACAACCTG CACCTGCTGA CCGGCAAAAT TTCCCAGCCG
GGTTGCGGTC CGTTCTCCCT GACCGGGCAG CCTTCCGCGT GTGGTACTGC GCGTGAAGTG
GGCACCTTTG CTCACCGTCT GCCTGCGGAC ATGGTGGTAA CTAACGAGAA ACATCGTGAT
ATCTGCGAGA AGAAGTGGAA TATCCCGAGC GGCACCATTC CGGCGAAAAT CGGTCTGCAT
GCGGTAGCAC AAGACCGTGC GCTGAAAGAC GGCAAGCTGA ATGTTTACTG GACCATGTGT
ACCAACAACA TGCAGGCCGG GCCGAACATT AATGAAGAGC GTATGCCGGG CTGGCGCGAT
CCGCGCAACT TCATCATCGT CTCCGATCCG TATCCGACAG TCAGTGCGCT GGCCGCCGAC
TTGATCCTGC CGACCGCAAT GTGGGTAGAA AAAGAGGGGG CTTACGGTAA CGCCGAACGC
CGTACTCAGT TCTGGCGTCA GCAGGTACAG GCGCCAGGCG AAGCGAAATC GGACCTCTGG
CAGTTAGTCC AGTTCTCCCG CCGCTTCAAA ACTGAAGAAG TATGGCCGGA AGAGCTGCTG
GCGAAGAAAC CGGAACTGCG TGGCAAAACG CTGTACGAAG TTCTGTATGC CACCCCCGAA
GTGAGCAAAT TCCCGGTATC CGAACTGGCG GAAGATCAGC TGAACGATGA ATCCCGCGAG
CTGGGCTTCT ATCTGCAAAA AGGGCTGTTC GAAGAGTACG CATGGTTTGG TCGCGGTCAC
GGTCACGATC TCGCGCCGTT CGATGACTAC CACAAAGCGC GCGGTCTGCG CTGGCCGGTG
GTGAACGGTA AAGAAACGCA GTGGCGTTAC AGCGAAGGTA ACGACCCGTA CGTGAAAGCG
GGCGAAGGCT ATAAGTTCTA CGGTAAACCG GATGGTAAAG CGGTGATCTT CGCGCTGCCG
TTCGAACCGG CGGCGGAAGC ACCGGATGAA GAGTACGACC TGTGGCTCTC TACCGGACGC
GTCCTGGAGC ACTGGCACAC CGGCAGTATG ACTCGCCGTG TGCCGGAACT GCACCGTGCC
TTCCCGGAAG CGGTCCTGTT TATTCACCCG CTGGATGCGA AAGCGCGCGA TCTGCGCCGT
GGCGACAAAG TGAAAGTGGT TTCTCGCCGT GGCGAAGTGA TCTCGATTGT CGAAACGCGC
GGTCGTAACC GTCCGCCGCA GGGCCTGGTG TACATGCCGT TCTTCGACGC CGCACAGCTG
GTTAACAAAC TAACGCTGGA CGCGACCGAT CCGCTCTCGA AAGAGACGGA CTTCAAGAAG
TGCGCGGTCA AACTGGAGAA GGTGTAA
 
Protein sequence
MKLSRRSFMK ANAVAAAAAA AGLSVPGVAR AVVGQQEAIK WDKAPCRFCG TGCGVLVGTQ 
QGRVVACQGD PDAPVNRGLN CIKGYFLPKI MYGKDRLTQP LLRMKNGKYD KEGEFTPITW
DQAFDVMEEK FKTALKEKGP ESIGMFGSGQ WTIWEGYAAS KLFKAGFRSN NIDPNARHCM
ASAVVGFMRT FGMDEPMGCY DDIEQADAFV LWGANMAEMH PILWSRITNR RLSNQDVTVA
VLSTYQHRSF ELADNGIIFT PQSDLVILNY IANYIIQNNA INQDFFSKHV NLRKGATDIG
YGLRPTHPLE KAAKNPGSDA SEPMSFEDYK AFVAEYTLEK TAEMTGVPKD QLEQLAQLYA
DPNKKVISYW TMGFNQHTRG VWANNLVYNL HLLTGKISQP GCGPFSLTGQ PSACGTAREV
GTFAHRLPAD MVVTNEKHRD ICEKKWNIPS GTIPAKIGLH AVAQDRALKD GKLNVYWTMC
TNNMQAGPNI NEERMPGWRD PRNFIIVSDP YPTVSALAAD LILPTAMWVE KEGAYGNAER
RTQFWRQQVQ APGEAKSDLW QLVQFSRRFK TEEVWPEELL AKKPELRGKT LYEVLYATPE
VSKFPVSELA EDQLNDESRE LGFYLQKGLF EEYAWFGRGH GHDLAPFDDY HKARGLRWPV
VNGKETQWRY SEGNDPYVKA GEGYKFYGKP DGKAVIFALP FEPAAEAPDE EYDLWLSTGR
VLEHWHTGSM TRRVPELHRA FPEAVLFIHP LDAKARDLRR GDKVKVVSRR GEVISIVETR
GRNRPPQGLV YMPFFDAAQL VNKLTLDATD PLSKETDFKK CAVKLEKV