Gene EcHS_A2344 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A2344 
SymbolnapA 
ID5592269 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp2341246 
End bp2343732 
Gene Length2487 bp 
Protein Length828 aa 
Translation table11 
GC content56% 
IMG OID640921470 
Productnitrate reductase catalytic subunit 
Protein accessionYP_001459005 
Protein GI157161687 
COG category[C] Energy production and conversion 
COG ID[COG0243] Anaerobic dehydrogenases, typically selenocysteine-containing 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence
[TIGR01706] periplasmic nitrate reductase, large subunit 


Plasmid Coverage information

Num covering plasmid clones52 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAACTCA GTCGTCGTAG CTTTATGAAA GCTAACGCCG TTGCGGCCGC TGCGGCGGCT 
GCCGGTCTCA GCGTGCCGGG CGTTGCCCGC GCCGTTGTTG GTCAGCAGGA AGCCATCAAA
TGGGATAAAG CGCCGTGCCG TTTCTGCGGT ACTGGTTGCG GCGTTCTGGT CGGAACGCAG
CAGGGACGTG TGGTGGCCTG TCAGGGCGAC CCGGACGCAC CGGTTAACCG TGGCCTGAAC
TGCATTAAGG GCTATTTCCT GCCCAAGATC ATGTACGGTA AAGACCGTTT GACGCAGCCG
CTGCTGCGTA TGAAAAACGG TAAATATGAC AAAGAAGGCG AATTTACCCC AATCACCTGG
GATCAGGCCT TCGATGTGAT GGAAGAGAAA TTCAAAACCG CCCTGAAAGA AAAAGGACCG
GAATCGATCG GTATGTTCGG TTCTGGTCAG TGGACTATCT GGGAAGGTTA TGCCGCGTCC
AAGCTGTTCA AAGCGGGCTT CCGTTCGAAC AACATCGACC CGAACGCACG TCACTGTATG
GCGTCGGCAG TAGTTGGCTT TATGCGTACC TTTGGTATGG ATGAGCCGAT GGGCTGCTAT
GACGACATCG AGCAGGCTGA CGCGTTTGTG CTGTGGGGCG CTAACATGGC GGAGATGCAC
CCGATCCTCT GGTCACGCAT CACTAACCGT CGTCTCTCTA ACCAGAACGT CACCGTGGCG
GTGCTTTCTA CCTACCAGCA TCGTAGCTTC GAGCTGGCGG ATAACGGCAT CATCTTTACG
CCGCAATCTG ACCTGGTGAT CCTGAACTAC ATCGCCAACT ATATCATTCA AAACAATGCG
ATAAATCAGG ACTTCTTCAG CAAGCACGTT AACCTGCGCA AAGGGGCGAC GGACATCGGC
TACGGTTTAC GTCCGACCCA TCCGCTGGAA AAAGCAGCGA AGAATCCGGG TTCTGACGCC
TCCGAACCGA TGAGCTTTGA AGATTACAAA GCCTTCGTTG CCGAGTATAC GCTGGAAAAA
ACTGCCGAAA TGACCGGCGT GCCGAAAGAC CAGTTAGAAC AACTGGCGCA GCTGTATGCC
GATCCGAACA AGAAAGTCAT CTCCTACTGG ACGATGGGCT TCAACCAGCA TACTCGTGGC
GTGTGGGCCA ACAACCTGGT CTACAACCTG CACCTGCTGA CCGGCAAAAT TTCCCAGCCG
GGTTGCGGTC CGTTCTCCCT GACCGGGCAG CCTTCCGCGT GTGGTACTGC GCGTGAAGTG
GGCACCTTTG CTCACCGTCT GCCTGCGGAC ATGGTGGTGA CTAACGAGAA ACACCGTGAT
ATTTGCGAGA AGAAGTGGAA TATCCCGAGC GGCACCATTC CGGCGAAAAT CGGTCTGCAC
GCGGTGGCAC AAGACCGTGC ACTGAAAGAC GGCAAGCTGA ATGTTTACTG GACCATGTGT
ACCAACAACA TGCAGGCCGG GCCGAACATT AATGAAGAGC GTATGCCGGG CTGGCGCGAT
CCGCGCAACT TCATCATCGT CTCCGATCCG TATCCGACAG TCAGTGCGCT GGCAGCCGAC
TTGATCCTGC CGACCGCAAT GTGGGTAGAG AAAGAGGGCG CTTACGGTAA CGCCGAACGC
CGTACTCAGT TCTGGCGTCA GCAGGTACAG GCGCCAGGCG AAGCGAAATC GGATCTCTGG
CAGTTAGTCC AGTTCTCCCG CCGCTTCAAA ACTGAAGAAG TATGGCCGGA AGAGCTGCTG
GCGAAGAAAC CGGAACTGCG TGGCAAAACG CTGTACGAAG TTCTGTATGC CACCCCCGAA
GTGAGCAAAT TCCCGGTATC CGAACTGGCG GAAGATCAGC TGAACGATGA ATCCCGCGAG
CTGGGCTTCT ATCTGCAAAA AGGGCTGTTC GAAGAGTACG CATGGTTTGG TCGCGGTCAC
GGTCACGATC TGGCACCGTT CGATGACTAC CACAAAGCGC GCGGTCTGCG CTGGCCGGTG
GTGAACGGTA AAGAAACGCA GTGGCGTTAC AGCGAAGGTA ACGACCCGTA CGTGAAAGCG
GGCGAAGGCT ACAAGTTCTA CGGTAAACCG GATGGCAAAG CGGTGATCTT CGCGCTGCCG
TTCGAACCGG CGGCGGAAGC ACCGGATGAA GAGTACGACC TGTGGCTCTC TACCGGACGC
GTCCTGGAGC ACTGGCACAC CGGCAGTATG ACTCGCCGTG TGCCGGAACT GCACCGCGCC
TTCCCGGAAG CGGTCCTGTT TATTCACCCG CTGGATGCGA AAGCGCGCGA TCTGCGCCGT
GGCGACAAAG TGAAAGTGGT TTCTCGCCGT GGCGAAGTGA TCTCGATTGT TGAAACGCGC
GGTCGTAACC GTCCGCCACA GGGCCTGGTG TACATGCCGT TCTTCGACGC CGCACAGCTG
GTTAACAAAC TGACGCTGGA TGCGACCGAT CCGCTCTCGA AAGAGACGGA CTTCAAGAAG
TGCGCGGTCA AACTGGAGAA GGTGTAA
 
Protein sequence
MKLSRRSFMK ANAVAAAAAA AGLSVPGVAR AVVGQQEAIK WDKAPCRFCG TGCGVLVGTQ 
QGRVVACQGD PDAPVNRGLN CIKGYFLPKI MYGKDRLTQP LLRMKNGKYD KEGEFTPITW
DQAFDVMEEK FKTALKEKGP ESIGMFGSGQ WTIWEGYAAS KLFKAGFRSN NIDPNARHCM
ASAVVGFMRT FGMDEPMGCY DDIEQADAFV LWGANMAEMH PILWSRITNR RLSNQNVTVA
VLSTYQHRSF ELADNGIIFT PQSDLVILNY IANYIIQNNA INQDFFSKHV NLRKGATDIG
YGLRPTHPLE KAAKNPGSDA SEPMSFEDYK AFVAEYTLEK TAEMTGVPKD QLEQLAQLYA
DPNKKVISYW TMGFNQHTRG VWANNLVYNL HLLTGKISQP GCGPFSLTGQ PSACGTAREV
GTFAHRLPAD MVVTNEKHRD ICEKKWNIPS GTIPAKIGLH AVAQDRALKD GKLNVYWTMC
TNNMQAGPNI NEERMPGWRD PRNFIIVSDP YPTVSALAAD LILPTAMWVE KEGAYGNAER
RTQFWRQQVQ APGEAKSDLW QLVQFSRRFK TEEVWPEELL AKKPELRGKT LYEVLYATPE
VSKFPVSELA EDQLNDESRE LGFYLQKGLF EEYAWFGRGH GHDLAPFDDY HKARGLRWPV
VNGKETQWRY SEGNDPYVKA GEGYKFYGKP DGKAVIFALP FEPAAEAPDE EYDLWLSTGR
VLEHWHTGSM TRRVPELHRA FPEAVLFIHP LDAKARDLRR GDKVKVVSRR GEVISIVETR
GRNRPPQGLV YMPFFDAAQL VNKLTLDATD PLSKETDFKK CAVKLEKV