Gene EcHS_A1551 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A1551 
SymbolnarY 
ID5591495 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp1554121 
End bp1555665 
Gene Length1545 bp 
Protein Length514 aa 
Translation table11 
GC content54% 
IMG OID640920705 
Productnitrate reductase, beta subunit 
Protein accessionYP_001458261 
Protein GI157160943 
COG category[C] Energy production and conversion 
COG ID[COG1140] Nitrate reductase beta subunit 
TIGRFAM ID[TIGR01660] nitrate reductase, beta subunit 


Plasmid Coverage information

Num covering plasmid clones54 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAATAC GTTCGCAAGT CGGCATGGTG CTTAACCTCG ACAAATGTAT CGGCTGCCAT 
ACCTGTTCGG TGACCTGTAA AAACGTCTGG ACCGGACGCG AAGGCATGGA GTACGCATGG
TTTAACAACG TCGAAACCAA ACCGGGCATT GGTTATCCGA AAAACTGGGA AGATCAGGAA
GAGTGGCAAG GCGGCTGGGT CCGCGATGTG AATGGCAAGA TACGCCCGCG TCTGGGTAGC
AAGATGGGCG TAATAACCAA AATCTTCGCC AACCCGGTGG TGCCGCAGAT TGATGATTAC
TACGAACCTT TCACCTTCGA CTACGAACAT TTGCATAGCG CACCGGAAGG CAAACATATC
CCTACTGCTC GCCCGCGTTC GCTGATTGAC GGTAAGCGGA TGGACAAAGT GATCTGGGGG
CCAAACTGGG AAGAACTGCT GGGCGGCGAG TTTGAAAAAC GTGCCCGCGA CCGCAACTTC
GAGGCCATGC AAAAGGAGAT GTACGGGCAG TTTGAAAACA CCTTCATGAT GTACCTGCCG
CGCCTGTGCG AACACTGCCT CAATCCCAGT TGCGTGGCGA CCTGCCCAAG CGGCGCTATC
TACAAACGCG AAGAAGACGG CATTGTGCTG ATTGATCAGG ATAAATGCCG TGGCTGGCGT
TTGTGCATTA GCGGTTGTCC GTACAAAAAA ATCTACTTCA ACTGGAAAAG CGGCAAGTCA
GAAAAATGTA TCTTCTGTTA TCCGCGAATT GAGTCAGGCC AACCCACCGT GTGCTCAGAA
ACCTGCGTGG GTCGCATCCG GTATCTGGGC GTGCTGCTTT ACGACGCCGA CCGCATTGAG
GAAGCGGCGA GCACCGAGCG CGAAGTTGAC CTCTATGAAC GTCAGTGCGA AGTGTTCCTC
GATCCACACG ATCCCTCAGT GATCGAGGAA GCCCTGAAAC AAGGTATTCC ACAAAACGTG
ATTGAAGCTG CCCAGCGTTC GCCTGTCTAC AAAATGGCGA TGGACTGGAA ACTGGCGCTA
CCGCTGCACC CTGAATATCG CACCCTGCCA ATGGTCTGGT ACGTTCCTCC GCTGTCGCCG
ATTCAGTCCT ACGCAGATGC GGGCGGTTTG CCGAAAAGCG AAGGCGTGCT GCCCGCCATC
GAAAGCCTGC GTATTCCGGT GCAATATCTC GCCAATATGT TGAGTGCCGG CGATACCGGT
CCGGTACTGC GGGCGCTGAA ACGGATAATG GCGATGCGCC ACTATATGCG TTCACAAACC
GTGGAAGGCG TTACTGATAC TCGTGCCATC GACGAAGTAG GCCTGAGCGT CGCCCAGGTC
GAAGAGATGT ATCGTTACCT CGCCATTGCC AACTATGAAG ATCGTTTTGT CATCCCGACG
AGCCATCGGG AAATGGCGGG CGATGCCTTC GCAGAACGCA ACGGCTGCGG TTTTACCTTT
GGCGACGGTT GTCACGGCTC GGACAGCAAA TTCAACCTGT TCAACAGTAG CCGTATCGAT
GCCATCAACA TCACCGAAGT GCGCGACAAA GCGGAGGGCG AATAA
 
Protein sequence
MKIRSQVGMV LNLDKCIGCH TCSVTCKNVW TGREGMEYAW FNNVETKPGI GYPKNWEDQE 
EWQGGWVRDV NGKIRPRLGS KMGVITKIFA NPVVPQIDDY YEPFTFDYEH LHSAPEGKHI
PTARPRSLID GKRMDKVIWG PNWEELLGGE FEKRARDRNF EAMQKEMYGQ FENTFMMYLP
RLCEHCLNPS CVATCPSGAI YKREEDGIVL IDQDKCRGWR LCISGCPYKK IYFNWKSGKS
EKCIFCYPRI ESGQPTVCSE TCVGRIRYLG VLLYDADRIE EAASTEREVD LYERQCEVFL
DPHDPSVIEE ALKQGIPQNV IEAAQRSPVY KMAMDWKLAL PLHPEYRTLP MVWYVPPLSP
IQSYADAGGL PKSEGVLPAI ESLRIPVQYL ANMLSAGDTG PVLRALKRIM AMRHYMRSQT
VEGVTDTRAI DEVGLSVAQV EEMYRYLAIA NYEDRFVIPT SHREMAGDAF AERNGCGFTF
GDGCHGSDSK FNLFNSSRID AINITEVRDK AEGE