Gene EcSMS35_1707 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_1707 
SymbolnarY 
ID6146791 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp1715140 
End bp1716684 
Gene Length1545 bp 
Protein Length514 aa 
Translation table11 
GC content54% 
IMG OID641616583 
Productnitrate reductase 2, beta subunit 
Protein accessionYP_001743761 
Protein GI170683492 
COG category[C] Energy production and conversion 
COG ID[COG1140] Nitrate reductase beta subunit 
TIGRFAM ID[TIGR01660] nitrate reductase, beta subunit 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones46 
Fosmid unclonability p-value0.764488 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAATAC GTTCGCAAGT CGGCATGGTG CTTAACCTCG ATAAATGTAT CGGCTGCCAT 
ACCTGTTCGG TGACCTGTAA AAACGTCTGG ACCGGACGCG AAGGCATGGA GTACGCATGG
TTTAACAACG TCGAAACCAA ACCGGGCATT GGTTATCCGA AAAACTGGGA AGATCAGGAA
GAGTGGCAAG GCGGCTGGGT GCGTGATGTG AATGGCAAGA TCCGCCCGCG TCTGGGCAGC
AAGATGGGCG TGATTACCAA AATCTTCGCC AACCCGGTGG TGCCGCAGAT TGATGATTAC
TACGAACCTT TCACCTTCGA CTACGAACAT TTGCATAGCG CGCCAGAAGG TAAACATATT
CCTACTGCGC GCCCGCGTTC ACTGATTGAC GGCAAACGGA TGGACAAAGT GATCTGGGGG
CCAAACTGGG AAGAATTGTT GGGCGGCGAA TTTGAAAAAC GCGCCCGCGA CCGCAACTTC
GAGGCCATGC AAAAGGAGAT GTACGGGCAG TTTGAAAACA CGTTCATGAT GTACCTGCCG
CGCCTGTGCG AACACTGCCT CAATCCCAGC TGCGTGGCGA CCTGCCCAAG CGGTGCTATC
TACAAACGTG AAGAAGACGG CATTGTGCTG ATTGATCAGG ATAAATGCCG TGGCTGGCGT
TTGTGCATTA GCGGTTGTCC GTACAAAAAA ATCTACTTCA ACTGGAAAAG CGGCAAGTCA
GAAAAATGCA TCTTCTGTTA CCCACGAATT GAGTCAGGCC AACCCACCGT GTGCTCAGAA
ACCTGCGTGG GTCGCATCCG ATATCTGGGC GTGCTGCTTT ACGACGCCGA CCGCATCGAG
GAAGCGGCGA GCACCGAGCG CGAAGTTGAC CTCTATGAAC GCCAGTGCGA AGTGTTCCTC
GATCCGCACG ATCCCTCGGT GATCGAGGAA GCCCTGAAAC AAGGTATTCC ACAAAACGTG
ATTGACGCTG CCCAGCGTTC GCCAGTCTAC AAAATGGCAA TGGACTGGAA ACTGGCGCTA
CCGCTGCACC CTGAATACCG CACCCTGCCG ATGGTCTGGT ACGTTCCGCC GCTGTCACCG
ATTCAGTCCT ACGCTGATGC GGGAGGTTTG CCGAAAAGCG AAGGCGTGCT GCCCGCCATC
GAAAGCCTGC GTATTCCGGT GCAATATCTC GCCAATATGT TGAGTGCCGG TGATACCGGT
CCGGTACTGC GGGCGCTGAA ACGGATGATG GCGATGCGCC ACTATATGCG TTCACAAACC
GTGGAAGGCG TTACTGATAC TCGTGCCATC GACGAAGTAG GTCTGAGCGT CGCCCAGGTC
GAAGAGATGT ATCGTTACCT CGCCATTGCC AACTATGAAG ACCGTTTTGT CATCCCGACA
AGCCATCGGG AAATGGCGGG CGATGCCTTC GCAGAACGCA ACGGCTGCGG TTTTACCTTT
GGCGACGGTT GCCACGGCTC GGACAGCAAA TTCAACCTGT TCAACAGTAG CCGTATCGAT
GCCATCAACA TCACCGAAGT GCGCGACAAA GCGGAGGGCG AATAA
 
Protein sequence
MKIRSQVGMV LNLDKCIGCH TCSVTCKNVW TGREGMEYAW FNNVETKPGI GYPKNWEDQE 
EWQGGWVRDV NGKIRPRLGS KMGVITKIFA NPVVPQIDDY YEPFTFDYEH LHSAPEGKHI
PTARPRSLID GKRMDKVIWG PNWEELLGGE FEKRARDRNF EAMQKEMYGQ FENTFMMYLP
RLCEHCLNPS CVATCPSGAI YKREEDGIVL IDQDKCRGWR LCISGCPYKK IYFNWKSGKS
EKCIFCYPRI ESGQPTVCSE TCVGRIRYLG VLLYDADRIE EAASTEREVD LYERQCEVFL
DPHDPSVIEE ALKQGIPQNV IDAAQRSPVY KMAMDWKLAL PLHPEYRTLP MVWYVPPLSP
IQSYADAGGL PKSEGVLPAI ESLRIPVQYL ANMLSAGDTG PVLRALKRMM AMRHYMRSQT
VEGVTDTRAI DEVGLSVAQV EEMYRYLAIA NYEDRFVIPT SHREMAGDAF AERNGCGFTF
GDGCHGSDSK FNLFNSSRID AINITEVRDK AEGE