Gene ECH74115_1109 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_1109 
SymbolpyrD 
ID6966835 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp1137595 
End bp1138605 
Gene Length1011 bp 
Protein Length336 aa 
Translation table11 
GC content46% 
IMG OID643385115 
Productdihydroorotate dehydrogenase 2 
Protein accessionYP_002269614 
Protein GI209397463 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0167] Dihydroorotate dehydrogenase 
TIGRFAM ID[TIGR01036] dihydroorotate dehydrogenase, subfamily 2 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000226851 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones56 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTACTACC CCTTCGTTCG TAAAGCCCTT TTCCAGCTCG ATCCAGAGCG CGCTCATGAG 
TTTACTTTTC AGCAATTACG CCGCATTACC GGAACACCGT TTGAAGCACT GGTGCGGCAG
AAAGTGCCTG CAAAACCTGT TAACTGCATG GGCCTGACGT TTAAAAATCC GCTTGGTCTG
GCAGCCGGTC TTGACAAAGA CGGGGAGTGC ATTGATGCGT TAGGCGCGAT GGGATTTGGA
TCGATAGAGA TCGGTACCGT CACGCCACGT CCACAGCCAG GTAATGACAA GCCGCGTCTC
TTTCGTCTGG TAGATGCCGA AGGTTTGATC AACCGTATGG GCTTTAATAA TCTTGGCGTT
GATAACCTCG TAGAGAACGT AAAAAAAGCC CATTATGACG GCGTCCTGGG TATTAACATC
GGCAAAAATA AAGATACGCC AGTAGAGCAG GGCAAAGATG ACTATCTGAT TTGTATGGAA
AAAATCTATG CCTATGCGGG ATATATCGCC ATCAATATTT CATCGCCAAA TACCCCAGGA
TTACGCACAC TGCAATATGG TGAAGCGCTG GATGATCTCT TAACTGCGAT TAAAAATAAA
CAAAATGATT TGCAAGCGAT GCACCATAAA TATGTGCCGA TCGCAGTGAA GATCGCGCCG
GATCTTTCTG AAGAAGAATT GATCCAGGTT GCCGATAGTT TAGTTCGCCA TAATATTGAT
GGCGTTATTG CAACCAATAC CACACTCGAT CGTTCTCTGG TTCAGGGAAT GAAAAATTGC
GATCAAACCG GTGGCTTAAG TGGTCGTCCG CTTCAGTTAA AAAGCACCGA AATTATTCGC
CGCTTGTCAC TGGAATTAAA CGGTCGCTTA CCGATCATCG GTGTTGGCGG CATCGACTCG
GTTATCGCTG CGCGTGAAAA GATTGCTGCG GGTGCCTCAC TGGTGCAAAT TTATTCTGGT
TTTATTTTTA AAGGTCCGCC GCTGATTAAA GAAATCGTTA CCCATATCTA A
 
Protein sequence
MYYPFVRKAL FQLDPERAHE FTFQQLRRIT GTPFEALVRQ KVPAKPVNCM GLTFKNPLGL 
AAGLDKDGEC IDALGAMGFG SIEIGTVTPR PQPGNDKPRL FRLVDAEGLI NRMGFNNLGV
DNLVENVKKA HYDGVLGINI GKNKDTPVEQ GKDDYLICME KIYAYAGYIA INISSPNTPG
LRTLQYGEAL DDLLTAIKNK QNDLQAMHHK YVPIAVKIAP DLSEEELIQV ADSLVRHNID
GVIATNTTLD RSLVQGMKNC DQTGGLSGRP LQLKSTEIIR RLSLELNGRL PIIGVGGIDS
VIAAREKIAA GASLVQIYSG FIFKGPPLIK EIVTHI