Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_1109 |
Symbol | pyrD |
ID | 6966835 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 1137595 |
End bp | 1138605 |
Gene Length | 1011 bp |
Protein Length | 336 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 643385115 |
Product | dihydroorotate dehydrogenase 2 |
Protein accession | YP_002269614 |
Protein GI | 209397463 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0167] Dihydroorotate dehydrogenase |
TIGRFAM ID | [TIGR01036] dihydroorotate dehydrogenase, subfamily 2 |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.00000226851 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 56 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTACTACC CCTTCGTTCG TAAAGCCCTT TTCCAGCTCG ATCCAGAGCG CGCTCATGAG TTTACTTTTC AGCAATTACG CCGCATTACC GGAACACCGT TTGAAGCACT GGTGCGGCAG AAAGTGCCTG CAAAACCTGT TAACTGCATG GGCCTGACGT TTAAAAATCC GCTTGGTCTG GCAGCCGGTC TTGACAAAGA CGGGGAGTGC ATTGATGCGT TAGGCGCGAT GGGATTTGGA TCGATAGAGA TCGGTACCGT CACGCCACGT CCACAGCCAG GTAATGACAA GCCGCGTCTC TTTCGTCTGG TAGATGCCGA AGGTTTGATC AACCGTATGG GCTTTAATAA TCTTGGCGTT GATAACCTCG TAGAGAACGT AAAAAAAGCC CATTATGACG GCGTCCTGGG TATTAACATC GGCAAAAATA AAGATACGCC AGTAGAGCAG GGCAAAGATG ACTATCTGAT TTGTATGGAA AAAATCTATG CCTATGCGGG ATATATCGCC ATCAATATTT CATCGCCAAA TACCCCAGGA TTACGCACAC TGCAATATGG TGAAGCGCTG GATGATCTCT TAACTGCGAT TAAAAATAAA CAAAATGATT TGCAAGCGAT GCACCATAAA TATGTGCCGA TCGCAGTGAA GATCGCGCCG GATCTTTCTG AAGAAGAATT GATCCAGGTT GCCGATAGTT TAGTTCGCCA TAATATTGAT GGCGTTATTG CAACCAATAC CACACTCGAT CGTTCTCTGG TTCAGGGAAT GAAAAATTGC GATCAAACCG GTGGCTTAAG TGGTCGTCCG CTTCAGTTAA AAAGCACCGA AATTATTCGC CGCTTGTCAC TGGAATTAAA CGGTCGCTTA CCGATCATCG GTGTTGGCGG CATCGACTCG GTTATCGCTG CGCGTGAAAA GATTGCTGCG GGTGCCTCAC TGGTGCAAAT TTATTCTGGT TTTATTTTTA AAGGTCCGCC GCTGATTAAA GAAATCGTTA CCCATATCTA A
|
Protein sequence | MYYPFVRKAL FQLDPERAHE FTFQQLRRIT GTPFEALVRQ KVPAKPVNCM GLTFKNPLGL AAGLDKDGEC IDALGAMGFG SIEIGTVTPR PQPGNDKPRL FRLVDAEGLI NRMGFNNLGV DNLVENVKKA HYDGVLGINI GKNKDTPVEQ GKDDYLICME KIYAYAGYIA INISSPNTPG LRTLQYGEAL DDLLTAIKNK QNDLQAMHHK YVPIAVKIAP DLSEEELIQV ADSLVRHNID GVIATNTTLD RSLVQGMKNC DQTGGLSGRP LQLKSTEIIR RLSLELNGRL PIIGVGGIDS VIAAREKIAA GASLVQIYSG FIFKGPPLIK EIVTHI
|
| |