Gene Information |
Locus tag | ECH74115_0405 |
Symbol | prpD |
ID | 6966826 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 411510 |
End bp | 412961 |
Gene Length | 1452 bp |
Protein Length | 483 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 643384457 |
Product | 2-methylcitrate dehydratase |
Protein accession | YP_002268971 |
Protein GI | 209398521 |
COG category | [R] General function prediction only |
COG ID | [COG2079] Uncharacterized protein involved in propionate catabolism |
TIGRFAM ID | [TIGR02330] 2-methylcitrate dehydratase |
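The coordinate and length fields above are consistent with each other if Start bp and End bp are read as 1-based inclusive positions and the final codon is a stop codon. This is a minimal sketch of that arithmetic. The function names `gene_length` and `protein_length` are illustrative and not part of any database API, and the inclusive-coordinate reading is an assumption that happens to match the listed values.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the record above.
length_bp = gene_length(411510, 412961)
print(length_bp)                  # 1452
print(protein_length(length_bp))  # 483
```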
| |
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 65 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCAGCTC AAATCAACAA CATCCGCCCG GAATTTGATC GTGAAATCGT TGATATCGTC GATTACGTCA TGAACTACGA AATCAGCTCC AAAGTAGCCT ACGACACCGC ACATTACTGT CTGCTCGACA CGCTCGGCTG CGGTCTTGAA GCTCTCGAAT ACCCAGCCTG TAAAAAACTG CTGGGGCCAA TTGTCCCCGG CACCGTCGTA CCTAACGGCG TGCGCGTCCC CGGAACTCAG TTCCAGCTCG ACCCCGTCCA GGCGGCATTT AACATCGGCG CGATGATCCG CTGGCTCGAT TTCAACGATA CCTGGCTTGC GGCAGAGTGG GGCCATCCTT CCGACAATCT CGGCGGCATT CTGGCAACGG CGGACTGGCT TTCACGCAAC GCGGTCGCCA GCGGCAAAGC GCCGTTGACC ATGAAACAGG TGCTGACCGG AATGATTAAA GCCCATGAAA TTCAGGGCTG CATCGCGCTG GAAAACTCCT TTAACCGCGT CGGCCTCGAC CACGTTCTGT TAGTGAAAGT GGCTTCCACC GCCGTGGTCG CCGAAATGCT TGGCCTGACC CGCGAGGAAA TTCTCAACGC TGTTTCGCTG GCGTGGGTGG ACGGTCAGTC GCTGCGCACC TATCGCCATG CGCCGAACAC CGGCACGCGT AAATCCTGGG CGGCGGGCGA TGCCACTTCC CGCGCGGTAC GTCTGGCACT GATGGCGAAA ACGGGCGAAA TGGGTTACCC GTCAGCCCTG ACTGCGCCTG TGTGGGGCTT CTACGACGTC TCCTTTAAAG GTGAATCGTT CCGCTTCCAG CGTCCGTACG GTTCTTACGT GATGGAAAAT GTGCTGTTCA AAATCTCCTT CCCGGCGGAG TTCCACTCCC AGACGGCAGT TGAAGCAGCG ATGACGCTCT ATGAACAGAT GCAGGCAGCA GGCAAAACGG CGGCAGATAT CGAAAAAGTG TCCATCCGCA CCCACGAAGC CTGTATTCGC ATCATCGACA AAAAGGGGCC GCTCAATAAC CCGGCAGACC GCGACCACTG CATTCAGTAC ATGGTGGCGA TCCCACTGCT ATTCGGGCGC TTAACGGCGG CAGATTACGA GGACAACGTT GCGCAAGATA AACGCATTGA CGCCCTGCGC GAGAAGATCA ATTGCTTTGA AGATCCGGTA TTTACCGCTG ACTACCACGA CCCGGAAAAA CGCGCCATCG CCAATGCCAT TACCCTTGAG TTCACCGACG GCACACGATT TGAAGAAGTG GTGGTGGAGT ACCCCATTGG TCATGCTCGC CGCCGTCAGG ATGGTATTCC GAAACTGGTC GATAAATTCA AAATCAATCT CGCGCGCCAG TTCCCGACTC GCCAACAGCA GCGCATTCTG GAGGTTTCTC TCGACAGAGC TCGCCTGGAA CAGATGCCGG TCAATGAGTA TCTCGACCTG TACGTCATTT AA
|
Protein sequence | MSAQINNIRP EFDREIVDIV DYVMNYEISS KVAYDTAHYC LLDTLGCGLE ALEYPACKKL LGPIVPGTVV PNGVRVPGTQ FQLDPVQAAF NIGAMIRWLD FNDTWLAAEW GHPSDNLGGI LATADWLSRN AVASGKAPLT MKQVLTGMIK AHEIQGCIAL ENSFNRVGLD HVLLVKVAST AVVAEMLGLT REEILNAVSL AWVDGQSLRT YRHAPNTGTR KSWAAGDATS RAVRLALMAK TGEMGYPSAL TAPVWGFYDV SFKGESFRFQ RPYGSYVMEN VLFKISFPAE FHSQTAVEAA MTLYEQMQAA GKTAADIEKV SIRTHEACIR IIDKKGPLNN PADRDHCIQY MVAIPLLFGR LTAADYEDNV AQDKRIDALR EKINCFEDPV FTADYHDPEK RAIANAITLE FTDGTRFEEV VVEYPIGHAR RRQDGIPKLV DKFKINLARQ FPTRQQQRIL EVSLDRARLE QMPVNEYLDL YVI
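The gene sequence above is printed in space-separated blocks of ten bases, so it has to be normalized before it can be checked against fields such as Gene Length or GC content. The sketch below shows one way to do that in plain Python. The helper names `normalize_sequence` and `gc_percent` are hypothetical, and only the first three blocks of the gene sequence are used as sample input, so the printed GC value describes that 30-base prefix and not the whole gene.

```python
def normalize_sequence(raw: str) -> str:
    """Remove all whitespace from a block-formatted sequence and upper-case it."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First three blocks of the gene sequence from the record above.
prefix = normalize_sequence("ATGTCAGCTC AAATCAACAA CATCCGCCCG")
print(len(prefix))               # 30
print(prefix.startswith("ATG"))  # True: the CDS begins with a start codon
print(gc_percent(prefix))        # 50.0
```

Running the same two helpers over the full Gene sequence field should give a length that can be compared with the Gene Length entry and a GC percentage that can be compared with the GC content entry.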
|
| |