Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcolC_3291 |
Symbol | prpD |
ID | 6065087 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli ATCC 8739 |
Kingdom | Bacteria |
Replicon accession | NC_010468 |
Strand | - |
Start bp | 3604468 |
End bp | 3605919 |
Gene Length | 1452 bp |
Protein Length | 483 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 641602706 |
Product | 2-methylcitrate dehydratase |
Protein accession | YP_001726240 |
Protein GI | 170021286 |
COG category | [R] General function prediction only |
COG ID | [COG2079] Uncharacterized protein involved in propionate catabolism |
TIGRFAM ID | [TIGR02330] 2-methylcitrate dehydratase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCAGCTC AAATCAACAA CATCCGCCCG GAATTTGATC GTGAAATCGT TGATATCGTC GATTACGTCA TGAACTACGA AATCAGCTCT AAAGTGGCCT ACGACACCGC ACATTACTGC CTGCTCGACA CGCTCGGCTG CGGTCTGGAA GCTCTCGAAT ACCCGGCCTG TAAAAAACTG CTGGGGCCAA TTGTTCCCGG CACCGTCGTA CCCAACGGCG TGCGCGTCCC CGGAACTCAG TTCCAGCTCG ACCCCGTCCA GGCGGCATTT AACATCGGCG CGATGATCCG CTGGCTCGAT TTCAACGATA CCTGGCTGGC GGCGGAGTGG GGCCATCCTT CCGACAACCT CGGCGGCATT CTGGCAACGG CGGACTGGCT TTCGCGCAAC GCGGTCGCCA GCGGCAAAGC GCCGTTGACC ATGAAACAGG TGCTGACCGC AATGATCAAA GCCCATGAAA TTCAGGGCTG CATCGCGCTG GAAAACTCCT TTAACCGCGT CGGCCTCGAC CACGTTCTGT TAGTGAAAGT GGCTTCCACC GCCGTGGTCG CCGAAATGCT CGGCCTGACC CGCGAGGAAA TTCTCAACGC CGTTTCGCTG GCGTGGGTGG ACGGTCAGTC GCTGCGCACC TATCGCCATG CGCCGAACAC CGGCACGCGT AAATCCTGGG CGGCGGGCAA TGCCACTTCC CGCGCGGTAC GTCTGGCACT GATGGCGAAA ACGGGCGAAA TGGGTTACCC GTCAGCCCTG ACTGCGCCGG TGTGGGGCTT CTACGACGTC TCCTTTAAAG GTGAATCGTT CCGCTTCCAG CGCCCGTACG GTTCCTACGT TATGGAAAAT GTGCTGTTCA AAATCTCCTT CCCGGCGGAG TTCCACTCCC AGACGGCAGT TGAAGCAGCG ATGACGCTCT ATGAACAGAT GCAGGCAGCA GGCAAAACGG CGGCGGATAT CGAAAAAGTG ACCATTCGCA CCCACGAAGC CTGTATTCGC ATCATCGACA AAAAAGGGCC GCTCAATAAC CCGGCAGACC GCGATCACTG CATTCAGTAC ATGGTGGCGA TCCCGCTGCT ATTCGGGCGC TTAACGGCGG CAGATTACGA GGACAACGTT GCGCAAGATA AACGCATTGA CGCCCTGCGC GAGAAGATCA ATTGCTTTGA AGATCCGGCA TTTACCGCTG ACTACCACGA CCCGGAAAAA CGCGCCATCG CCAATGCCAT TACCCTTGAG TTCACCGACG GCACACGATT TGAAGAAGTG GTGGTGGAGT ACCCCATTGG TCATGCTCGC CGCCGTCAGG ATGGTATTCC GAAACTGGTC GATAAATTCA AAATCAATCT CGCGCGCCAG TTCCCGACTC GCCAGCAGCA GCGCATTCTG GAGGTTTCTC TCGACAGAGC TCGCCTGGAA CAGATGCCGG TCAATGAGTA TCTCGACCTG TACGTCATTT AA
|
Protein sequence | MSAQINNIRP EFDREIVDIV DYVMNYEISS KVAYDTAHYC LLDTLGCGLE ALEYPACKKL LGPIVPGTVV PNGVRVPGTQ FQLDPVQAAF NIGAMIRWLD FNDTWLAAEW GHPSDNLGGI LATADWLSRN AVASGKAPLT MKQVLTAMIK AHEIQGCIAL ENSFNRVGLD HVLLVKVAST AVVAEMLGLT REEILNAVSL AWVDGQSLRT YRHAPNTGTR KSWAAGNATS RAVRLALMAK TGEMGYPSAL TAPVWGFYDV SFKGESFRFQ RPYGSYVMEN VLFKISFPAE FHSQTAVEAA MTLYEQMQAA GKTAADIEKV TIRTHEACIR IIDKKGPLNN PADRDHCIQY MVAIPLLFGR LTAADYEDNV AQDKRIDALR EKINCFEDPA FTADYHDPEK RAIANAITLE FTDGTRFEEV VVEYPIGHAR RRQDGIPKLV DKFKINLARQ FPTRQQQRIL EVSLDRARLE QMPVNEYLDL YVI
|
| |