Gene Information |
Locus tag | EcSMS35_0365 |
Symbol | prpD |
ID | 6145052 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 375744 |
End bp | 377195 |
Gene Length | 1452 bp |
Protein Length | 483 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 641615261 |
Product | 2-methylcitrate dehydratase |
Protein accession | YP_001742468 |
Protein GI | 170681387 |
COG category | [R] General function prediction only |
COG ID | [COG2079] Uncharacterized protein involved in propionate catabolism |
TIGRFAM ID | [TIGR02330] 2-methylcitrate dehydratase |
|
|
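The coordinate, length, and protein-length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the final codon of the CDS is a stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 375744
end_bp = 377195
protein_length_aa = 483

# Assumption: coordinates are inclusive on both ends, hence the +1.
gene_length_bp = end_bp - start_bp + 1

# A non-spliced CDS should be a whole number of codons.
codons, remainder = divmod(gene_length_bp, 3)

# Assumption: the last codon is the stop codon and encodes no residue.
expected_protein_aa = codons - 1

print(gene_length_bp)       # 1452
print(remainder)            # 0
print(expected_protein_aa)  # 483
```

Under these assumptions the three fields agree: 1452 bp is 484 codons, of which 483 encode amino acids.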
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 46 |
Fosmid unclonability p-value | 0.756913 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCAGCTC AAATCAACAA CATCCGCCCG GAATTTGATC GTGAAATCGT TGATATCGTC GATTACGTGA TGAACTACGA AATCAGCTCC AGAGTAGCCT ACGACACTGC ACATTACTGC CTGCTCGACA CGCTCGGCTG CGGTCTTGAA GCTCTCGAAT ACCCAGCCTG TAAAAAACTG CTGGGGCCAA TTGTCCCCGG CACCGTCGTA CCTAACGGCG TGCGCGTCCC CGGAACTCAG TTCCAGCTCG ACCCCGTCCA GGCGGCATTT AACATCGGCG CGATGATCCG TTGGCTCGAT TTCAACGATA CCTGGCTGGC GGCGGAGTGG GGCCATCCTT CCGACAACCT CGGCGGCATT CTGGCAACGG CGGACTGGCT TTCGCGCAAC GCGGTCGCCA GCGGCAAAGC GCCGTTGACC ATGAAACAGG TGCTGACCGC AATGATTAAA GCCCATGAAA TTCAGGGCTG CATCGCGCTG GAAAACTCCT TTAATCGCGT CGGCCTTGAC CACGTTCTGT TAGTGAAAGT GGCTTCCACC GCCGTGGTCG CCGAAATGCT TGGCCTGACC CGTGATGAAA TCCTCAACGC TGTTTCGCTG GCGTGGGTGG ACGGTCAGTC GCTGCGCACC TATCGCCATG CGCCGAACAC CGGCACGCGT AAATCCTGGG CGGCGGGCGA TGCCACGTCC CGCGCGGTAC GTCTGGCACT GATGGCGAAA ACGGGCGAAA TGGGCTACCC GTCAGCCCTG ACTGCGCCTG TGTGGGGTTT CTACGACGTC TCCTTTAAAG GTGAATCGTT CCGCTTCCAG CGCCCGTACG GTTCTTACGT CATGGAGAAT GTGCTGTTCA AAATCTCCTT CCCGGCGGAG TTCCACTCCC AGACGGCAGT TGAAGCGGCG ATGACGCTCT ATGAACAGAT GCAGGCAGCA GGCAAAACGG CGGCGGATAT CGAAAAAGTG ACCATTCGCA CCCACGAAGC CTGTATTCGC ATCATCGACA AAAAAGGGCC GCTCAATAAC CCGGCTGACC GCGACCACTG CATTCAGTAC ATGGTGGCGA TCCCGCTGCT GTTCGGGCGC TTAACGGCGG CAGATTACGA GGACAACGTT GCGCAAGATA AACGCATCGA CGCCCTGCGC GAGAAGATCA ATTGCTTTGA AGATCCGGCG TTTACCGCTG ACTACCACGA CCCGGAAAAA CGCGCCATCG CCAATGCCAT AACCCTTGAG TTCACCGACG GCACGCGCTT TGAAGAAGAG GTGGTGGAGT ACCCAATTGG TCATGCTCGC CGCCGTCAGG ATGGAATTCC GAAGCTGGTC GATAAATTCA AAATCAATCT CGCGCGCCAG TTCCCGACTC GCCAACAGCA GCGCATTCTG GAGGTTTCTC TCGACAGAGC TCGCCTGGAA CAGATGCCGG TCAATGAGTA TCTCGACCTG TACGTCATTT AA
|
Protein sequence | MSAQINNIRP EFDREIVDIV DYVMNYEISS RVAYDTAHYC LLDTLGCGLE ALEYPACKKL LGPIVPGTVV PNGVRVPGTQ FQLDPVQAAF NIGAMIRWLD FNDTWLAAEW GHPSDNLGGI LATADWLSRN AVASGKAPLT MKQVLTAMIK AHEIQGCIAL ENSFNRVGLD HVLLVKVAST AVVAEMLGLT RDEILNAVSL AWVDGQSLRT YRHAPNTGTR KSWAAGDATS RAVRLALMAK TGEMGYPSAL TAPVWGFYDV SFKGESFRFQ RPYGSYVMEN VLFKISFPAE FHSQTAVEAA MTLYEQMQAA GKTAADIEKV TIRTHEACIR IIDKKGPLNN PADRDHCIQY MVAIPLLFGR LTAADYEDNV AQDKRIDALR EKINCFEDPA FTADYHDPEK RAIANAITLE FTDGTRFEEE VVEYPIGHAR RRQDGIPKLV DKFKINLARQ FPTRQQQRIL EVSLDRARLE QMPVNEYLDL YVI
|
| |
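The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It assumes the amino-acid assignments of NCBI translation table 11, which are the same as those of the standard code (the tables differ in which codons may act as start codons; this gene starts with ATG, so that difference does not matter here). The helper names `clean`, `translate`, and `gc_fraction` are hypothetical, and only the first and last blocks of the sequence are used as input, not the full record.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def clean(seq):
    """Remove the spaces between 10-base blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()

def translate(seq):
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

def gc_fraction(seq):
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)

# First 30 bases of the gene sequence give the first 10 residues of the protein.
print(translate("ATGTCAGCTC AAATCAACAA CATCCGCCCG"))  # MSAQINNIRP

# Last 12 bases end in the stop codon TAA, matching the C-terminal "YVI".
print(translate("TACGTCATTT AA"))  # YVI

# GC fraction of the first 10-base block only (not the whole gene).
print(gc_fraction("ATGTCAGCTC"))  # 0.5
```

Passing the full Gene sequence field to `translate` and `gc_fraction` would allow a comparison with the Protein sequence and GC content fields of the record.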