Gene ECH74115_0405 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_0405 
SymbolprpD 
ID6966826 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp411510 
End bp412961 
Gene Length1452 bp 
Protein Length483 aa 
Translation table11 
GC content56% 
IMG OID643384457 
Product2-methylcitrate dehydratase 
Protein accessionYP_002268971 
Protein GI209398521 
COG category[R] General function prediction only 
COG ID[COG2079] Uncharacterized protein involved in propionate catabolism 
TIGRFAM ID[TIGR02330] 2-methylcitrate dehydratase 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones65 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCAGCTC AAATCAACAA CATCCGCCCG GAATTTGATC GTGAAATCGT TGATATCGTC 
GATTACGTCA TGAACTACGA AATCAGCTCC AAAGTAGCCT ACGACACCGC ACATTACTGT
CTGCTCGACA CGCTCGGCTG CGGTCTTGAA GCTCTCGAAT ACCCAGCCTG TAAAAAACTG
CTGGGGCCAA TTGTCCCCGG CACCGTCGTA CCTAACGGCG TGCGCGTCCC CGGAACTCAG
TTCCAGCTCG ACCCCGTCCA GGCGGCATTT AACATCGGCG CGATGATCCG CTGGCTCGAT
TTCAACGATA CCTGGCTTGC GGCAGAGTGG GGCCATCCTT CCGACAATCT CGGCGGCATT
CTGGCAACGG CGGACTGGCT TTCACGCAAC GCGGTCGCCA GCGGCAAAGC GCCGTTGACC
ATGAAACAGG TGCTGACCGG AATGATTAAA GCCCATGAAA TTCAGGGCTG CATCGCGCTG
GAAAACTCCT TTAACCGCGT CGGCCTCGAC CACGTTCTGT TAGTGAAAGT GGCTTCCACC
GCCGTGGTCG CCGAAATGCT TGGCCTGACC CGCGAGGAAA TTCTCAACGC TGTTTCGCTG
GCGTGGGTGG ACGGTCAGTC GCTGCGCACC TATCGCCATG CGCCGAACAC CGGCACGCGT
AAATCCTGGG CGGCGGGCGA TGCCACTTCC CGCGCGGTAC GTCTGGCACT GATGGCGAAA
ACGGGCGAAA TGGGTTACCC GTCAGCCCTG ACTGCGCCTG TGTGGGGCTT CTACGACGTC
TCCTTTAAAG GTGAATCGTT CCGCTTCCAG CGTCCGTACG GTTCTTACGT GATGGAAAAT
GTGCTGTTCA AAATCTCCTT CCCGGCGGAG TTCCACTCCC AGACGGCAGT TGAAGCAGCG
ATGACGCTCT ATGAACAGAT GCAGGCAGCA GGCAAAACGG CGGCAGATAT CGAAAAAGTG
TCCATCCGCA CCCACGAAGC CTGTATTCGC ATCATCGACA AAAAGGGGCC GCTCAATAAC
CCGGCAGACC GCGACCACTG CATTCAGTAC ATGGTGGCGA TCCCACTGCT ATTCGGGCGC
TTAACGGCGG CAGATTACGA GGACAACGTT GCGCAAGATA AACGCATTGA CGCCCTGCGC
GAGAAGATCA ATTGCTTTGA AGATCCGGTA TTTACCGCTG ACTACCACGA CCCGGAAAAA
CGCGCCATCG CCAATGCCAT TACCCTTGAG TTCACCGACG GCACACGATT TGAAGAAGTG
GTGGTGGAGT ACCCCATTGG TCATGCTCGC CGCCGTCAGG ATGGTATTCC GAAACTGGTC
GATAAATTCA AAATCAATCT CGCGCGCCAG TTCCCGACTC GCCAACAGCA GCGCATTCTG
GAGGTTTCTC TCGACAGAGC TCGCCTGGAA CAGATGCCGG TCAATGAGTA TCTCGACCTG
TACGTCATTT AA
 
Protein sequence
MSAQINNIRP EFDREIVDIV DYVMNYEISS KVAYDTAHYC LLDTLGCGLE ALEYPACKKL 
LGPIVPGTVV PNGVRVPGTQ FQLDPVQAAF NIGAMIRWLD FNDTWLAAEW GHPSDNLGGI
LATADWLSRN AVASGKAPLT MKQVLTGMIK AHEIQGCIAL ENSFNRVGLD HVLLVKVAST
AVVAEMLGLT REEILNAVSL AWVDGQSLRT YRHAPNTGTR KSWAAGDATS RAVRLALMAK
TGEMGYPSAL TAPVWGFYDV SFKGESFRFQ RPYGSYVMEN VLFKISFPAE FHSQTAVEAA
MTLYEQMQAA GKTAADIEKV SIRTHEACIR IIDKKGPLNN PADRDHCIQY MVAIPLLFGR
LTAADYEDNV AQDKRIDALR EKINCFEDPV FTADYHDPEK RAIANAITLE FTDGTRFEEV
VVEYPIGHAR RRQDGIPKLV DKFKINLARQ FPTRQQQRIL EVSLDRARLE QMPVNEYLDL
YVI