Gene EcSMS35_0365 details


Gene Information

Locus tag: EcSMS35_0365
Symbol: prpD
ID: 6145052
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 375744
End bp: 377195
Gene Length: 1452 bp
Protein Length: 483 aa
Translation table: 11
GC content: 57%
IMG OID: 641615261
Product: 2-methylcitrate dehydratase
Protein accession: YP_001742468
Protein GI: 170681387
COG category: [R] General function prediction only
COG ID: [COG2079] Uncharacterized protein involved in propionate catabolism
TIGRFAM ID: [TIGR02330] 2-methylcitrate dehydratase
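
The gene and protein lengths in this record follow from the start and end coordinates. The short Python sketch below shows the arithmetic. It assumes the coordinates are 1-based and inclusive and that the CDS ends with a single untranslated stop codon. The function names are illustrative and not part of any database API.

```python
# Coordinates copied from the record above.
# Assumption: 1-based, inclusive coordinates on the replicon.
START_BP = 375744
END_BP = 377195


def gene_length(start, end):
    """Length in bp of a feature spanning start..end inclusive."""
    return end - start + 1


def protein_length(gene_len):
    """Residue count for a CDS of gene_len bp.

    Assumption: the CDS is a whole number of codons and the last codon
    is a stop codon, which is not counted as a residue.
    """
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1452
print(protein_length(length_bp))  # 483
```

Both values agree with the Gene Length and Protein Length fields of the record.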


Plasmid Coverage information

Num covering plasmid clones: 29
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 46
Fosmid unclonability p-value: 0.756913
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCAGCTC AAATCAACAA CATCCGCCCG GAATTTGATC GTGAAATCGT TGATATCGTC 
GATTACGTGA TGAACTACGA AATCAGCTCC AGAGTAGCCT ACGACACTGC ACATTACTGC
CTGCTCGACA CGCTCGGCTG CGGTCTTGAA GCTCTCGAAT ACCCAGCCTG TAAAAAACTG
CTGGGGCCAA TTGTCCCCGG CACCGTCGTA CCTAACGGCG TGCGCGTCCC CGGAACTCAG
TTCCAGCTCG ACCCCGTCCA GGCGGCATTT AACATCGGCG CGATGATCCG TTGGCTCGAT
TTCAACGATA CCTGGCTGGC GGCGGAGTGG GGCCATCCTT CCGACAACCT CGGCGGCATT
CTGGCAACGG CGGACTGGCT TTCGCGCAAC GCGGTCGCCA GCGGCAAAGC GCCGTTGACC
ATGAAACAGG TGCTGACCGC AATGATTAAA GCCCATGAAA TTCAGGGCTG CATCGCGCTG
GAAAACTCCT TTAATCGCGT CGGCCTTGAC CACGTTCTGT TAGTGAAAGT GGCTTCCACC
GCCGTGGTCG CCGAAATGCT TGGCCTGACC CGTGATGAAA TCCTCAACGC TGTTTCGCTG
GCGTGGGTGG ACGGTCAGTC GCTGCGCACC TATCGCCATG CGCCGAACAC CGGCACGCGT
AAATCCTGGG CGGCGGGCGA TGCCACGTCC CGCGCGGTAC GTCTGGCACT GATGGCGAAA
ACGGGCGAAA TGGGCTACCC GTCAGCCCTG ACTGCGCCTG TGTGGGGTTT CTACGACGTC
TCCTTTAAAG GTGAATCGTT CCGCTTCCAG CGCCCGTACG GTTCTTACGT CATGGAGAAT
GTGCTGTTCA AAATCTCCTT CCCGGCGGAG TTCCACTCCC AGACGGCAGT TGAAGCGGCG
ATGACGCTCT ATGAACAGAT GCAGGCAGCA GGCAAAACGG CGGCGGATAT CGAAAAAGTG
ACCATTCGCA CCCACGAAGC CTGTATTCGC ATCATCGACA AAAAAGGGCC GCTCAATAAC
CCGGCTGACC GCGACCACTG CATTCAGTAC ATGGTGGCGA TCCCGCTGCT GTTCGGGCGC
TTAACGGCGG CAGATTACGA GGACAACGTT GCGCAAGATA AACGCATCGA CGCCCTGCGC
GAGAAGATCA ATTGCTTTGA AGATCCGGCG TTTACCGCTG ACTACCACGA CCCGGAAAAA
CGCGCCATCG CCAATGCCAT AACCCTTGAG TTCACCGACG GCACGCGCTT TGAAGAAGAG
GTGGTGGAGT ACCCAATTGG TCATGCTCGC CGCCGTCAGG ATGGAATTCC GAAGCTGGTC
GATAAATTCA AAATCAATCT CGCGCGCCAG TTCCCGACTC GCCAACAGCA GCGCATTCTG
GAGGTTTCTC TCGACAGAGC TCGCCTGGAA CAGATGCCGG TCAATGAGTA TCTCGACCTG
TACGTCATTT AA
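
The gene sequence above is printed in blocks of ten bases, so the spacing has to be removed before computing anything from it. The Python sketch below shows one way to clean such a listing and compute a GC percentage like the one in the GC content field. The helper names are hypothetical, and only the first line of the sequence is used as sample input.

```python
def clean_sequence(text):
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq):
    """Percentage of G and C bases in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence listing, used here as sample input.
first_line = (
    "ATGTCAGCTC AAATCAACAA CATCCGCCCG "
    "GAATTTGATC GTGAAATCGT TGATATCGTC"
)
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(round(gc_percent(seq), 1))
```

Passing the full listing through `clean_sequence` and then `gc_percent` gives the GC content of the whole gene.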
 
Protein sequence
MSAQINNIRP EFDREIVDIV DYVMNYEISS RVAYDTAHYC LLDTLGCGLE ALEYPACKKL 
LGPIVPGTVV PNGVRVPGTQ FQLDPVQAAF NIGAMIRWLD FNDTWLAAEW GHPSDNLGGI
LATADWLSRN AVASGKAPLT MKQVLTAMIK AHEIQGCIAL ENSFNRVGLD HVLLVKVAST
AVVAEMLGLT RDEILNAVSL AWVDGQSLRT YRHAPNTGTR KSWAAGDATS RAVRLALMAK
TGEMGYPSAL TAPVWGFYDV SFKGESFRFQ RPYGSYVMEN VLFKISFPAE FHSQTAVEAA
MTLYEQMQAA GKTAADIEKV TIRTHEACIR IIDKKGPLNN PADRDHCIQY MVAIPLLFGR
LTAADYEDNV AQDKRIDALR EKINCFEDPA FTADYHDPEK RAIANAITLE FTDGTRFEEE
VVEYPIGHAR RRQDGIPKLV DKFKINLARQ FPTRQQQRIL EVSLDRARLE QMPVNEYLDL
YVI
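
The protein sequence can be checked against the gene sequence by translating codon by codon. The Python sketch below builds the codon table by hand. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code and differs only in the permitted start codons. The `translate` function is an illustrative helper, not a library call, and it does not handle alternative start codons such as GTG or TTG.

```python
BASES = "TCAG"
# Amino acids in NCBI order for codons TTT, TTC, TTA, TTG, TCT, ...
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO))


def translate(dna):
    """Translate a DNA string, stopping at the first stop codon.

    Spaces and line breaks are ignored so block-formatted listings
    can be pasted in directly.
    """
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(seq), 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":  # stop codon ends translation
            break
        residues.append(amino_acid)
    return "".join(residues)


# First 30 bases and last 12 bases of the gene sequence above.
print(translate("ATGTCAGCTC AAATCAACAA CATCCGCCCG"))  # MSAQINNIRP
print(translate("TACGTCATTT AA"))                     # YVI
```

The two fragments reproduce the first ten and last three residues of the protein sequence listed above.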