Gene SeD_A0402 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeD_A0402 
SymbolprpD 
ID6873119 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Dublin str. CT_02021853 
KingdomBacteria 
Replicon accessionNC_011205 
Strand
Start bp423494 
End bp424945 
Gene Length1452 bp 
Protein Length483 aa 
Translation table11 
GC content57% 
IMG OID642783634 
Product2-methylcitrate dehydratase 
Protein accessionYP_002214321 
Protein GI198241918 
COG category[R] General function prediction only 
COG ID[COG2079] Uncharacterized protein involved in propionate catabolism 
TIGRFAM ID[TIGR02330] 2-methylcitrate dehydratase 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value0.00000000295232 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGTCCGCAC ATATTTCGAA CATCCGCCCA GACTTTGACC GTGAAATCGT TGATATCGTT 
GATTACGTTA TGAATTACGA GATCACCTCA AAGGTGGCGT ACGACACCGC GCATTATTGC
CTGCTCGACA CGCTTGGCTG TGGTCTGGAA GCGCTGGAAT ACCCGGCCTG TAAAAAATTG
CTTGGGCCGA TCGTGCCAGG CACGGTGGTG CCCAACGGCG CACGCGTGCC GGGCACCCAG
TTCCAGCTCG ATCCGGTACA GGCAGCCTTT AACATTGGCG CGATGATCCG CTGGCTCGAT
TTTAATGATA CCTGGCTTGC CGCCGAGTGG GGCCATCCTT CTGATAACCT CGGCGGTATT
CTGGCGATTG CGGACTGGCT GTCACGCAAC GCCGTCGCCG CCGGCAAAGC GCCGCTGACC
ATGAAACAGG TATTGAGCGG GATGATCAAA GCCCATGAAA TTCAGGGTTG CATCGCGCTG
GAAAACGCCT TCAACCGTGT CGGGCTTGAC CATGTGCTGC TGGTGAAAGT GGCCTCGACT
GCGGTGGTCG CTGAAATGCT GGGGCTGACG CGCGATGAGA TCCTCAACGC GGTATCGCTG
GCGTGGGTGG ATGGGCAGTC GTTGCGCACT TATCGTCATG CGCCGAATAC CGGTACGCGC
AAATCCTGGG CGGCGGGCGA TGCGACTTCG CGCGCGGTAC GTCTGGCGCT GATGGCGAAA
ACCGGCGAGA TGGGTTATCC CTCGGCGCTC ACCGCCAAAA CCTGGGGCTT CTACGACGTT
TCATTCAAAG GTGAAACGTT CCGTTTCCAG CGTCCTTACG GCTCCTACGT GATGGAAAAC
GTGCTATTCA AAATTTCTTT CCCGGCAGAA TTCCACTCGC AAACCGCCGT CGAAGCGGCG
ATGACGCTGT ATGAGCAGAT GCAGGCCGCG GGTAAAACGG CAGCGGATAT CGAGAAAGTG
ACCATCCGCA CCCACGAAGC CTGTCTCCGC ATTATCGATA AAAAAGGCCC GCTCAATAAC
CCGGCGGACC GCGATCACTG TATCCAGTAT ATGGTCGCCG TGCCGTTGCT GTTCGGACGG
TTAACCGCGG CGGATTATGA AGACGAGGTG GCGCAGGACA AGCGTATTGA CGCCCTGCGC
GAGAAAATCG TGTGTTATGA GGACCCGGCT TTTACCGCCG ACTATCACGA CCCGGAAAAA
CGTGCTATCG GCAATGCGAT CACCGTGGAG TTTACTGATG GCTCACGCTT TGGCGAGGTT
GTCGTGGAGT ATCCGATTGG TCATGCGCGT CGCCGCGCCG GCGGTATTCC GAAGCTTATC
GAAAAATTTA AAATTAACCT GGCGCGTCAG TTCCCGACTC GCCAGCAGCA ACGCATTCTG
GATGTCTCCC TGGACAGATC CCGCCTGGAG CAGATGCCGG TTAACGAATA CCTCGATTTA
TATGTCATCT GA
 
Protein sequence
MSAHISNIRP DFDREIVDIV DYVMNYEITS KVAYDTAHYC LLDTLGCGLE ALEYPACKKL 
LGPIVPGTVV PNGARVPGTQ FQLDPVQAAF NIGAMIRWLD FNDTWLAAEW GHPSDNLGGI
LAIADWLSRN AVAAGKAPLT MKQVLSGMIK AHEIQGCIAL ENAFNRVGLD HVLLVKVAST
AVVAEMLGLT RDEILNAVSL AWVDGQSLRT YRHAPNTGTR KSWAAGDATS RAVRLALMAK
TGEMGYPSAL TAKTWGFYDV SFKGETFRFQ RPYGSYVMEN VLFKISFPAE FHSQTAVEAA
MTLYEQMQAA GKTAADIEKV TIRTHEACLR IIDKKGPLNN PADRDHCIQY MVAVPLLFGR
LTAADYEDEV AQDKRIDALR EKIVCYEDPA FTADYHDPEK RAIGNAITVE FTDGSRFGEV
VVEYPIGHAR RRAGGIPKLI EKFKINLARQ FPTRQQQRIL DVSLDRSRLE QMPVNEYLDL
YVI