Gene SNSL254_A0410 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSNSL254_A0410 
SymbolprpD 
ID6485768 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Newport str. SL254 
KingdomBacteria 
Replicon accessionNC_011080 
Strand
Start bp424654 
End bp426105 
Gene Length1452 bp 
Protein Length483 aa 
Translation table11 
GC content57% 
IMG OID642735834 
Product2-methylcitrate dehydratase 
Protein accessionYP_002039608 
Protein GI194445361 
COG category[R] General function prediction only 
COG ID[COG2079] Uncharacterized protein involved in propionate catabolism 
TIGRFAM ID[TIGR02330] 2-methylcitrate dehydratase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones36 
Fosmid unclonability p-value0.0000282165 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGTCTACCC AAGAACTGAA CATCCGCCCG GAATTTGACC GCGAAATCGT CGATATCGTG 
GATTACGTGA TGAACTACGA CATCACCTCA AAGGTGGCGT ACGACACCGC GCATTATTGC
CTGCTCGACA CGCTTGGCTG TGGTCTGGAA GCGCTGGAAT ACCCGGCCTG TAAAAAATTG
CTTGGGCCGA TCGTGCCAGG CACGGTGGTG CCCAACGGCG CACGCGTGCC GGGCACCCAG
TTTCAGCTCG ATCCGGTACA GGCAGCCTTT AACATTGGCG CGATGATCCG CTGGCTCGAT
TTTAACGATA CCTGGCTTGC CGCCGAGTGG GGCCATCCTT CTGATAACCT CGGCGGTATT
CTGGCGATTG CGGACTGGCT GTCACGCAAC GCCGTCGCCG CCGGCAAAGC GCCGCTGACC
ATGAAACAGG TATTGAGCGG GATGATCAAA GCCCATGAAA TTCAGGGTTG CATCGCGCTG
GAAAACGCCT TCAACCGTGT CGGGCTTGAC CATGTGCTGC TGGTGAAAGT GGCCTCGACT
GCGGTGGTCG CTGAAATGCT GGGGCTGACG CGCGATGAGA TCCTTAACGC GGTATCGTTG
GCGTGGGTGG ATGGGCAGTC GTTGCGCACT TATCGTCATG CGCCGAATAC CGGTACGCGC
AAATCCTGGG CGGCGGGCGA TGCGACTTCG CGCGCGGTAC GTCTGGCGCT GATGGCGAAA
ACCGGCGAGA TGGGTTATCC CTCGGCGCTC ACCGCCAAAA CCTGGGGCTT CTACGACGTT
TCATTCAAAG GTGAAACGTT CCGTTTCCAG CGTCCTTACG GCTCCTACGT GATGGAAAAC
GTGCTATTCA AAATTTCTTT CCCGGCAGAA TTCCACTCGC AAACCGCCGT CGAAGCGGCG
ATGACGCTGT ATGAGCAGAT GCAGGCCGCG GGTAAAACGG CAGCGGATAT CGAGAAAGTG
ACCATCCGCA CCCACGAAGC CTGTCTCCGC ATTATCGATA AAAAAGGCCC GCTCAATAAC
CCGGCGGACC GCGATCACTG TATCCAGTAT ATGGTCGCCG TGCCGCTGCT GTTCGGGCGG
TTAACCGCGG CGGATTATGA AGACGAGGTG GCGCAGGACA AGCGTATTGA CGCCCTGCGC
GAGAAGATCG TGTGTTATGA GGACCCGGCT TTTACCGCCG ACTATCACGA CCCGGAAAAA
CGTGCTATCG GCAATGCGAT CACCGTGGAG TTTACTGATG GCTCACGCTT TGGCGAGGTT
GTCGTGGAGT ATCCGATTGG TCATGCGCGT CGCCGCGCCG ACGGTATTCC GAAGCTTATC
GAAAAATTTA AAATTAACCT GGCGCGTCAG TTCCCGACTC GCCAGCAGCA ACGCATTCTG
GATGTCTCCC TGGACAGAGC CCGCCTGGAG CAGATGCCGG TTAACGAATA CCTCGATTTA
TATGTCATCT GA
 
Protein sequence
MSTQELNIRP EFDREIVDIV DYVMNYDITS KVAYDTAHYC LLDTLGCGLE ALEYPACKKL 
LGPIVPGTVV PNGARVPGTQ FQLDPVQAAF NIGAMIRWLD FNDTWLAAEW GHPSDNLGGI
LAIADWLSRN AVAAGKAPLT MKQVLSGMIK AHEIQGCIAL ENAFNRVGLD HVLLVKVAST
AVVAEMLGLT RDEILNAVSL AWVDGQSLRT YRHAPNTGTR KSWAAGDATS RAVRLALMAK
TGEMGYPSAL TAKTWGFYDV SFKGETFRFQ RPYGSYVMEN VLFKISFPAE FHSQTAVEAA
MTLYEQMQAA GKTAADIEKV TIRTHEACLR IIDKKGPLNN PADRDHCIQY MVAVPLLFGR
LTAADYEDEV AQDKRIDALR EKIVCYEDPA FTADYHDPEK RAIGNAITVE FTDGSRFGEV
VVEYPIGHAR RRADGIPKLI EKFKINLARQ FPTRQQQRIL DVSLDRARLE QMPVNEYLDL
YVI