Gene SeD_A4225 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeD_A4225 
Symbol 
ID6871654 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Dublin str. CT_02021853 
KingdomBacteria 
Replicon accessionNC_011205 
Strand
Start bp4069681 
End bp4070871 
Gene Length1191 bp 
Protein Length396 aa 
Translation table11 
GC content54% 
IMG OID642787158 
Productmandelate racemase/muconate lactonizing enzyme family protein 
Protein accessionYP_002217784 
Protein GI198243038 
COG category[M] Cell wall/membrane/envelope biogenesis
[R] General function prediction only 
COG ID[COG4948] L-alanine-DL-glutamate epimerase and related enzymes of enolase superfamily 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones72 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAATTA CCAGCGTTGA TATTATTGAT GTGGCGAACG ATTTTGCGTC CGCCACCAGC 
AAATGGCGTC CGGTGGTGGT AAAAATTAAT ACCGATGAGG GCATTTCCGG TTTTGGCGAA
GTCGGGTTGG CCTACGGCGT CGGTGCCTCC GCAGGCATCG GCATGGCAAA AGATTTAGCC
GCCATTATCA TCGGCATGGA CCCGATGAAT AACGAAGCTA TCTGGGAAAA GATGCTCAAA
AAAACCTTCT GGGGGCAGGG CGGCGGCGGC ATCTTTTCCG CTGCGATGAG CGGCATCGAT
ATCGCGCTGT GGGATATCAA AGGCAAAGCG TGGGGCGTGC CGCTGTATAA AATGCTTGGC
GGCAAAAGCC GCGAGAAAAT AAGAACCTAC GCCAGTCAGC TACAGTTTGG TTGGGGGGAC
GGCAGCGATA AAGATATGCT GACCGAGCCG GAGCAGTATG CACAGGCGGC ACTGACCGCC
GTCAGCGAAG GCTATGACGC AATAAAAGTG GATACCGTCG CAATGGATCG CCACGGCAAC
TGGAACCAGC AAAACCTCAA CGGGCCTCTC ACCGATAAAA TCCTGCGTCT GGGCTACGAC
CGTATGGCCG CCATTCGCGA TGCAGTCGGC CAGGATGTGG ATATCATCGC CGAAATGCAT
GCCTTTACGG ATACCACCTC GGCGATTCAG TTTGGCCGCA TGATCGAAGA ACTGGGCGTC
TTCTACTACG AAGAGCCGGT CATGCCGTTG AACCCCGCGC AGATGAAGCA GGTTGCCGAT
AAGGTCAATA TTCCACTGGC GGCTGGCGAA CGTATTTACT GGCGCTGGGG ATACCGTCCT
TTCCTGGAAA ACGGCAGCCT GAGCGTTATT CAGCCCGATA TCTGCACCTG CGGCGGCATC
ACCGAAGTGA AGAAAATCTG CGATATGGCG CATGTTTACG ACAAAACGGT GCAAATCCAC
GTTTGCGGCG GGCCAATTTC CACAGCAGTG GCGCTGCATA TGGAAACCGT GATCCCGAAC
TTCGTCATCC ACGAACTGCA CCGGTATGCG CTGCTGGAGC CGAATACACA GACCTGTAAA
TACAACTACC TGCCGAAGAA CGGCATGTAC GAAGTCCCGG AGCTTCCCGG CATCGGCCAG
GAACTGACCG AAGAAACCAT GAAAAAATCA CCAACCATCA CCGTAAAATA A
 
Protein sequence
MKITSVDIID VANDFASATS KWRPVVVKIN TDEGISGFGE VGLAYGVGAS AGIGMAKDLA 
AIIIGMDPMN NEAIWEKMLK KTFWGQGGGG IFSAAMSGID IALWDIKGKA WGVPLYKMLG
GKSREKIRTY ASQLQFGWGD GSDKDMLTEP EQYAQAALTA VSEGYDAIKV DTVAMDRHGN
WNQQNLNGPL TDKILRLGYD RMAAIRDAVG QDVDIIAEMH AFTDTTSAIQ FGRMIEELGV
FYYEEPVMPL NPAQMKQVAD KVNIPLAAGE RIYWRWGYRP FLENGSLSVI QPDICTCGGI
TEVKKICDMA HVYDKTVQIH VCGGPISTAV ALHMETVIPN FVIHELHRYA LLEPNTQTCK
YNYLPKNGMY EVPELPGIGQ ELTEETMKKS PTITVK