Gene Information |
Locus tag | SeD_A4225 |
Symbol | |
ID | 6871654 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853 |
Kingdom | Bacteria |
Replicon accession | NC_011205 |
Strand | + |
Start bp | 4069681 |
End bp | 4070871 |
Gene Length | 1191 bp |
Protein Length | 396 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 642787158 |
Product | mandelate racemase/muconate lactonizing enzyme family protein |
Protein accession | YP_002217784 |
Protein GI | 198243038 |
COG category | [M] Cell wall/membrane/envelope biogenesis; [R] General function prediction only |
COG ID | [COG4948] L-alanine-DL-glutamate epimerase and related enzymes of enolase superfamily |
TIGRFAM ID | |
|
|
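The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function name `check_cds_lengths` is hypothetical and not part of any database API.

```python
def check_cds_lengths(start_bp, end_bp, gene_length, protein_length):
    """Return True if the coordinates, gene length and protein length agree.

    Assumes 1-based inclusive coordinates and one terminal stop codon.
    """
    span = end_bp - start_bp + 1          # inclusive span in base pairs
    if span != gene_length:
        return False
    if span % 3 != 0:                     # an unspliced CDS is a whole number of codons
        return False
    codons = span // 3
    return codons - 1 == protein_length   # subtract the stop codon


# Values taken from the record above.
print(check_cds_lengths(4069681, 4070871, 1191, 396))
```

With the values in this record, the span is 1191 bp, which is 397 codons, or 396 amino acids plus one stop codon.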
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 72 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAATTA CCAGCGTTGA TATTATTGAT GTGGCGAACG ATTTTGCGTC CGCCACCAGC AAATGGCGTC CGGTGGTGGT AAAAATTAAT ACCGATGAGG GCATTTCCGG TTTTGGCGAA GTCGGGTTGG CCTACGGCGT CGGTGCCTCC GCAGGCATCG GCATGGCAAA AGATTTAGCC GCCATTATCA TCGGCATGGA CCCGATGAAT AACGAAGCTA TCTGGGAAAA GATGCTCAAA AAAACCTTCT GGGGGCAGGG CGGCGGCGGC ATCTTTTCCG CTGCGATGAG CGGCATCGAT ATCGCGCTGT GGGATATCAA AGGCAAAGCG TGGGGCGTGC CGCTGTATAA AATGCTTGGC GGCAAAAGCC GCGAGAAAAT AAGAACCTAC GCCAGTCAGC TACAGTTTGG TTGGGGGGAC GGCAGCGATA AAGATATGCT GACCGAGCCG GAGCAGTATG CACAGGCGGC ACTGACCGCC GTCAGCGAAG GCTATGACGC AATAAAAGTG GATACCGTCG CAATGGATCG CCACGGCAAC TGGAACCAGC AAAACCTCAA CGGGCCTCTC ACCGATAAAA TCCTGCGTCT GGGCTACGAC CGTATGGCCG CCATTCGCGA TGCAGTCGGC CAGGATGTGG ATATCATCGC CGAAATGCAT GCCTTTACGG ATACCACCTC GGCGATTCAG TTTGGCCGCA TGATCGAAGA ACTGGGCGTC TTCTACTACG AAGAGCCGGT CATGCCGTTG AACCCCGCGC AGATGAAGCA GGTTGCCGAT AAGGTCAATA TTCCACTGGC GGCTGGCGAA CGTATTTACT GGCGCTGGGG ATACCGTCCT TTCCTGGAAA ACGGCAGCCT GAGCGTTATT CAGCCCGATA TCTGCACCTG CGGCGGCATC ACCGAAGTGA AGAAAATCTG CGATATGGCG CATGTTTACG ACAAAACGGT GCAAATCCAC GTTTGCGGCG GGCCAATTTC CACAGCAGTG GCGCTGCATA TGGAAACCGT GATCCCGAAC TTCGTCATCC ACGAACTGCA CCGGTATGCG CTGCTGGAGC CGAATACACA GACCTGTAAA TACAACTACC TGCCGAAGAA CGGCATGTAC GAAGTCCCGG AGCTTCCCGG CATCGGCCAG GAACTGACCG AAGAAACCAT GAAAAAATCA CCAACCATCA CCGTAAAATA A
|
Protein sequence | MKITSVDIID VANDFASATS KWRPVVVKIN TDEGISGFGE VGLAYGVGAS AGIGMAKDLA AIIIGMDPMN NEAIWEKMLK KTFWGQGGGG IFSAAMSGID IALWDIKGKA WGVPLYKMLG GKSREKIRTY ASQLQFGWGD GSDKDMLTEP EQYAQAALTA VSEGYDAIKV DTVAMDRHGN WNQQNLNGPL TDKILRLGYD RMAAIRDAVG QDVDIIAEMH AFTDTTSAIQ FGRMIEELGV FYYEEPVMPL NPAQMKQVAD KVNIPLAAGE RIYWRWGYRP FLENGSLSVI QPDICTCGGI TEVKKICDMA HVYDKTVQIH VCGGPISTAV ALHMETVIPN FVIHELHRYA LLEPNTQTCK YNYLPKNGMY EVPELPGIGQ ELTEETMKKS PTITVK
|
| |
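The sequences above are displayed in space-separated blocks of ten characters, so they need to be joined before any computation. The following is a minimal sketch of that clean-up and of a GC fraction calculation. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the sketch assumes the sequence contains only unambiguous bases.

```python
def clean_sequence(raw):
    """Remove the display whitespace and upper-case the sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(raw):
    """Fraction of G and C bases in a nucleotide sequence (0.0 to 1.0)."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First two display blocks of the gene sequence above, as a small example.
fragment = "ATGAAAATTA CCAGCGTTGA"
print(clean_sequence(fragment))
print(gc_fraction(fragment))
```

Passing the full gene sequence string to `gc_fraction` gives a value that can be compared with the GC content field of the record.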