Gene Information |
Locus tag | SeHA_C4167 |
Symbol | |
ID | 6491796 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Heidelberg str. SL476 |
Kingdom | Bacteria |
Replicon accession | NC_011083 |
Strand | + |
Start bp | 4050844 |
End bp | 4052034 |
Gene Length | 1191 bp |
Protein Length | 396 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 642744262 |
Product | mandelate racemase/muconate lactonizing enzyme family protein |
Protein accession | YP_002047866 |
Protein GI | 194451007 |
COG category | [M] Cell wall/membrane/envelope biogenesis [R] General function prediction only |
COG ID | [COG4948] L-alanine-DL-glutamate epimerase and related enzymes of enolase superfamily |
TIGRFAM ID | |
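The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, which the record does not state but which matches the reported gene length. It also assumes the protein length excludes the stop codon. The function names are illustrative and are not part of any database API.

```python
# Values copied from the Gene Information table above.
START_BP = 4050844
END_BP = 4052034
GENE_LENGTH_BP = 1191
PROTEIN_LENGTH_AA = 396


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_length_aa(gene_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, not counting the stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons should hold if the record is internally consistent.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(coding_length_aa(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions the record is self-consistent: the span covers 1191 bp, which is 397 codons, or 396 residues plus one stop codon.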
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 71 |
Fosmid unclonability p-value | 0.585922 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAATTA CCAGCATTGA TATTATTGAT GTTGCTAATG ATTTTGCGTC CGCCACCAGC AAATGGCGTC CGGTGGTGGT AAAAATTAAT ACCGACGAGG GCATTTCCGG TTTTGGCGAA GTCGGGTTGG CCTACGGCGT CGGCGCCTCC GCAGGCATCG GCATGGCAAA AGATTTAGCC GCCATTATCA TCGGCATGGA CCCGATGAAT AACGAAGCTA TCTGGGAAAA AATGCTCAAA AAAACCTTCT GGGGGCAGGG CGGCGGCGGC ATTTTTTCCG CGGCGATGAG CGGCATCGAT ATCGCGCTGT GGGATATCAA AGGCAAAGCG TGGGGCGTGC CGCTGTATAA AATGCTTGGC GGCAAAAGCC GCGAGAAAAT TAGAACCTAC GCCAGTCAGT TACAGTTTGG TTGGGGGGAC GGCAGCGATA AAGATATGCT GACCGAGCCG GAGCAGTATG CACAGGCGGC ACTGACCGCC GTCAGCGAAG GCTATGACGC AATAAAAGTG GATACCGTCG CAATGGATCG CCACGGCAAC TGGAACCAGC AAAACCTCAA CGGGCCTCTC ACCGATAAAA TCCTGCGTCT GGGCTACGAC CGTATGGCCG CCATTCGCGA TGCAGTCGGC CCGGATGTGG ATATCATCGC CGAAATGCAT GCCTTTACGG ATACCACCTC GGCGATTCAG TTTGGCCGCA TGATCGAAGA ACTGGGCATC TTCTACTACG AAGAGCCGGT CATGCCGTTG AACCCCGCGC AGATGAAGCA GGTTGCCGAT AAGGTCAATA TTCCACTGGC GGCTGGCGAA CGTATTTACT GGCGCTGGGG ATACCGTCCT TTCCTGGAAA ACGGCAGCCT GAGCGTTATT CAGCCCGATA TCTGCACCTG CGGCGGCATC ACCGAAGTGA AGAAAATCTG CGATATGGCG CATGTTTACG ACAAAACGGT GCAAATCCAC GTTTGCGGCG GGCCAATTTC CACAGCAGTG GCGCTGCATA TGGAAACCGT GATCCCGAAC TTCGTCATCC ACGAACTGCA CCGGTATGCG CTGCTGGAGC CGAATACACA GACCTGTAAA TACAACTACC TGCCGAAGAA CGGCATGTAC GAAGTCCCGG AGCTTCCCGG CATCGGCCAG GAACTGACCG AAGAAACCAT GAAAAAATCA CCAACCATCA CCGTAAAATA A
|
Protein sequence | MKITSIDIID VANDFASATS KWRPVVVKIN TDEGISGFGE VGLAYGVGAS AGIGMAKDLA AIIIGMDPMN NEAIWEKMLK KTFWGQGGGG IFSAAMSGID IALWDIKGKA WGVPLYKMLG GKSREKIRTY ASQLQFGWGD GSDKDMLTEP EQYAQAALTA VSEGYDAIKV DTVAMDRHGN WNQQNLNGPL TDKILRLGYD RMAAIRDAVG PDVDIIAEMH AFTDTTSAIQ FGRMIEELGI FYYEEPVMPL NPAQMKQVAD KVNIPLAAGE RIYWRWGYRP FLENGSLSVI QPDICTCGGI TEVKKICDMA HVYDKTVQIH VCGGPISTAV ALHMETVIPN FVIHELHRYA LLEPNTQTCK YNYLPKNGMY EVPELPGIGQ ELTEETMKKS PTITVK
|
| |
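The sequences above are shown in space-separated blocks of ten characters. The following is a minimal sketch of how such a block-formatted gene sequence might be cleaned, summarized for GC content, and translated. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code and differs only in the permitted start codons. This gene begins with ATG, so start codons are not treated specially here. All function names are illustrative. The sketch is not the database's own pipeline.

```python
from itertools import product

# Codon order follows the NCBI convention: bases in T, C, A, G order,
# with the third position varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces between ten-character blocks and uppercase."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean(seq)
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon.

    Codons containing characters other than T, C, A, G become 'X'.
    """
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


if __name__ == "__main__":
    # First three blocks of the gene sequence from the record above.
    print(translate("ATGAAAATTA CCAGCATTGA TATTATTGAT"))
```

Passing the full Gene sequence field to `translate` and comparing the result with the cleaned Protein sequence field is one way to check that the two fields agree. `gc_fraction` can be compared with the GC content field in the same way.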