Gene Information |
Locus tag | SNSL254_A4118 |
Symbol | |
ID | 6485659 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Newport str. SL254 |
Kingdom | Bacteria |
Replicon accession | NC_011080 |
Strand | + |
Start bp | 4007218 |
End bp | 4008408 |
Gene Length | 1191 bp |
Protein Length | 396 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 642739374 |
Product | mandelate racemase/muconate lactonizing enzyme family protein |
Protein accession | YP_002043083 |
Protein GI | 194446418 |
COG category | [M] Cell wall/membrane/envelope biogenesis; [R] General function prediction only |
COG ID | [COG4948] L-alanine-DL-glutamate epimerase and related enzymes of enolase superfamily |
TIGRFAM ID | |
|
|
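The coordinate and length fields in the table above are consistent with each other. The sketch below checks that arithmetic. It assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are illustrative only and do not come from the database.

```python
# Values copied from the Gene Information table.
START_BP = 4007218
END_BP = 4008408


def gene_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the final codon is a stop."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # prints: 1191 396
```

Both results match the Gene Length (1191 bp) and Protein Length (396 aa) fields listed in the record.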
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.128808 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 85 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAATTA CCAGCGTTGA TATTATTGAT GTGGCGAACG ATTTTGCGTC CGCCACCAGC AAATGGCGTC CGGTGGTGGT AAAAATTAAT ACCGACGAGG GCATTTCCGG TTTTGGCGAA GTGGGGCTGG CTTACGGTGT CGGCGCCTCC GCAGGCATCG GCATGGCAAA AGATTTAGCC GCCATTATCA TCGGCATGGA CCCGATGAAT AACGAAGCTA TCTGGGAAAA GATGCTCAAA AAAACCTTCT GGGGGCAGGG CGGCGGCGGC ATCTTTTCCG CTGCGATGAG CGGTATTGAT ATCGCGCTGT GGGATATCAA AGGTAAAGCG TGGAGCGTGC CGCTGTATAA GATGCTCGGC GGTAAAAGCC GCGAGAAAAT CAGAACCTAC GCCAGTCAGC TACAGTTTGG TTGGGGGGAC GGCAGCGATA AAGATATGCT GACCGAGCCG GAACAGTATG CGCAGGCCGC GCTGACCGCC GTCAGCGAAG GCTATGACGC GATAAAAGTG GATACCGTCG CAATGGATCG CCACGGCAAC TGGAACAAAC AAAACCTCAA CGGGCCGCTC ACCGACAAAA TCCTGCGTCT GGGCTACGAC CGTATGGCCG CCATTCGCGA TGCGGTCGGC CCGGACGTGG ATATCATCGC CGAAATGCAT GCCTTTACCG ACACCACCTC GGCGATTCAG TTTGGCCGCA TGATAGAAGA GCTGGGCATC TTCTACTACG AGGAACCGGT TATGCCGCTG AACTCTGGGC AGATGAAGCA GGTTGCCGAT AAGGTCAATA TTCCGCTGGC AGCGGGTGAA CGTATCTACT GGCGCTGGGG ATACCGTCCT TTCCTGGAGA ACGGCAGCCT GAGCGTGATT CAGCCCGATA TCTGCACCTG CGGCGGCATC ACCGAAGTGA AGAAAATCTG CGATATGGCG CATGTTTACG ACAAAACGGT GCAAATCCAC GTTTGCGGCG GGCCGATTTC CACGGCGGTG GCGCTGCATA TGGAAACCGC GATCCCTAAC TTCGTCATCC ACGAACTGCA CCGGTATGCG CTGCTGGAGC CGAATACACA GACCTGTAAA TACAACTACC TGCCGAAGAA CGGCATGTAC GACGTCCCGG AGCTTCCCGG CATCGGCCAG GAACTGACCG AAGAAACCAT GAAAAAATCA CCAACCATCA CCGTAAAATA A
|
Protein sequence | MKITSVDIID VANDFASATS KWRPVVVKIN TDEGISGFGE VGLAYGVGAS AGIGMAKDLA AIIIGMDPMN NEAIWEKMLK KTFWGQGGGG IFSAAMSGID IALWDIKGKA WSVPLYKMLG GKSREKIRTY ASQLQFGWGD GSDKDMLTEP EQYAQAALTA VSEGYDAIKV DTVAMDRHGN WNKQNLNGPL TDKILRLGYD RMAAIRDAVG PDVDIIAEMH AFTDTTSAIQ FGRMIEELGI FYYEEPVMPL NSGQMKQVAD KVNIPLAAGE RIYWRWGYRP FLENGSLSVI QPDICTCGGI TEVKKICDMA HVYDKTVQIH VCGGPISTAV ALHMETAIPN FVIHELHRYA LLEPNTQTCK YNYLPKNGMY DVPELPGIGQ ELTEETMKKS PTITVK
|
| |
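The relationship between the gene sequence and the protein sequence can be illustrated with a minimal sketch. It translates the first 30 bases of the gene sequence listed above and computes the GC fraction of a sequence. The names `clean`, `gc_content`, and `translate` are hypothetical helpers, not part of any database tool. The codon table is built from the standard 64-codon amino acid string; translation table 11 assigns the same amino acids as the standard table and differs only in the permitted start codons, which this sketch does not model.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (standard assignments, shared by table 11).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces that group the sequence into blocks of ten and uppercase it."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame 1 and stop at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence from the record.
prefix = "ATGAAAATTA CCAGCGTTGA TATTATTGAT"
print(translate(prefix))  # prints: MKITSVDIID
```

The translated prefix matches the first block of the protein sequence in the record. Passing the full gene sequence to `gc_content` would give a value to compare against the GC content field.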