Gene SNSL254_A4118 details


Gene Information

Locus tag: SNSL254_A4118
Symbol:
ID: 6485659
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Newport str. SL254
Kingdom: Bacteria
Replicon accession: NC_011080
Strand:
Start bp: 4007218
End bp: 4008408
Gene Length: 1191 bp
Protein Length: 396 aa
Translation table: 11
GC content: 55%
IMG OID: 642739374
Product: mandelate racemase/muconate lactonizing enzyme family protein
Protein accession: YP_002043083
Protein GI: 194446418
COG category: [M] Cell wall/membrane/envelope biogenesis; [R] General function prediction only
COG ID: [COG4948] L-alanine-DL-glutamate epimerase and related enzymes of enolase superfamily
TIGRFAM ID:
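
The start, end, and length fields above are consistent with 1-based, inclusive coordinates. A minimal sketch of that arithmetic follows. The function names are illustrative and not part of any database tool, and the sketch assumes an unspliced CDS whose final codon is a stop codon that is not counted in the protein length.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length_bp // 3 - 1


length = gene_length(4007218, 4008408)
print(length, protein_length(length))  # 1191 396
```

Both values match the Gene Length and Protein Length fields of this record.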


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.128808
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 85
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAATTA CCAGCGTTGA TATTATTGAT GTGGCGAACG ATTTTGCGTC CGCCACCAGC 
AAATGGCGTC CGGTGGTGGT AAAAATTAAT ACCGACGAGG GCATTTCCGG TTTTGGCGAA
GTGGGGCTGG CTTACGGTGT CGGCGCCTCC GCAGGCATCG GCATGGCAAA AGATTTAGCC
GCCATTATCA TCGGCATGGA CCCGATGAAT AACGAAGCTA TCTGGGAAAA GATGCTCAAA
AAAACCTTCT GGGGGCAGGG CGGCGGCGGC ATCTTTTCCG CTGCGATGAG CGGTATTGAT
ATCGCGCTGT GGGATATCAA AGGTAAAGCG TGGAGCGTGC CGCTGTATAA GATGCTCGGC
GGTAAAAGCC GCGAGAAAAT CAGAACCTAC GCCAGTCAGC TACAGTTTGG TTGGGGGGAC
GGCAGCGATA AAGATATGCT GACCGAGCCG GAACAGTATG CGCAGGCCGC GCTGACCGCC
GTCAGCGAAG GCTATGACGC GATAAAAGTG GATACCGTCG CAATGGATCG CCACGGCAAC
TGGAACAAAC AAAACCTCAA CGGGCCGCTC ACCGACAAAA TCCTGCGTCT GGGCTACGAC
CGTATGGCCG CCATTCGCGA TGCGGTCGGC CCGGACGTGG ATATCATCGC CGAAATGCAT
GCCTTTACCG ACACCACCTC GGCGATTCAG TTTGGCCGCA TGATAGAAGA GCTGGGCATC
TTCTACTACG AGGAACCGGT TATGCCGCTG AACTCTGGGC AGATGAAGCA GGTTGCCGAT
AAGGTCAATA TTCCGCTGGC AGCGGGTGAA CGTATCTACT GGCGCTGGGG ATACCGTCCT
TTCCTGGAGA ACGGCAGCCT GAGCGTGATT CAGCCCGATA TCTGCACCTG CGGCGGCATC
ACCGAAGTGA AGAAAATCTG CGATATGGCG CATGTTTACG ACAAAACGGT GCAAATCCAC
GTTTGCGGCG GGCCGATTTC CACGGCGGTG GCGCTGCATA TGGAAACCGC GATCCCTAAC
TTCGTCATCC ACGAACTGCA CCGGTATGCG CTGCTGGAGC CGAATACACA GACCTGTAAA
TACAACTACC TGCCGAAGAA CGGCATGTAC GACGTCCCGG AGCTTCCCGG CATCGGCCAG
GAACTGACCG AAGAAACCAT GAAAAAATCA CCAACCATCA CCGTAAAATA A
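
The gene sequence is printed in blocks of 10 bases, so the spaces and line breaks need to be removed before computing its length or GC content. The sketch below shows one way to do that, using only the first line of the sequence as input. The helper names are illustrative assumptions, not part of any database tool.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(blocked.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence, copied as displayed (blocks of 10 bases).
first_line = "ATGAAAATTA CCAGCGTTGA TATTATTGAT GTGGCGAACG ATTTTGCGTC CGCCACCAGC"
seq = clean_sequence(first_line)
print(len(seq), round(gc_content(seq), 1))
```

Running the same two functions over the full 1191 bp sequence should give a value comparable to the GC content field, subject to rounding.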
 
Protein sequence
MKITSVDIID VANDFASATS KWRPVVVKIN TDEGISGFGE VGLAYGVGAS AGIGMAKDLA 
AIIIGMDPMN NEAIWEKMLK KTFWGQGGGG IFSAAMSGID IALWDIKGKA WSVPLYKMLG
GKSREKIRTY ASQLQFGWGD GSDKDMLTEP EQYAQAALTA VSEGYDAIKV DTVAMDRHGN
WNKQNLNGPL TDKILRLGYD RMAAIRDAVG PDVDIIAEMH AFTDTTSAIQ FGRMIEELGI
FYYEEPVMPL NSGQMKQVAD KVNIPLAAGE RIYWRWGYRP FLENGSLSVI QPDICTCGGI
TEVKKICDMA HVYDKTVQIH VCGGPISTAV ALHMETAIPN FVIHELHRYA LLEPNTQTCK
YNYLPKNGMY DVPELPGIGQ ELTEETMKKS PTITVK
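
The protein sequence corresponds to the translation of the gene sequence under translation table 11. That table uses the same codon-to-amino-acid assignments as the standard code and differs in which codons may act as start codons. The sketch below translates the first 30 bases of the gene as a check. The names are illustrative, and alternative start codons are not handled because this gene begins with ATG.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (third base varies fastest).
# Translation table 11 shares these assignments with the standard code.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS, stopping at the first stop codon."""
    seq = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        # Codons with ambiguous bases are reported as 'X'.
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGAAAATTA CCAGCGTTGA TATTATTGAT"))  # MKITSVDIID
```

The output matches the first ten residues of the protein sequence shown above.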