Gene SeHA_C4167 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeHA_C4167 
Symbol 
ID6491796 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Heidelberg str. SL476 
KingdomBacteria 
Replicon accessionNC_011083 
Strand
Start bp4050844 
End bp4052034 
Gene Length1191 bp 
Protein Length396 aa 
Translation table11 
GC content54% 
IMG OID642744262 
Productmandelate racemase/muconate lactonizing enzyme family protein 
Protein accessionYP_002047866 
Protein GI194451007 
COG category[M] Cell wall/membrane/envelope biogenesis
[R] General function prediction only 
COG ID[COG4948] L-alanine-DL-glutamate epimerase and related enzymes of enolase superfamily 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones71 
Fosmid unclonability p-value0.585922 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAATTA CCAGCATTGA TATTATTGAT GTTGCTAATG ATTTTGCGTC CGCCACCAGC 
AAATGGCGTC CGGTGGTGGT AAAAATTAAT ACCGACGAGG GCATTTCCGG TTTTGGCGAA
GTCGGGTTGG CCTACGGCGT CGGCGCCTCC GCAGGCATCG GCATGGCAAA AGATTTAGCC
GCCATTATCA TCGGCATGGA CCCGATGAAT AACGAAGCTA TCTGGGAAAA AATGCTCAAA
AAAACCTTCT GGGGGCAGGG CGGCGGCGGC ATTTTTTCCG CGGCGATGAG CGGCATCGAT
ATCGCGCTGT GGGATATCAA AGGCAAAGCG TGGGGCGTGC CGCTGTATAA AATGCTTGGC
GGCAAAAGCC GCGAGAAAAT TAGAACCTAC GCCAGTCAGT TACAGTTTGG TTGGGGGGAC
GGCAGCGATA AAGATATGCT GACCGAGCCG GAGCAGTATG CACAGGCGGC ACTGACCGCC
GTCAGCGAAG GCTATGACGC AATAAAAGTG GATACCGTCG CAATGGATCG CCACGGCAAC
TGGAACCAGC AAAACCTCAA CGGGCCTCTC ACCGATAAAA TCCTGCGTCT GGGCTACGAC
CGTATGGCCG CCATTCGCGA TGCAGTCGGC CCGGATGTGG ATATCATCGC CGAAATGCAT
GCCTTTACGG ATACCACCTC GGCGATTCAG TTTGGCCGCA TGATCGAAGA ACTGGGCATC
TTCTACTACG AAGAGCCGGT CATGCCGTTG AACCCCGCGC AGATGAAGCA GGTTGCCGAT
AAGGTCAATA TTCCACTGGC GGCTGGCGAA CGTATTTACT GGCGCTGGGG ATACCGTCCT
TTCCTGGAAA ACGGCAGCCT GAGCGTTATT CAGCCCGATA TCTGCACCTG CGGCGGCATC
ACCGAAGTGA AGAAAATCTG CGATATGGCG CATGTTTACG ACAAAACGGT GCAAATCCAC
GTTTGCGGCG GGCCAATTTC CACAGCAGTG GCGCTGCATA TGGAAACCGT GATCCCGAAC
TTCGTCATCC ACGAACTGCA CCGGTATGCG CTGCTGGAGC CGAATACACA GACCTGTAAA
TACAACTACC TGCCGAAGAA CGGCATGTAC GAAGTCCCGG AGCTTCCCGG CATCGGCCAG
GAACTGACCG AAGAAACCAT GAAAAAATCA CCAACCATCA CCGTAAAATA A
 
Protein sequence
MKITSIDIID VANDFASATS KWRPVVVKIN TDEGISGFGE VGLAYGVGAS AGIGMAKDLA 
AIIIGMDPMN NEAIWEKMLK KTFWGQGGGG IFSAAMSGID IALWDIKGKA WGVPLYKMLG
GKSREKIRTY ASQLQFGWGD GSDKDMLTEP EQYAQAALTA VSEGYDAIKV DTVAMDRHGN
WNQQNLNGPL TDKILRLGYD RMAAIRDAVG PDVDIIAEMH AFTDTTSAIQ FGRMIEELGI
FYYEEPVMPL NPAQMKQVAD KVNIPLAAGE RIYWRWGYRP FLENGSLSVI QPDICTCGGI
TEVKKICDMA HVYDKTVQIH VCGGPISTAV ALHMETVIPN FVIHELHRYA LLEPNTQTCK
YNYLPKNGMY EVPELPGIGQ ELTEETMKKS PTITVK