Gene SeHA_C4022 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeHA_C4022 
Symbol 
ID6488575 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Heidelberg str. SL476 
KingdomBacteria 
Replicon accessionNC_011083 
Strand
Start bp3904466 
End bp3905662 
Gene Length1197 bp 
Protein Length398 aa 
Translation table11 
GC content54% 
IMG OID642744123 
Productmandelate racemase/muconate lactonizing enzyme 
Protein accessionYP_002047728 
Protein GI194449480 
COG category[M] Cell wall/membrane/envelope biogenesis
[R] General function prediction only 
COG ID[COG4948] L-alanine-DL-glutamate epimerase and related enzymes of enolase superfamily 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones81 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTTTAA GCGCGAATTC CGACGCCGTA ACGTATGCAA AAGCGGCGAA TACCAGAACC 
GCGGCAGAAA CCGGCGATCG TATCGAATGG GTGAAGCTAT CACTGGCTTT TCTACCGCTG
GCGACGCCAG TGAGTGACGC GAAGGTACTG ACCGGTCGCC AGAAACCTTT GACCGAAGTG
GCAATCATCA TTGCCGAAAT CCGCAGTCGC GATGGCTTTG AAGGCGTTGG TTTTAGCTAC
TCCAAACGTG CTGGCGGCCA GGGTATTTAT GCTCACGCCA AAGAGATAGC CGATAATCTA
CTGGGCGAAG ATCCCAATGA TATCGACAAA ATATACACTA AGCTGCTGTG GGCCGGAGCC
TCAGTGGGGC GTAGCGGGAT GGCGGTACAG GCTATCTCCC CTATCGATAT CGCCTTATGG
GATATGAAGG CTAAACGTGC CGGTCTGCCA CTGGCAAAAC TGTTGGGCGC GCACCGCGAC
TCCGTTCAGT GTTACAACAC CTCGGGGGGG TTCTTGCATA CACCGCTCGA TCAAGTGCTG
AAAAATGTGG TGATTTCCCG CGAAAATGGC ATTGGCGGTA TTAAGTTGAA AGTCGGACAA
CCCAACTGCG CTGAGGATAT TCGCCGCTTA ACCGCAGTAC GTGAAGCGCT TGGGGATGAG
TTCCCGTTAA TGGTTGACGC TAACCAGCAG TGGGATCGCG AAACGGCTAT CCGCATGGGG
CGTAAAATGG AACAGTTCAA TCTTATCTGG ATTGAAGAAC CACTAGATGC TTACGACATC
GAAGGCCACG CCCAGCTTGC TGCCGCGCTG GATACGCCTA TCGCCACCGG GGAAATGCTG
ACCAGTTTCC GGGAACACGA GCAGTTGATT CTGGGCAATG CCAGCGATTT CGTTCAGCCA
GATGCGCCGC GTGTCGGCGG TATCTCTCCT TTCCTGAAGA TTATGGATCT GGCGGCGAAA
CACGGGCGTA AGCTGGCGCC GCACTTTGCG ATGGAAGTAC ACCTGCACCT TTCCGCAGCG
TATCCCCTGG AGCCGTGGCT GGAACATTTC GAGTGGCTGA ACCCATTATT CAACGAGCAA
CTTGAGCTGC GCGATGGCCG CATGTGGATT TCCGATCGTC ATGGTCTGGG TTTCACGCTC
AGTGAACAAG CGCGCCGCTG GACACAATTA ACATGTGAAT TTGGCAAACG CCCTTAA
 
Protein sequence
MALSANSDAV TYAKAANTRT AAETGDRIEW VKLSLAFLPL ATPVSDAKVL TGRQKPLTEV 
AIIIAEIRSR DGFEGVGFSY SKRAGGQGIY AHAKEIADNL LGEDPNDIDK IYTKLLWAGA
SVGRSGMAVQ AISPIDIALW DMKAKRAGLP LAKLLGAHRD SVQCYNTSGG FLHTPLDQVL
KNVVISRENG IGGIKLKVGQ PNCAEDIRRL TAVREALGDE FPLMVDANQQ WDRETAIRMG
RKMEQFNLIW IEEPLDAYDI EGHAQLAAAL DTPIATGEML TSFREHEQLI LGNASDFVQP
DAPRVGGISP FLKIMDLAAK HGRKLAPHFA MEVHLHLSAA YPLEPWLEHF EWLNPLFNEQ
LELRDGRMWI SDRHGLGFTL SEQARRWTQL TCEFGKRP