Gene SeHA_C0423 details

Gene Information

Locus tag: SeHA_C0423
Symbol:
ID: 6492348
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Heidelberg str. SL476
Kingdom: Bacteria
Replicon accession: NC_011083
Strand:
Start bp: 421964
End bp: 422842
Gene Length: 879 bp
Protein Length: 292 aa
Translation table: 11
GC content: 50%
IMG OID: 642740695
Product: 5-oxopent-3-ene-1,2,5-tricarboxylate decarboxylase
Protein accession: YP_002044362
Protein GI: 194450752
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG0179] 2-keto-4-pentenoate hydratase/2-oxohepta-3-ene-1,7-dioic acid hydratase (catechol pathway)
TIGRFAM ID:
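
The start, end, gene length and protein length fields above can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, and that the gene length includes the stop codon. Both are assumptions about how this record reports its values, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues in the product, assuming the CDS ends with one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(421964, 422842)
print(length, protein_length(length))  # 879 292
```

Under those assumptions the computed values agree with the Gene Length and Protein Length fields of this record.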


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 47
Fosmid unclonability p-value: 0.000322521
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGAAACTGG TTCAATATCT TGTGAACGGT GGCAAACGCT ACGGCATTAT GCAGGAAATC 
GGAATAATTG ATCTCTCGCA GCGGCTTGGC GACAAATATC CCACTTTGAA ATCTCTCCTG
TGCGCTAATG CGCTGACGGA TGCGGCGTTA TGGTGTGATG AGCCGGCGGA TTATTACTAC
CAGGAAGTCA CTTTTCTGCC GGTGATTGAC GATCCGCAGA AGATCATCTG TGTCGGAATG
AATTATGCCG ATAAGCGTAT TGAGTTTAAT GAAACCAACC CGGCCCCAAC CCTTTTTGTC
CGCTTTGCGG ATTCTCAGAC CGGGCATAAT GGCCTGCTGC TGAAGCCTGA AAATACCAAT
GAGTTCGACT ACGAAGGTGA ATTAGCCGTA GTGATTGGGC GGCGATGCTC CCGGGTCAGC
GCTGAGGATG CTTTAGATTA TGTCGCCGGA TACAGCTGCT ATATGGATGG TTCAGTGAGG
GACTGGCAGC ATAGCTGGTT TACGGCTGGA AAAAACTGGC CTTCGACAGG ATCATTCGGT
CCGTGTCTGG TGACCACAGA CGACATTCCC GATCCCCAGA TGCTACGTTT ACTGACACGA
CTAAACGGGC GGGAGGTGCA GAACGAATCT ACGGCAAATA TGATCCATCC TATCGCTTCA
CTCATTGCTT ATATAAGCAC CTTTACTCTG CTTTCCCCTG GCGACACGAT CCTCACAGGG
TCGCCTGGTG GAGTGGGCAA AAAACGCGTT CCACCGCTGT TTTTACACGA TGGTGATGTT
ATTGAAGTTG AGATTGAACA TATTGGAACC CTGCGCAATG TCGTCCGGGA TAGCCGTTAT
TTAACATCAT CTGTTAGCTG GCATGACGGG AGAAAGTGA
 
Protein sequence
MKLVQYLVNG GKRYGIMQEI GIIDLSQRLG DKYPTLKSLL CANALTDAAL WCDEPADYYY 
QEVTFLPVID DPQKIICVGM NYADKRIEFN ETNPAPTLFV RFADSQTGHN GLLLKPENTN
EFDYEGELAV VIGRRCSRVS AEDALDYVAG YSCYMDGSVR DWQHSWFTAG KNWPSTGSFG
PCLVTTDDIP DPQMLRLLTR LNGREVQNES TANMIHPIAS LIAYISTFTL LSPGDTILTG
SPGGVGKKRV PPLFLHDGDV IEVEIEHIGT LRNVVRDSRY LTSSVSWHDG RK
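
The protein sequence is the conceptual translation of the gene sequence under translation table 11. A minimal sketch of that translation follows. It uses the standard codon assignments, which table 11 shares with the standard code (the two differ in permitted start codons, not in amino acid assignments). The helper name `translate` is illustrative, and the function assumes the input is already in the coding orientation and contains only A, C, G and T.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in this record) is ignored.
    Any character other than A, C, G or T raises KeyError.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the product
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGAAACTGG TTCAATATCT TGTGAACGGT"))  # MKLVQYLVNG
```

Passing the full gene sequence to `translate` and comparing the result with the listed protein (after removing the spaces between 10-residue groups) is one way to confirm that the two blocks correspond.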