Gene SeHA_C3551 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeHA_C3551 
SymbolgatY 
ID6487616 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Heidelberg str. SL476 
KingdomBacteria 
Replicon accessionNC_011083 
Strand
Start bp3442387 
End bp3443379 
Gene Length993 bp 
Protein Length330 aa 
Translation table11 
GC content50% 
IMG OID642743674 
Producttagatose-bisphosphate aldolase 
Protein accessionYP_002047288 
Protein GI194449436 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0191] Fructose/tagatose bisphosphate aldolase 
TIGRFAM ID[TIGR00167] ketose-bisphosphate aldolases
[TIGR01858] class II aldolase, tagatose bisphosphate family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.207369 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones84 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTTTTTT TACGCCTGAC TTTCGATAAC TTCCATTTTA TTTCGAAACT TTCATTTATC 
AGATCAAAAT CACAAAAAAA CATTCCGAAA GCCTATAAAT TCTTTCAAAG AAAACAAACG
AAAGATTTCG GAGGAAGGAT GTTTATCATT TCAGGCAGAA CAATGCTAAA GAAGGCGCAG
CAGGAAGGTT ATGCTGTGCC GGCGTTTAAC ATCCACAACC TGGAGACATT GCAAGTGGTG
GTGGAAACCG CCGCAGAATT GCGCTCTCCG CTGATTGTCG CCGGTACGCC AGGCACCTTT
AGCTACGCAG GCGTCGGTAA TATCGTGGCC ATCGCCGCAG AACTGGCGAA AAGCTGGAAC
CATCCTCTCG CGGTACATCT CGATCATCAT GAAAAACTGG CCGACATCAA AATGAAAGTC
GCCGCCGGGG TACGCTCGGT CATGATCGAC GGGTCGCATT TCCCCTTTGC CGACAATATT
GCGCTGGTGA AAAGTGTGGT TGATTACTGT CATCGCTACG ATGTCAGCGT TGAAGCTGAA
CTTGGGCGTC TCGGCGGGCA GGAAGACGAT CTTATCGTTG ACGGTAAAGA TGCGCTTTAT
ACCCATCCGG AACAGGCCCG GGAATTTGTA GAAAAAACGG GTATCGACTC GTTAGCCATT
GCTATCGGCA CCGCTCACGG CCTCTACACC GCTGAACCAA AACTTGATTT TGAACGACTG
ACGGAAATTC GTCAGCGGGT TGATGTCCCC TTAGTCCTTC ACGGCGCCTC TGGCCTGCCG
ACCCGCGATA TTACCCGCGC TATTTCGCTG GGCATCTGCA AAGTTAACGT CGCGACCGAG
CTTAAAATCG CCTTTTCCGG CGCGCTTAAA AACTATTTAA CGCAACACGC AGAGGCCAGC
GATCCCCGCC ATTACATGAT CCCGGCGAAA GCGGCCATGA AAGAGGTTGT ACGTAAAGTG
ATTGCCGACT GCGGTTGTGA AGGGAAGCTC TAA
 
Protein sequence
MLFLRLTFDN FHFISKLSFI RSKSQKNIPK AYKFFQRKQT KDFGGRMFII SGRTMLKKAQ 
QEGYAVPAFN IHNLETLQVV VETAAELRSP LIVAGTPGTF SYAGVGNIVA IAAELAKSWN
HPLAVHLDHH EKLADIKMKV AAGVRSVMID GSHFPFADNI ALVKSVVDYC HRYDVSVEAE
LGRLGGQEDD LIVDGKDALY THPEQAREFV EKTGIDSLAI AIGTAHGLYT AEPKLDFERL
TEIRQRVDVP LVLHGASGLP TRDITRAISL GICKVNVATE LKIAFSGALK NYLTQHAEAS
DPRHYMIPAK AAMKEVVRKV IADCGCEGKL