Gene ECH74115_4454 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_4454 
SymbolkbaY 
ID6971855 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp4127766 
End bp4128626 
Gene Length861 bp 
Protein Length286 aa 
Translation table11 
GC content52% 
IMG OID643388174 
Producttagatose-bisphosphate aldolase 
Protein accessionYP_002272611 
Protein GI209399873 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0191] Fructose/tagatose bisphosphate aldolase 
TIGRFAM ID[TIGR00167] ketose-bisphosphate aldolases
[TIGR01858] class II aldolase, tagatose bisphosphate family 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.389752 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones62 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCATTA TCTCCACCAA ATATCTGTTA CAGGACGCCC AGGCCAATGG CTACGCGGTG 
CCTGCTTTTA ACATTCATAA CGCCGAGACG ATCCAGGCGA TCCTCGAAGT GTGCAGTGAA
ATGCGATCGC CGGTGATCCT CGCCGGAACG CCGGGGACCT TTAAACATAT CGCGCTGGAA
GAGATCTACG CCCTGTGCAG CGCCTATTCC ACAACCTACA ACATGCCACT GGCGCTGCAT
CTCGACCACC ACGAATCGCT GGATGATATT CGCCGTAAAG TCCACGCAGG TGTGCGCAGT
GCGATGATCG ACGGCAGCCA CTTCCCGTTT GCCGAGAACG TAAAGCTGGT GAAATCGGTT
GTCGACTTCT GCCACTCCCA GGATTGCAGC GTGGAAGCAG AACTGGGCCG CCTGGGCGGT
GTTGAAGATG ACATGAGCGT TGACGCCGAA AGTGCATTCC TGACCGATCC ACAAGAAGCT
AAACGCTTTG TCGAACTGAC TGGCGTCGAC AGCTTGGCGG TAGCGATTGG TACGGCGCAC
GGCTTATACA GCAAAACGCC GAAGATTGAT TTCCAGCGGC TGGCGGAAAT TCGTGAAGTG
GTGGATGTTC CTCTGGTGCT ACATGGTGCC AGCGATGTTC CGGATGAATT TGTCCGTCGC
ACTATTGAAC TTGGCGTCAC AAAAGTGAAC GTTGCCACAG AATTAAAAAT AGCCTTCGCC
GGCGCGGTTA AAGCCTGGTT TGCGGAAAAT CCGCAGGGTA ATGATCCTCG TTATTATATG
CGCGTCGGAA TGGATGCGAT GAAAGAAGTT GTCAGAAATA AAATTAATGT CTGTGGTTCA
GCGAATCGAA TTTCAGCATA A
 
Protein sequence
MSIISTKYLL QDAQANGYAV PAFNIHNAET IQAILEVCSE MRSPVILAGT PGTFKHIALE 
EIYALCSAYS TTYNMPLALH LDHHESLDDI RRKVHAGVRS AMIDGSHFPF AENVKLVKSV
VDFCHSQDCS VEAELGRLGG VEDDMSVDAE SAFLTDPQEA KRFVELTGVD SLAVAIGTAH
GLYSKTPKID FQRLAEIREV VDVPLVLHGA SDVPDEFVRR TIELGVTKVN VATELKIAFA
GAVKAWFAEN PQGNDPRYYM RVGMDAMKEV VRNKINVCGS ANRISA