Gene ECH74115_4446 details

Gene Information

Locus tag: ECH74115_4446
Symbol:
ID: 6969785
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 4121232
End bp: 4122512
Gene Length: 1281 bp
Protein Length: 426 aa
Translation table: 11
GC content: 55%
IMG OID: 643388166
Product: D-tagatose-bisphosphate aldolase, class II, non-catalytic subunit
Protein accession: YP_002272603
Protein GI: 209398293
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG4573] Predicted tagatose 6-phosphate kinase
TIGRFAM ID: [TIGR02810] D-tagatose-bisphosphate aldolase, class II, non-catalytic subunit
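
The coordinate and length fields in this record agree with each other, and a short sketch can confirm it. The sketch assumes the start and end positions are inclusive, 1-based coordinates and that the CDS ends with a single stop codon. The function names are illustrative and are not part of the record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming inclusive 1-based coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Start bp and End bp copied from the record.
length_bp = gene_length(4121232, 4122512)
print(length_bp, protein_length(length_bp))  # prints: 1281 426
```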


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.752498
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 54
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
GTGAAACATC TGACAGAAAT GGTGAGACAG CACAAAGCGG GCAAAACAAA TGGAATTTAT 
GCCGTTTGTT CCGCACATCC GCTGGTGCTG GAGGCTGCAA TCCGCTACGC CAGTGCAAAC
CAAACGCCGT TACTGATTGA AGCAACCTCC AATCAGGTAG ACCAGTTCGG CGGTTATACC
GGAATGACGC CCGCCGATTT TCGCGGCTTT GTTTGTCAGC TCGCCGACTC GTTGAATTTC
CCGCAGGATG CGTTGATTCT GGGTGGTGAC CATCTGGGGC CAAACCGCTG GCAAAACCTG
CCGGCCGCTC AGGCAATGGC CAATGCCGAT GATTTGATTA AAAGCTACGT TGCGGCAGGA
TTCAAAAAAA TCCACCTTGA TTGCAGCATG TCCTGTCAGG ACGATCCGAT TCCCTTAACT
GATGACATCG TGGCTGAACG CGCCGCCCGT CTGGCGAAAG TGGCGGAAGA AACCTGTCTT
GAACACTTTG GCGAAGCCGA TCTGGAGTAT GTCATTGGTA CCGAAGTGCC GGTACCTGGC
GGCGCGCATG AAACCTTAAG CGAGCTGGCG GTCACCACGC CGGATGCCGC CCGCGCCACG
CTGGAAGCCC ATCGTCACGC CTTTGAAAAG CAAGGTTTGA ATGCCATCTG GCCACGCATC
ATTGCCCTGG TGGTTCAACC CGGCGTCGAA TTCGATCACA CCAACGTTAT TGATTATCAG
CCCGCCAAAG CGACCGCCTT AAGCCAGATG GTCGAAAGCT ACGAAACGCT GATTTTCGAA
GCGCACTCTA CCGATTATCA AACGCCGCAA TCGCTGCGCC AGCTGGTGAT TGACCACTTT
GCCATTCTGA AAGTTGGCCC GGCGCTGACC TTCGCCCTGC GTGAAGCTCT GTTCTCTCTG
GCAGCGATTG AAGAAGAACT GGTGCCAGCA AAAGCCTGTT CTGGTCTGCG TCAGGTGCTG
GAAAACGTGA TGCTCAACCG CCCGGAATAC TGGCAAAGCC ACTACCACGG TGACGGCAAC
GCGCGTCGTC TGGCGCGTGG TTATAGCTAC TCGGATCGCG TGCGCTATTA CTGGCCGGAC
AGCCAGATTG ATGACGCTTT CGCTCATCTG GTACGTAATC TGGCGGATTC ACCAATTCCG
CTGCCGCTGA TCAGCCAGTA TCTGCCGCTG CAGTACGTGA AAGTTCGCTC CGGCGAGCTG
CAGCCAACGC CACGGGAACT CATTATCAAC CATATTCAGG ACATCCTGGC GCAGTACCAC
ACAGCCTGTG AAGGCCAATA A
 
Protein sequence
MKHLTEMVRQ HKAGKTNGIY AVCSAHPLVL EAAIRYASAN QTPLLIEATS NQVDQFGGYT 
GMTPADFRGF VCQLADSLNF PQDALILGGD HLGPNRWQNL PAAQAMANAD DLIKSYVAAG
FKKIHLDCSM SCQDDPIPLT DDIVAERAAR LAKVAEETCL EHFGEADLEY VIGTEVPVPG
GAHETLSELA VTTPDAARAT LEAHRHAFEK QGLNAIWPRI IALVVQPGVE FDHTNVIDYQ
PAKATALSQM VESYETLIFE AHSTDYQTPQ SLRQLVIDHF AILKVGPALT FALREALFSL
AAIEEELVPA KACSGLRQVL ENVMLNRPEY WQSHYHGDGN ARRLARGYSY SDRVRYYWPD
SQIDDAFAHL VRNLADSPIP LPLISQYLPL QYVKVRSGEL QPTPRELIIN HIQDILAQYH
TACEGQ
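
The protein sequence can be related to the gene sequence with a minimal translation sketch. Translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs in its permitted start codons, which include GTG. The sketch assumes the input is a complete CDS whose first codon is a genuine start codon (so it is read as methionine) and strips one trailing stop codon. The names `CODON_TABLE` and `translate_cds` are hypothetical helpers, not part of the record.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (first base varies slowest).
# These are the standard-code assignments, which table 11 shares.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_cds(dna: str) -> str:
    """Translate a CDS; whitespace between sequence blocks is ignored."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    residues = [CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3)]
    # Assumption: the first codon is a start codon, so it encodes Met
    # even when it is GTG or TTG rather than ATG.
    residues[0] = "M"
    protein = "".join(residues)
    # Drop a single terminal stop codon if present.
    return protein[:-1] if protein.endswith("*") else protein


# First 30 bases of the gene sequence in this record.
print(translate_cds("GTGAAACATC TGACAGAAAT GGTGAGACAG"))  # prints: MKHLTEMVRQ
```

Applied to the full gene sequence, the same function would be expected to reproduce the 426-residue protein sequence listed in the record.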