Gene EcHS_A3322 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A3322 
Symbol 
ID5594473 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp3327534 
End bp3328814 
Gene Length1281 bp 
Protein Length426 aa 
Translation table11 
GC content55% 
IMG OID640922440 
ProductD-tagatose-bisphosphate aldolase, class II, non-catalytic subunit 
Protein accessionYP_001459933 
Protein GI157162615 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG4573] Predicted tagatose 6-phosphate kinase 
TIGRFAM ID[TIGR02810] D-tagatose-bisphosphate aldolase, class II, non-catalytic subunit 


Plasmid Coverage information

Num covering plasmid clones58 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAAACATC TAACAGAAAT GGTGAGACAG CACAAAGCGG GCAAAACAAA TGCAATTTAT 
GCCGTTTGTT CCGCACATCC GCTGGTGCTG GAAGCTGCAA TCCGCTACGC CAGTGCAAAC
CAAACGCCGT TACTGATTGA AGCAACCTCC AATCAGGTAG ACCAGTTCGG CGGTTATACC
GGAATGACGC CCGCCGATTT TCGCGGCTTT GTTTGTCAGC TCGCCGACTC GTTGAATTTC
CCGCAGGATG CGTTGATTCT GGGTGGTGAC CATCTGGGGC CAAACCGCTG GCAAAACCTG
CCAGCCGCTC AGGCAATGGC CAATGCCGAT GATTTGATTA AAAGCTACGT TGCGGCAGGA
TTCAAAAAAA TCCACCTTGA TTGCAGCATG TCCTGTCAGG ACGATCCAAT TCCCTTAACT
GATGACATCG TGGCTGAACG CGCCGCCCGT CTGGCGAAAG TGGCGGAAGA AACCTGTCTT
GAACACTTTG GCGAAGCCGA TCTGGAGTAT GTCATTGGTA CCGAAGTGCC GGTACCTGGC
GGCGCGCATG AAACCTTAAG CGAGCTGGCG GTCACCACGC CGGATGCCGC CCGCGCCACG
CTGGAAGCCC ATCGTCACGC CTTTGAAAAG CAAGGTTTGA ATGCCATCTG GCCACGCATC
ATTGCCCTGG TGGTTCAACC CGGCGTCGAA TTCGATCACA CCAACGTTAT TGATTATCAG
CCCGCCAAAG CGAGCGCCTT AAGCCAGATG GTCGAAAACT ACGAAACGCT GATTTTCGAA
GCGCACTCTA CCGATTATCA AACGCCGCAA TCGCTGCGCC AGCTGGTGAT TGACCACTTT
GCCATTCTGA AAGTTGGCCC AGCGCTGACC TTCGCCCTGC GTGAAGCTCT GTTCTCTCTG
GCGGCGATTG AAGAAGAACT GGTGCCAGCG AAAGTCTGTT CTGGTCTGCG TCAGGTGCTG
GAAGACGTGA TGCTCGACCG CCCGGAATAC TGGCAAAGCC ACTACCACGG TGACGGCAAC
GCGCGTCGTC TGGCGCGTGG TTATAGCTAC TCGGATCGCG TGCGCTATTA CTGGCCGGAC
AGCCAGATTG ATGACGCTTT CGCTCATCTG GTACGTAATC TGGCGGATTC ACCAATTCCG
CTGCCGCTGA TCAGCCAGTA TCTGCCGCTG CAGTACGTGA AAGTTCGCTC CGGCGAGCTG
CAGCCAACGC CACGGGAACT CATTATCAAC CATATTCAGG ACATCCTGGC GCAGTACCAC
ACAGCCTGTG AAGGCCAATA A
 
Protein sequence
MKHLTEMVRQ HKAGKTNAIY AVCSAHPLVL EAAIRYASAN QTPLLIEATS NQVDQFGGYT 
GMTPADFRGF VCQLADSLNF PQDALILGGD HLGPNRWQNL PAAQAMANAD DLIKSYVAAG
FKKIHLDCSM SCQDDPIPLT DDIVAERAAR LAKVAEETCL EHFGEADLEY VIGTEVPVPG
GAHETLSELA VTTPDAARAT LEAHRHAFEK QGLNAIWPRI IALVVQPGVE FDHTNVIDYQ
PAKASALSQM VENYETLIFE AHSTDYQTPQ SLRQLVIDHF AILKVGPALT FALREALFSL
AAIEEELVPA KVCSGLRQVL EDVMLDRPEY WQSHYHGDGN ARRLARGYSY SDRVRYYWPD
SQIDDAFAHL VRNLADSPIP LPLISQYLPL QYVKVRSGEL QPTPRELIIN HIQDILAQYH
TACEGQ