Gene ECD_02997 details


Gene Information

Locus tag: ECD_02997
Symbol: agaZ
ID:
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli BL21(DE3)
Kingdom: Bacteria
Replicon accession: CP001509
Strand:
Start bp: 3145107
End bp: 3146387
Gene Length: 1281 bp
Protein Length: 426 aa
Translation table: 11
GC content: 55%
IMG OID:
Product: tagatose 6-phosphate aldolase 1, kbaZ subunit
Protein accession: ACT44801
Protein GI: 253979131
COG category:
COG ID:
TIGRFAM ID:
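
The start and end coordinates, gene length, and protein length listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length; the function names are hypothetical and are not part of this record.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for a CDS, assuming one terminal stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1  # drop the stop codon


# Values taken from the record above.
length = gene_length_bp(3145107, 3146387)
print(length)                     # 1281
print(protein_length_aa(length))  # 426
```

Both results agree with the Gene Length and Protein Length fields.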


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGAAACATC TGACAGAAAT GGTGAGACAG CACAAAGCGG GCAAAACAAA TGCAATTTAT 
GCCGTTTGTT CCGCACATCC GCTGGTGCTG GAAGCTGCAA TCCGCTACGC CAGTGCAAAC
CAAACGCCGT TACTGATTGA AGCAACCTCC AATCAGGTAG ACCAGTTCGG CGGTTATACC
GGAATGACGC CCGCCGATTT TCGCGGCTTT GTTTGTCAGC TCGCCGACTC GTTGAATTTC
CCGCAGGATG CGTTGATTCT GGGTGGTGAC CATCTGGGGC CAAACCGCTG GCAAAACCTG
CCAGCCGCTC AGGCAATGGC CAATGCCGAT GATTTGATTA AAAGCTACGT TGCGGCAGGA
TTCAAAAAAA TCCACCTTGA TTGCAGCATG TCCTGTCAGG ACGATCCAAT TCCCTTAACT
GATGACATCG TGGCTGAACG CGCCGCCCGT CTGGCGAAAG TGGCGGAAGA AACCTGTCTT
GAACACTTTG GCGAAGCCGA TCTGGAGTAT GTCATTGGTA CCGAAGTGCC GGTACCTGGC
GGCGCGCATG AAACCTTAAG CGAGCTGGCG GTCACCACGC CGGATGCCGC CCGCGCCACG
CTGGAAGCCC ATCGTCACGC CTTTGAAAAG CAAGGTTTGA ATGCCATCTG GCCACGCATC
ATTGCCCTGG TGGTTCAACC CGGCGTCGAA TTCGATCACA CCAACGTTAT TGATTATCAG
CCCGCCAAAG CGAGCGCCTT AAGCCAGATG GTCGAAAACT ACGAAACGCT GATTTTCGAA
GCGCACTCTA CCGATTATCA AACGCCGCAA TCGCTGCGCC AGCTGGTGAT TGACCACTTT
GCCATTCTGA AAGTTGGCCC AGCGCTGACC TTCGCCCTGC GTGAAGCTCT GTTCTCTCTG
GCGGCGATTG AAGAAGAACT GGTGCCAGCG AAAGCCTGTT CTGGTCTGCG TCAGGTGCTG
GAAGACGTGA TGCTCGACCG CCCGGAATAC TGGCAAAGCC ACTACCACGG TGACGGCAAC
GCGCGTCGTC TGGCGCGTGG TTATAGCTAC TCGGATCGCG TGCGCTATTA CTGGCCGGAC
AGCCAGATTG ATGACGCTTT CGCTCATCTG GTACGTAATC TGGCGGATTC ACCAATTCCG
CTGCCGCTGA TCAGCCAGTA TCTGCCGCTG CAGTACGTGA AAGTTCGCTC CGGCGAGCTG
CAGCCAACGC CACGGGAACT CATTATCAAC CATATTCAGG ACATCCTGGC GCAGTACCAC
ACAGCCTGTG AAGGCCAATA A
 
Protein sequence
MKHLTEMVRQ HKAGKTNAIY AVCSAHPLVL EAAIRYASAN QTPLLIEATS NQVDQFGGYT 
GMTPADFRGF VCQLADSLNF PQDALILGGD HLGPNRWQNL PAAQAMANAD DLIKSYVAAG
FKKIHLDCSM SCQDDPIPLT DDIVAERAAR LAKVAEETCL EHFGEADLEY VIGTEVPVPG
GAHETLSELA VTTPDAARAT LEAHRHAFEK QGLNAIWPRI IALVVQPGVE FDHTNVIDYQ
PAKASALSQM VENYETLIFE AHSTDYQTPQ SLRQLVIDHF AILKVGPALT FALREALFSL
AAIEEELVPA KACSGLRQVL EDVMLDRPEY WQSHYHGDGN ARRLARGYSY SDRVRYYWPD
SQIDDAFAHL VRNLADSPIP LPLISQYLPL QYVKVRSGEL QPTPRELIIN HIQDILAQYH
TACEGQ
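
The protein sequence corresponds to the gene sequence read with translation table 11. The sketch below illustrates the idea under stated assumptions: it uses the standard codon assignments (which table 11 shares with the standard code), treats the first codon as an initiator and reads it as methionine (the gene above starts with GTG), and stops at the first stop codon. The name `translate_cds` is hypothetical, and the example input is only the first 30 bases of the gene sequence.

```python
# Standard codon assignments in NCBI "TCAG" order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate_cds(dna: str) -> str:
    """Translate a CDS; whitespace in the input is ignored.

    Assumption: the first codon is a valid start codon and is
    therefore emitted as 'M' regardless of its usual meaning.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[pos:pos + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append("M" if pos == 0 else aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate_cds("GTGAAACATC TGACAGAAAT GGTGAGACAG"))  # MKHLTEMVRQ
```

The output matches the first ten residues of the protein sequence. Note that the internal GTG (the eighth codon) is read as valine; only the initiating GTG is read as methionine.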