Gene EcSMS35_3429 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_3429 
Symbol 
ID6144342 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp3508750 
End bp3510030 
Gene Length1281 bp 
Protein Length426 aa 
Translation table11 
GC content55% 
IMG OID641618258 
ProductD-tagatose-bisphosphate aldolase, class II, non-catalytic subunit 
Protein accessionYP_001745407 
Protein GI170680844 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG4573] Predicted tagatose 6-phosphate kinase 
TIGRFAM ID[TIGR02810] D-tagatose-bisphosphate aldolase, class II, non-catalytic subunit 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones57 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAAACATC TGACAGAAAT GGTGAGACAG CACAAAGAGG GCAAAACAAA TGGGATTTAT 
GCCGTTTGTT CCGCACATCC GCTGGTGCTG GAAGCTGCAA TCCGCTACGC CAGTGCAAAC
CAAACGCCGT TACTGATTGA AGCAACCTCC AATCAGGTGG ACCAGTTCGG CGGTTATACC
GGAATGACGC CCGCCGATTT TCGCGGCTTT GTTTGTCAGC TCGCCGACTC GTTGAATTTC
CCACAGGATG CGTTGATTCT GGGTGGCGAC CATCTGGGGC CAAACCGCTG GCAAAACCTG
CCTGCCGCTC AGGCAATGGC CAATGCCGAT GATTTGATTA AAAGCTACGT GGCGGCAGGA
TTCAAAAAAA TTCACCTTGA TTGCAGCATG TCCTGTCAGG ACGATCCGAT TCCCTTAACT
GATGACATCG TGGCCGAACG CGCCGCTCGT CTGGCGAAAG TGGCGGAAGA AACCTGCCGT
GAACATTTTG GCGCGGCCGA TCTGGAGTAT GTCATTGGTA CCGAAGTACC GGTACCTGGC
GGCGCGCATG AAACCTTAAG CGAGCTGGCG GTCACCACGC CGGATGCCGC CCGCGCCACG
CTGGAAGCCC ATCGTCACGC CTTTGAAAAG CAAGGTCTGA GTGCCATCTG GCCACGCATT
ATTGCCCTGG TGGTTCAACC CGGCGTCGAA TTCGATCACA CCAACGTTAT TGATTATCAG
CCCGTCAAAG CGACCGCCTT AAGCCAGATG GTCGAAAACT ACGAAACGCT GATTTTCGAA
GCGCACTCTA CTGATTACCA AACGCCGCAA TCGCTGCGCC AGCTGGTGAT TGACCACTTT
GCCATTCTGA AAGTTGGCCC GGCGCTGACC TTTGCTCTGC GTGAAGCTCT GTTCTCTCTG
GCGGCGATTG AAGAAGAACT GGTACCAGCA AAAGCCTGTT CTGGTCTGCG TCAGGTGCTG
GAAAACGTGA TGCTCGACCG CCCGGAATAC TGGCAAAGCC ACTACCACGG TGACGGCAAC
GCGCGTCGTC TGGCGCGTGG TTATAGCTAC TCGGATCGCG TGCGCTATTA CTGGCCGGAC
AGCCAGATTG ATGACGCTTT CGCTCATCTG GTTCGTAATC TGGCGGATTC ACCAATTCCG
CTACCGCTGA TCAGCCAGTA TTTGCCGCTG CAATACGTGA AAGTTCGCTC CGGCGAGCTG
CAGCCAACAC CACGGGAACT CATTATCAAC CATATTCAGG ACATCCTGGC GCAGTACCAC
ACAGCCTGTG AAGGCCAATA A
 
Protein sequence
MKHLTEMVRQ HKEGKTNGIY AVCSAHPLVL EAAIRYASAN QTPLLIEATS NQVDQFGGYT 
GMTPADFRGF VCQLADSLNF PQDALILGGD HLGPNRWQNL PAAQAMANAD DLIKSYVAAG
FKKIHLDCSM SCQDDPIPLT DDIVAERAAR LAKVAEETCR EHFGAADLEY VIGTEVPVPG
GAHETLSELA VTTPDAARAT LEAHRHAFEK QGLSAIWPRI IALVVQPGVE FDHTNVIDYQ
PVKATALSQM VENYETLIFE AHSTDYQTPQ SLRQLVIDHF AILKVGPALT FALREALFSL
AAIEEELVPA KACSGLRQVL ENVMLDRPEY WQSHYHGDGN ARRLARGYSY SDRVRYYWPD
SQIDDAFAHL VRNLADSPIP LPLISQYLPL QYVKVRSGEL QPTPRELIIN HIQDILAQYH
TACEGQ