Gene EcSMS35_0007 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_0007 
Symboltal2 
ID6144239 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp8061 
End bp9077 
Gene Length1017 bp 
Protein Length338 aa 
Translation table11 
GC content52% 
IMG OID641614908 
Producttransaldolase B 
Protein accessionYP_001742124 
Protein GI170681632 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0176] Transaldolase 
TIGRFAM ID[TIGR00874] transaldolase 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones49 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGATATCAT CAGGGCAGAC CGGTTACATC CCCCTAACAA GCTGTTTAAA GAGAAATACT 
ATCATGACGG ACAAATTGAC CTCCCTTCGT CAGTACACCA CCGTAGTGGC CGACACTGGG
GACATCGCGG CAATGAAGCT GTATCAACCG CAGGATGCCA CAACCAACCC TTCTCTCATT
CTTAACGCAG CGCAGATTCC GGAATACCGT AAGTTGATTG ATGATGCTGT CGCCTGGGCG
AAACAGCAGA GCAACGATCG CGCGCAGCAG ATCGTGGACG CGACCGACAA ACTGGCAGTA
AATATTGGTC TGGAAATCCT GAAACTGGTT CCGGGCCGTA TCTCAACTGA AGTTGATGCG
CGTCTTTCCT ATGACACCGA AGCGTCAATT GCGAAAGCAA AACGCCTGAT CAAACTCTAC
AACGATGCAG GGATTAGCAA CGATCGTATT CTGATCAAAC TGGCTTCTAC CTGGCAGGGT
ATCCGTGCTG CGGAACAGCT GGAAAAAGAA GGCATCAACT GTAACCTGAC CCTGCTGTTC
TCCTTTGCTC AGGCTCGTGC TTGTGCGGAA GCGGGCGTGT TCCTGATCTC GCCGTTTGTT
GGCCGTATTC TTGACTGGTA CAAAGCGAAT ACCGATAAGA AAGAGTACGC TCCGGCAGAA
GATCCGGGCG TGGTTTCTGT ATCTGAAATC TACCAGTACT ACAAAGAGCA CGGTTATGAA
ACCGTGGTTA TGGGCGCAAG CTTCCGTAAC ATCGGCGAAA TTCTGGAACT GGCAGGCTGC
GACCGTCTGA CCATCGCACC GGCACTGCTG AAAGAGCTGG CGGAGAGCGA AGGGGCTATC
GAACGTAAAC TGTCTTACAC CGGCGAAGTG AAAGCGCGTC CGGCGCGTAT CACTGAGTCC
GAGTTCCTGT GGCAGCACAA CCAGGATCCA ATGGCAGTAG ATAAACTGGC GGAAGGTATC
CGTAAGTTTG CTGTTGACCA GGAAAAACTG GAAAAAATGA TCGGCGATCT GCTGTAA
 
Protein sequence
MISSGQTGYI PLTSCLKRNT IMTDKLTSLR QYTTVVADTG DIAAMKLYQP QDATTNPSLI 
LNAAQIPEYR KLIDDAVAWA KQQSNDRAQQ IVDATDKLAV NIGLEILKLV PGRISTEVDA
RLSYDTEASI AKAKRLIKLY NDAGISNDRI LIKLASTWQG IRAAEQLEKE GINCNLTLLF
SFAQARACAE AGVFLISPFV GRILDWYKAN TDKKEYAPAE DPGVVSVSEI YQYYKEHGYE
TVVMGASFRN IGEILELAGC DRLTIAPALL KELAESEGAI ERKLSYTGEV KARPARITES
EFLWQHNQDP MAVDKLAEGI RKFAVDQEKL EKMIGDLL