Gene EcSMS35_1578 details

Gene Information

Locus tag: EcSMS35_1578
Symbol: malX
ID: 6146635
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 1561717
End bp: 1563309
Gene Length: 1593 bp
Protein Length: 530 aa
Translation table: 11
GC content: 52%
IMG OID: 641616455
Product: bifunctional PTS system maltose and glucose-specific transporter subunits IICB
Protein accession: YP_001743633
Protein GI: 170681681
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1263] Phosphotransferase system IIC components, glucose/maltose/N-acetylglucosamine-specific
        [COG1264] Phosphotransferase system IIB components
TIGRFAM ID: [TIGR00826] PTS system, glucose-like IIB component
            [TIGR00852] PTS system, maltose and glucose-specific subfamily, IIC component
            [TIGR02004] PTS system, maltose and glucose-specific IIBC component
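
The coordinate and length fields above are consistent with one another, and the arithmetic can be sketched as follows. This is a minimal illustration, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names are hypothetical and not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count of an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(1561717, 1563309)
print(length, protein_length(length))  # prints "1593 530"
```

The result matches the listed gene length of 1593 bp and protein length of 530 aa.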


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 67
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGGCGA AAACAGCACC GAAAGTCACG CTGTGGGAGT TCTTCCAGCA GTTAGGCAAA 
ACCTTTATGT TACCCGTGGC ATTATTGTCG TTCTGCGGCA TTATGCTCGG CATTGGTAGT
TCTCTTAGCA GCCATGATGT CATCACCCTG ATCCCGGTCC TGGGCAACCC CGTGTTGCAG
GCTATCTTTA CCTGGATGAG TAAGATTGGC TCGTTTGCTT TTAGTTTCCT GCCAGTCATG
TTCTGTATCG CCATCCCGCT GGGTCTGGCA CGTGAAAACA AAGGCGTAGC GGCATTCGCT
GGCTTCGTCG GTTATGCGGT AATGAACCTC GCGGTAAACT TCTGGTTGAC CAATAAAGGC
ATTCTGCCAA CCACGGACGC CGCGGTTCTG AAAGCCAATA ACATCCAGAG CATTCTTGGG
ATCCAGTCGA TCGACACCGG GATCCTCGGT GCGGTGATCG CCGGTATTAT CGTCTGGATG
CTGCATGAGC GTTTCCATAA TATCCGCCTG CCGGATGCGC TGGCATTCTT CGGCGGTACG
CGCTTCGTAC CAATCATCTC CTCGCTGGTG ATGGGTCTTG TCGGCCTGGT GATTCCATTA
GTCTGGCCGA TTTTCGCCAT GGGTATTAGC GGCTTAGGCC ATATGATCAA CAGCGCGGGT
GATTTCGGAC CGATGCTGTT TGGTACCGGT GAACGTCTGC TGTTGCCGTT TGGTCTGCAT
CACATTCTGG TGGCATTAAT TCGCTTTACC GACGCAGGCG GCACGCAGGA AGTCTGCGGT
CAAACCGTCA GCGGTGCACT GACCATCTTC CAGGCGCAAT TGAGTTGCCC GACCACTCAC
GGTTTTTCTG AAAGCGCCAC GCGTTTCCTT TCGCAAGGTA AAATGCCTGC GTTTCTCGGC
GGTCTGCCAG GTGCAGCGTT AGCTATGTAT CACTGTGCGC GCCCGGAAAA TCGCCATAAA
ATTAAAGGTC TGCTGATTTC TGGCCTGATC GCCTGTGTCG TTGGCGGCAC GACCGAACCG
CTGGAATTCC TGTTCCTGTT CGTAGCGCCA GTTCTGTATG TCATCCACGC GCTGTTAACC
GGCCTCGGCT TCACTGTCAT GTCTGTGCTC GGCGTCACCA TCGGTAATAC CGACGGCAAT
ATCATCGACT TTGTGGTGTT CGGTATTTTG CATGGTCTGT CAACCAAGTG GTACATGGTG
CCAGTAGTGG CGGCAATATG GTTTGTCGTT TACTACGTCA TCTTCCGTTT CGCTATCACC
CGCTTCAATC TGAAAACCCC GGGGCGCGAT AGTGAAGTTG CCAGTTCAAT CGAAAAAGCC
GTTGCCGGTG CGCCGGGTAA ATCAGGTTAC AACGTTCCGG CAATCCTCGA AGCCTTAGGC
GGAGCCGACA ATATTGTCAG TCTCGATAAC TGCATTACCC GTCTGCGTTT GTCTGTAAAA
GATATGTCGC TTGTTAATGT GCAGGCACTG AAGGACAATC GGGCAATTGG CGTAGTACAA
CTTAATCAAC ATAACCTGCA GGTTGTTATC GGGCCACAAG TTCAGTCAGT AAAAGATGAA
ATGGCCGGTC TGATGCATAC TGTCCAGGCA TAA
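
The gene sequence is laid out in space-separated blocks of ten bases, so it needs to be joined before any analysis. The sketch below shows one way to do that and to compute a GC percentage of the kind listed in the gene information (52%). The helper names are hypothetical, and only the first line of the sequence is used as sample input.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace (block spaces, line breaks) and upper-case the bases."""
    return "".join(raw.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence, copied as displayed (ten-base blocks).
first_line = "ATGACGGCGA AAACAGCACC GAAAGTCACG CTGTGGGAGT TCTTCCAGCA GTTAGGCAAA"
seq = clean_sequence(first_line)
print(len(seq), round(gc_content(seq), 1))
```

Passing the full 1593 bp sequence through the same two functions gives the whole-gene GC percentage.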
 
Protein sequence
MTAKTAPKVT LWEFFQQLGK TFMLPVALLS FCGIMLGIGS SLSSHDVITL IPVLGNPVLQ 
AIFTWMSKIG SFAFSFLPVM FCIAIPLGLA RENKGVAAFA GFVGYAVMNL AVNFWLTNKG
ILPTTDAAVL KANNIQSILG IQSIDTGILG AVIAGIIVWM LHERFHNIRL PDALAFFGGT
RFVPIISSLV MGLVGLVIPL VWPIFAMGIS GLGHMINSAG DFGPMLFGTG ERLLLPFGLH
HILVALIRFT DAGGTQEVCG QTVSGALTIF QAQLSCPTTH GFSESATRFL SQGKMPAFLG
GLPGAALAMY HCARPENRHK IKGLLISGLI ACVVGGTTEP LEFLFLFVAP VLYVIHALLT
GLGFTVMSVL GVTIGNTDGN IIDFVVFGIL HGLSTKWYMV PVVAAIWFVV YYVIFRFAIT
RFNLKTPGRD SEVASSIEKA VAGAPGKSGY NVPAILEALG GADNIVSLDN CITRLRLSVK
DMSLVNVQAL KDNRAIGVVQ LNQHNLQVVI GPQVQSVKDE MAGLMHTVQA
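
The protein sequence follows from the gene sequence under translation table 11 (the bacterial, archaeal and plant plastid code). As a minimal sketch, the codon assignments can be built from the usual compact string in T, C, A, G order. Table 11 uses the same codon-to-amino-acid assignments as the standard code and differs only in its permitted start codons; this gene begins with ATG, so no special start handling is included. The function name is hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons TTT, TTC, TTA, TTG, TCT, ... GGG ('*' marks a stop).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases and last 33 bases of the gene sequence, as displayed.
print(translate("ATGACGGCGA AAACAGCACC GAAAGTCACG"))      # prints "MTAKTAPKVT"
print(translate("ATGGCCGGTC TGATGCATAC TGTCCAGGCA TAA"))  # prints "MAGLMHTVQA"
```

Both fragments agree with the first and last ten residues of the protein sequence shown above.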