Gene EcSMS35_2961 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_2961 
Symbol 
ID6146566 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp3033694 
End bp3035274 
Gene Length1581 bp 
Protein Length526 aa 
Translation table11 
GC content44% 
IMG OID641617830 
ProductPTS system, maltose and glucose-specific IIBC component family protein 
Protein accessionYP_001744982 
Protein GI170683282 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1263] Phosphotransferase system IIC components, glucose/maltose/N-acetylglucosamine-specific
[COG1264] Phosphotransferase system IIB components 
TIGRFAM ID[TIGR00826] PTS system, glucose-like IIB component
[TIGR02004] PTS system, maltose and glucose-specific IIBC component 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones51 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAACAAA AGAAAGCCTG GAGTTTTTTT CAGAGCCTGG GAAAGGCATT CATGTATCCC 
ATTGCTCTGC TAAGTGTATG TGGCATGATG CTAGGGCTGG GAAGTGGTTT AGCCAGTGAT
GATATGGCAA AGTTAATTCC ATTTCTGGCT ATTCCAATAA TTAAAACCAT ACTTGATTTC
ATTGTTAGTC TTGGTTTGTT TGCCTTTGTT AATTTACCTG TATTGTTTGC GATAGCGATT
CCCTTAGGAT TATTAAAAGA TAAAGAGGAT AAAGCCTATG GTGCTTTTTC TGGCTTAATT
GGTTTTATGG CGATGCATCT GGGAACGAAC TTTTATCTTA AACAGCACGA CTTATTGGTC
GCTGCTGACC AAATGTCGAC ACATGGGCAA ACCATCATTC TGGGGATCCA GTCCTACAAT
ACCAGCGTGT TAGGGGGAAT TGTTGCTGGG TTATTAGTCG CCAGCATGTA TAAAAAGATC
GTTAATTTAC GCATTCCTGA ATCATTAGGT TTTTATAGCG GCCCACGTCT GGTGCCTATC
ATTACACTGA TTGTGATGAG TGGATTTGGT CTGATCATCC CTTTTATCTG GCCGCCGTTT
TTCAATCTTT TCATGCTCAT TGGCCACTGG ATTTCAACTT CCGGTCCTGT TGGCTATTTT
TTCTATGCAG TCGCCGAACG CGTGACGATA CCTTTTGGTT TAAATCACCT GGTGACGTCA
GTTTTCCGCT TTACCCCCAT CGGCGGATCG GCAGTGATCG GCGGCGAAGA GTATTATGGC
ACCCTGAACA TGTTTATGGC GTACGTCAAA GAGAATGCGG TCATTCCGCT GGATTTGGCG
GGGAAAATGG AACAGGGCAA ACTGATGATT CAGTATGGTC TGGCTGGTGC CGCGCTGGCG
ATGTATCGCA CTGCTCATGC TCAAAACAGA AAAGCTATCA AAGCATTGCT TATTTCCGGG
GTGCTTACAG TCATTATTGG CGGCGTCAGC GAACCGATTG AGTTTCTGTT CTTATTTGTC
AGTCCACTGC TGTTTGTCTT CCATGCATTT ATGAATGGAT TCGCCAACAT GGTCTTGCCA
TATATGGGCG TGAAGATGGG ATTTACTGGC GATCTGATTC AATTTATTAG CTTTGGCGTA
TTGCGTGGCA CAAGAACAGG TTGGCCAATA GCGGTGTGTG TCGAAGTGGC CTATTTCTTT
ATTTATTACT TTGTGTTCCG TTGGACTATT CTCAAATTTA ACCTGATGAC GGTAGGCCGT
GAAGAGTCCA GTCCTGTCAC GCTGAACGCT CATGAGGATA CGGCTATAGC GGATATCCCA
ACTCCTGATA AATCAGAGCT GCAAGCGGCG GAGCAGATGG TTAAGGCACT TGGTGGTAAA
GAGAATATTA AGTCACTGGA TAATTGCGTA ACTCGTTTAC GTTTAACAAT CGCAGATATG
GGATTGCTTG ACGAGGCTGC AATAAAAAGA GCCGGCGGAA TTGCGGTTGT TAAACTTGAT
CAAAATACCC TACAAGTCAT TATCGGTACT AAAGTCATCG CCTTGCGTCG GGATATGGAT
AACTACATGG GGATATGCTG A
 
Protein sequence
MKQKKAWSFF QSLGKAFMYP IALLSVCGMM LGLGSGLASD DMAKLIPFLA IPIIKTILDF 
IVSLGLFAFV NLPVLFAIAI PLGLLKDKED KAYGAFSGLI GFMAMHLGTN FYLKQHDLLV
AADQMSTHGQ TIILGIQSYN TSVLGGIVAG LLVASMYKKI VNLRIPESLG FYSGPRLVPI
ITLIVMSGFG LIIPFIWPPF FNLFMLIGHW ISTSGPVGYF FYAVAERVTI PFGLNHLVTS
VFRFTPIGGS AVIGGEEYYG TLNMFMAYVK ENAVIPLDLA GKMEQGKLMI QYGLAGAALA
MYRTAHAQNR KAIKALLISG VLTVIIGGVS EPIEFLFLFV SPLLFVFHAF MNGFANMVLP
YMGVKMGFTG DLIQFISFGV LRGTRTGWPI AVCVEVAYFF IYYFVFRWTI LKFNLMTVGR
EESSPVTLNA HEDTAIADIP TPDKSELQAA EQMVKALGGK ENIKSLDNCV TRLRLTIADM
GLLDEAAIKR AGGIAVVKLD QNTLQVIIGT KVIALRRDMD NYMGIC