Gene EcSMS35_4047 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_4047 
Symbol 
ID6144341 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp4138752 
End bp4140368 
Gene Length1617 bp 
Protein Length538 aa 
Translation table11 
GC content52% 
IMG OID641618872 
ProductPTS system, alpha-glucoside-specific IIBC component 
Protein accessionYP_001746010 
Protein GI170684221 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1263] Phosphotransferase system IIC components, glucose/maltose/N-acetylglucosamine-specific
[COG1264] Phosphotransferase system IIB components 
TIGRFAM ID[TIGR00826] PTS system, glucose-like IIB component
[TIGR00852] PTS system, maltose and glucose-specific subfamily, IIC component
[TIGR02005] PTS system, alpha-glucoside-specific IIBC component 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones53 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTCAGTC AAATTCAACG CTTTGGCGGC GCGATGTTCA CGCCAGTGCT GCTGTTTCCC 
TTCGCCGGGA TTGTGGTGGG TCTTGCCATC TTGCTGCAAA ACCCGATGTT TGTCGGGGAA
TCACTGACCG ATCCGAACAG TTTATTCGCG CAAATCGTAC ATATTATTGA AGAGGGCGGT
TGGACGGTAT TCCGTAATAT GCCGCTGATT TTTGCTGTCG GTTTACCCAT TGGCCTTGCT
AAGCAAGCGC AGGGGCGAGC TTGTCTGGCG GTGATGGTGA GTTTCCTGAC CTGGAACTAT
TTCATCAACG CGATGGGAAT GACCTGGGGA AGCTACTTCG GCGTCGATTT CACTCAGGAC
GCGGTGGCAG GTAGCGGTCT GACAATGATG GCCGGGATTA AAACCCTCGA TACCAGCATT
ATCGGCGCAA TTATCATTTC CGGCATTGTG ACGGCGCTGC ATAACCGTCT GTTCGATAAA
AAACTGCCGG TGTTTCTCGG CATTTTCCAG GGGACATCTT ATGTGGTGAT TATCGCCTTC
CTGGTGATGA TCCCCTGCGC CTGGCTGACG TTGCTCGGCT GGCCAAAAGT ACAAATGGGG
ATTGAATCTC TGCAAGCGTT CCTGCGTTCG GCGGGTGCGC TTGGGGTCTG GGTTTACACC
TTCCTCGAAC GTATTCTGAT CCCAACCGGT TTACACCACT TCATCTACGG ACCGTTTATC
TTTGGTCCGG CAGCTGTTGA AGGCGGCATT CAGATGTACT GGGCGCAGCA TCTGCAAGAG
TTCAGTTTGA GCGCCGAGCC GCTGAAATCG TTGTTCCCGG AAGGCGGTTT TGCCCTGCAC
GGTAACTCAA AAATCTTTGG TGCCGTGGGC ATTTCTTTAG CGATGTACTT CACTGCCGCA
CCGGAAAATC GGGTAAAAGT GGCGGGCTTG CTGATCCCCG CAACCTTAAC CGCCATGCTG
GTGGGAATTA CCGAACCGCT GGAATTTACC TTCCTGTTCA TTTCACCGTT GCTGTTTGCG
GTACACGCTG TGCTTGCGGC CTCAATGTCG ACCGTGATGT ACCTCTTTGG TGTGGTGGGC
AACATGGGCG GAGGTCTGAT TGACCAGGTT TTACCGCAAA ACTGGATCCC GATGTTCAGC
AACCACGCGG ATATGATTTT GACCCAAATC GCCATTGGGT TGTGCTTTAG CCTGCTGTAC
TTCGTGGTTT TCCGCACCCT GATTCTGCAA TTCAACATGT GCACGCCGGG ACGTGAAGAT
GCGGAAGTGA AACTCTACTC AAAAGCCGAA TACAAAGCCT CGCGAGGCCA AACCACCGCG
GCAGAGCCAA AAAAAGAGCT GGATCAGGCT GCCGGTATCC TGCAAGCCCT GGGCGGGGTC
GGCAATATCT CCAGCATCAA CAATTGCGCG ACGCGTTTAC GTATTGCACT GCATGACATG
TCACAAACGC TGGATGACGA AGTCTTTAAA AAGCTGGGAG CGCACGGCGT CTTCCGTAGT
GGCGATGCCA TTCAGGTGAT CATTGGTTTG CATGTATCCC AGCTGCGTGA ACAGCTCGAT
AGCTTAATTA ATTCTCATCA ATCAGCAGAA AATGTTGCCA TTACGGAGGC AGTATAA
 
Protein sequence
MLSQIQRFGG AMFTPVLLFP FAGIVVGLAI LLQNPMFVGE SLTDPNSLFA QIVHIIEEGG 
WTVFRNMPLI FAVGLPIGLA KQAQGRACLA VMVSFLTWNY FINAMGMTWG SYFGVDFTQD
AVAGSGLTMM AGIKTLDTSI IGAIIISGIV TALHNRLFDK KLPVFLGIFQ GTSYVVIIAF
LVMIPCAWLT LLGWPKVQMG IESLQAFLRS AGALGVWVYT FLERILIPTG LHHFIYGPFI
FGPAAVEGGI QMYWAQHLQE FSLSAEPLKS LFPEGGFALH GNSKIFGAVG ISLAMYFTAA
PENRVKVAGL LIPATLTAML VGITEPLEFT FLFISPLLFA VHAVLAASMS TVMYLFGVVG
NMGGGLIDQV LPQNWIPMFS NHADMILTQI AIGLCFSLLY FVVFRTLILQ FNMCTPGRED
AEVKLYSKAE YKASRGQTTA AEPKKELDQA AGILQALGGV GNISSINNCA TRLRIALHDM
SQTLDDEVFK KLGAHGVFRS GDAIQVIIGL HVSQLREQLD SLINSHQSAE NVAITEAV