Gene EcSMS35_1812 details

Gene Information

Locus tag: EcSMS35_1812
Symbol:
ID: 6146963
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 1832647
End bp: 1833939
Gene Length: 1293 bp
Protein Length: 430 aa
Translation table: 11
GC content: 51%
IMG OID: 641616688
Product: putative sugar ABC transporter, periplasmic sugar-binding protein
Protein accession: YP_001743866
Protein GI: 170683434
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1653] ABC-type sugar transport system, periplasmic component
TIGRFAM ID:
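
The coordinate and length fields in this record agree with each other, which can be checked with simple arithmetic. The sketch below assumes 1-based inclusive coordinates (an assumption, though the listed start, end, and gene length agree under it) and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS, assuming exactly one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(1832647, 1833939)
print(length)                  # 1293
print(protein_length(length))  # 430
```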


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 48
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATTAAAT CAAAAATCGT GCTGTTATCA GCACTGGTTT CCTGCGCCCT GATTTCAGGT 
TGTAAAGAAG AAAATAAAAC GAATGTCTCC ATTGAGTTTA TGCATTCTTC GGTGGAACAG
GAACGCCAGG CGGTAATTAG TCAGTTAATT GCGCGTTTTG AAAAAGAAAA CCCGGGCATC
ACCGTTAAAC AAGTGCCCGT GGAAGAAGAC GCCTATAACA CCAAAGTCAT TACCCTTTCA
CGTAGTGGTT CGCTGCCGGA AGTGATCGAA ACCAGCCATG ACTACGCCAA AGTTATGGAC
AAAGAGCAGC TTATCGATCG CCTGGCGGTT GCCACGGTCA TCAGCAACGT CGGTGAAGGC
GCGTTCTACG ATGGCGTTCT GCGGATTGTG CGCACCGAAG ATGGTAGCGC ATGGACCGGT
GTTCCTGTCA GCGCCTGGAT TGGCGGCATC TGGTATCGCA AAGATGTGCT GGCAAAAGCC
GGGCTGGAGG AGCCGAAAAA CTGGCAGCAA TTGCTGGATG TCGCGCAGAA ACTGAATGAC
CCGGCGAATA AAAAATATGG CATTGCGCTG CCTACGGCAG AAAGTGTGCT GACGGAACAA
TCCTTCTCCC AGTTTGCCTT ATCCAACCAG GCCAACGTCT TTAACGCCGA AGGCAAAATC
ACCCTTGATA CACCAGAGAT GATGCAGGCG CTGACCTATT ACCGCGACCT TGCTGCCAAC
ACCATGCCGG GTTCTAACGA CATCATGGAG GTGAAAGACG CTTTTATGAA CGGCACCGCG
CCGATGGCGA TTTACTCCAC CTATATCCTT CCGGCTGTGA TTAAAGAGGG CGATCCGAAA
AACGTTGGTT TCGTTGTACC CACCGAGAAA AACTCTGCGG TCTACGGCAT GTTGACCTCG
CTGACCATTA CCGCCGGGCA AAAGACCGAA GAGACCGAAG CGGCTGAAAA ATTTGTCACC
TTTATGGAGC AGGCAGACAA CATTGCCGAC TGGGTGATGA TGTCGCCAGG CGCGGCGCTG
CCGGTGAACA AAGCGGTTGT CACTACCGCC ACCTGGAAAG ACAACGACGT TATTAAGGCG
CTGGGAGAAT TGCCAAATCA GTTAATCAGC GAACTGCCAA ATATTCAGGT CTTTGGCGCA
GTAGGGGATA AAAATTTTAC CCGCATGGGC GATGTGACGG GTTCTGGCGT GGTGAGCTCC
ATGGTGCATA ACGTCACCGT GGGTAAAGCC GACCTCTCTA CCACGCTGCA AACGAGCCAG
AAAAAGCTGG ATGAACTGAT CGAACAGCAC TAA
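
The gene sequence above is printed in blocks of 10 bases, so the whitespace has to be removed before the sequence can be measured. The following is a minimal sketch of how the blocks could be joined and a GC fraction computed. The helper names are hypothetical, and only the first line of the sequence is used as a short demonstration, so the printed percentage is for that line and not for the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Join a sequence printed in whitespace-separated blocks into one uppercase string."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence, used only as a short demonstration.
first_line = clean_sequence(
    "ATGATTAAAT CAAAAATCGT GCTGTTATCA GCACTGGTTT CCTGCGCCCT GATTTCAGGT"
)
print(len(first_line))                   # 60
print(f"{gc_fraction(first_line):.0%}")  # GC percentage of this line only
```

Applying the same two functions to all lines of the gene sequence should give a length equal to the Gene Length field and a GC percentage comparable to the GC content field.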
 
Protein sequence
MIKSKIVLLS ALVSCALISG CKEENKTNVS IEFMHSSVEQ ERQAVISQLI ARFEKENPGI 
TVKQVPVEED AYNTKVITLS RSGSLPEVIE TSHDYAKVMD KEQLIDRLAV ATVISNVGEG
AFYDGVLRIV RTEDGSAWTG VPVSAWIGGI WYRKDVLAKA GLEEPKNWQQ LLDVAQKLND
PANKKYGIAL PTAESVLTEQ SFSQFALSNQ ANVFNAEGKI TLDTPEMMQA LTYYRDLAAN
TMPGSNDIME VKDAFMNGTA PMAIYSTYIL PAVIKEGDPK NVGFVVPTEK NSAVYGMLTS
LTITAGQKTE ETEAAEKFVT FMEQADNIAD WVMMSPGAAL PVNKAVVTTA TWKDNDVIKA
LGELPNQLIS ELPNIQVFGA VGDKNFTRMG DVTGSGVVSS MVHNVTVGKA DLSTTLQTSQ
KKLDELIEQH
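
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The sketch below is one way to do this in plain Python, under a few stated assumptions: it builds the codon table from the standard TCAG ordering, relies on the fact that translation table 11 assigns the same residues to codons as the standard code (the two differ in permitted start codons), stops at the first stop codon, and does not handle ambiguous bases or alternative start codons, which this gene does not need because it begins with ATG. The names are illustrative.

```python
BASES = "TCAG"
# Residues for the 64 codons in TCAG order; table 11 shares these
# codon-to-residue assignments with the standard genetic code.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string (whitespace allowed) up to the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":  # stop codon ends translation
            break
        protein.append(residue)
    return "".join(protein)


# First three blocks and the final line of the gene sequence above.
print(translate("ATGATTAAAT CAAAAATCGT GCTGTTATCA"))      # MIKSKIVLLS
print(translate("AAAAAGCTGG ATGAACTGAT CGAACAGCAC TAA"))  # KKLDELIEQH
```

The two outputs match the first and last ten residues of the protein sequence listed above. The second fragment starts in frame because the 1260 bases that precede it are a multiple of 3.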