Gene EcSMS35_2907 details


Gene Information

Locus tag: EcSMS35_2907
Symbol: scrA
ID: 6147105
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 2980103
End bp: 2981473
Gene Length: 1371 bp
Protein Length: 456 aa
Translation table: 11
GC content: 46%
IMG OID: 641617776
Product: PTS system sucrose-specific EIIBC component ScrA
Protein accession: YP_001744931
Protein GI: 170681405
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1263] Phosphotransferase system IIC components, glucose/maltose/N-acetylglucosamine-specific
TIGRFAM ID: [TIGR00826] PTS system, glucose-like IIB component
    [TIGR00852] PTS system, maltose and glucose-specific subfamily, IIC component
    [TIGR01996] PTS system, sucrose-specific IIBC component
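
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length; the function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the record above.
length = gene_length(2980103, 2981473)
print(length, protein_length(length))  # 1371 456
```

Under those assumptions the result agrees with the Gene Length (1371 bp) and Protein Length (456 aa) fields of the record.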


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0119328
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 54
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGATTTCG ATAAAATCGC CCAATCGTTG CTTCCTTTAT TAGGTGGTAA GGAGAACATC 
GCCAGTGCAG CACATTGCGC CACTCGCCTA CGGCTGGTAC TGGTTGACGA CACACTTGCC
GATCAACATG CCATTGGCCA GATTGATGGA GTGAAAGGTT GTTTTCGTAA TTCAGGGCAA
ATGCAGATCA TCTTTGGCAC CGGCGTGGTT AATAAAGTCT ATGCTGCCTT TATTCAGGTT
GCGGGTATTA GCGAATCCAG TAAAGCAGAT ACCGCTCGAC TCGCCGCTCA AAAACTCAAT
CCTTTTCAGC GAATAGCACG GCTGCTTTCT AATATTTTTG TCCCCATCAT TCCTGCCATT
GTTGCTTCTG GTTTATTAAT GGGGCTTCTG GGAATGGTGA AAACATATGG CTGGGTGAAT
GCCGATAATG CGATTTATAT CCTGCTGGAT ATGTGCAGTT CAGCCGCATT TATTATTCTC
CCCATCCTGA TTGGCTTTAC TGCTGCTCGT GAATTTGGTG GAAATCCTTA TCTTGGCGCG
ACATTAGGAG GGATTCTGAC TCACCCGGCA CTCACAAATG CCTGGGGCGT TGCTGCGGGA
TTTCAGACAA TGAACTTCTT CGGTTTTGAA ATCGCCATGA TTGGCTATCA GGGAACGGTT
TTCCCCGTGC TTCTGGCAGT ATGGTTTATG AGCATAGTGG AAAAACAGTT ACGCAGGTTT
ATCCCTGATG CTCTGGATCT CATTCTGACG CCATTTCTGA CTGTCGTCAT TTCTGGCTTT
ATCGCTCTTT TAATTATTGG TCCGGCAGGG CGAGCGTTAG GTGATGGTAT CTCTTTCGTT
CTCAGCACGC TGATTGCACA CGCTGGCTGG TTAGCGGGGT TGTTATTCGG CGGGCTATAT
TCAGCGATTG TTATTACGGG TATTCATCAT AGCTTCCACG CAATTGAAGC AGGACTTTTA
GGTAACCCTG CAATCGGTGT TAATTTCCTG CTGCCTATTT GGGCAATGGC AAACGTTGCG
CAAGGCGGTG CATGTCTGGC GGTATGGTTT AAAACCAAAG ATACAAAAAT TAAAGCAATC
ACCCTACCCT CGGCTTTTTC CGCAATGTTA GGGATTACTG AAGCCGCTAT TTTTGGTATC
AACCTTCGTT TTGTTAAACC TTTTATTGCG GCCTTAATTG GTGGCGCTGC CGGCGGTGCC
TGGGTTGTCT CTGTTCACGT TTATATGACT GCCGTTGGTC TGACCGCAAT ACCCGGCATG
GCAATCGTTC AGCCAACATC GTTGGTTAAT TATATTATTG GAATGGTGAT TGCTTTCGCT
GTCGCTTTTA GTCTCTCTTT ATTGCTCAAA TACAAAACAG ACGAGGAGTA A
 
Protein sequence
MDFDKIAQSL LPLLGGKENI ASAAHCATRL RLVLVDDTLA DQHAIGQIDG VKGCFRNSGQ 
MQIIFGTGVV NKVYAAFIQV AGISESSKAD TARLAAQKLN PFQRIARLLS NIFVPIIPAI
VASGLLMGLL GMVKTYGWVN ADNAIYILLD MCSSAAFIIL PILIGFTAAR EFGGNPYLGA
TLGGILTHPA LTNAWGVAAG FQTMNFFGFE IAMIGYQGTV FPVLLAVWFM SIVEKQLRRF
IPDALDLILT PFLTVVISGF IALLIIGPAG RALGDGISFV LSTLIAHAGW LAGLLFGGLY
SAIVITGIHH SFHAIEAGLL GNPAIGVNFL LPIWAMANVA QGGACLAVWF KTKDTKIKAI
TLPSAFSAML GITEAAIFGI NLRFVKPFIA ALIGGAAGGA WVVSVHVYMT AVGLTAIPGM
AIVQPTSLVN YIIGMVIAFA VAFSLSLLLK YKTDEE
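
The sequences above are displayed in space-separated blocks of 10 characters, 60 per row, so the whitespace has to be removed before the sequences can be measured or analysed. The following is a minimal sketch of that cleanup plus a GC-content calculation; the helper names `clean_sequence` and `gc_percent` are hypothetical, and the sample input is the last row of the protein sequence shown above.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (spaces, newlines) from a blocked sequence."""
    return "".join(text.split())


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA string (case-insensitive)."""
    dna = dna.upper()
    if not dna:
        raise ValueError("empty sequence")
    gc = sum(1 for base in dna if base in "GC")
    return 100.0 * gc / len(dna)


# Last row of the protein sequence above: three blocks of 10 plus one of 6.
last_row = "AIVQPTSLVN YIIGMVIAFA VAFSLSLLLK YKTDEE"
print(len(clean_sequence(last_row)))  # 36
```

Applying `clean_sequence` to the full gene sequence and passing the result to `gc_percent` gives a value that can be compared with the GC content field of the record.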