Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_2907 |
Symbol | scrA |
ID | 6147105 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 2980103 |
End bp | 2981473 |
Gene Length | 1371 bp |
Protein Length | 456 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 641617776 |
Product | PTS system sucrose-specific EIIBC component ScrA |
Protein accession | YP_001744931 |
Protein GI | 170681405 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1263] Phosphotransferase system IIC components, glucose/maltose/N-acetylglucosamine-specific |
TIGRFAM ID | [TIGR00826] PTS system, glucose-like IIB component [TIGR00852] PTS system, maltose and glucose-specific subfamily, IIC component [TIGR01996] PTS system, sucrose-specific IIBC component |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.0119328 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 54 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGATTTCG ATAAAATCGC CCAATCGTTG CTTCCTTTAT TAGGTGGTAA GGAGAACATC GCCAGTGCAG CACATTGCGC CACTCGCCTA CGGCTGGTAC TGGTTGACGA CACACTTGCC GATCAACATG CCATTGGCCA GATTGATGGA GTGAAAGGTT GTTTTCGTAA TTCAGGGCAA ATGCAGATCA TCTTTGGCAC CGGCGTGGTT AATAAAGTCT ATGCTGCCTT TATTCAGGTT GCGGGTATTA GCGAATCCAG TAAAGCAGAT ACCGCTCGAC TCGCCGCTCA AAAACTCAAT CCTTTTCAGC GAATAGCACG GCTGCTTTCT AATATTTTTG TCCCCATCAT TCCTGCCATT GTTGCTTCTG GTTTATTAAT GGGGCTTCTG GGAATGGTGA AAACATATGG CTGGGTGAAT GCCGATAATG CGATTTATAT CCTGCTGGAT ATGTGCAGTT CAGCCGCATT TATTATTCTC CCCATCCTGA TTGGCTTTAC TGCTGCTCGT GAATTTGGTG GAAATCCTTA TCTTGGCGCG ACATTAGGAG GGATTCTGAC TCACCCGGCA CTCACAAATG CCTGGGGCGT TGCTGCGGGA TTTCAGACAA TGAACTTCTT CGGTTTTGAA ATCGCCATGA TTGGCTATCA GGGAACGGTT TTCCCCGTGC TTCTGGCAGT ATGGTTTATG AGCATAGTGG AAAAACAGTT ACGCAGGTTT ATCCCTGATG CTCTGGATCT CATTCTGACG CCATTTCTGA CTGTCGTCAT TTCTGGCTTT ATCGCTCTTT TAATTATTGG TCCGGCAGGG CGAGCGTTAG GTGATGGTAT CTCTTTCGTT CTCAGCACGC TGATTGCACA CGCTGGCTGG TTAGCGGGGT TGTTATTCGG CGGGCTATAT TCAGCGATTG TTATTACGGG TATTCATCAT AGCTTCCACG CAATTGAAGC AGGACTTTTA GGTAACCCTG CAATCGGTGT TAATTTCCTG CTGCCTATTT GGGCAATGGC AAACGTTGCG CAAGGCGGTG CATGTCTGGC GGTATGGTTT AAAACCAAAG ATACAAAAAT TAAAGCAATC ACCCTACCCT CGGCTTTTTC CGCAATGTTA GGGATTACTG AAGCCGCTAT TTTTGGTATC AACCTTCGTT TTGTTAAACC TTTTATTGCG GCCTTAATTG GTGGCGCTGC CGGCGGTGCC TGGGTTGTCT CTGTTCACGT TTATATGACT GCCGTTGGTC TGACCGCAAT ACCCGGCATG GCAATCGTTC AGCCAACATC GTTGGTTAAT TATATTATTG GAATGGTGAT TGCTTTCGCT GTCGCTTTTA GTCTCTCTTT ATTGCTCAAA TACAAAACAG ACGAGGAGTA A
|
Protein sequence | MDFDKIAQSL LPLLGGKENI ASAAHCATRL RLVLVDDTLA DQHAIGQIDG VKGCFRNSGQ MQIIFGTGVV NKVYAAFIQV AGISESSKAD TARLAAQKLN PFQRIARLLS NIFVPIIPAI VASGLLMGLL GMVKTYGWVN ADNAIYILLD MCSSAAFIIL PILIGFTAAR EFGGNPYLGA TLGGILTHPA LTNAWGVAAG FQTMNFFGFE IAMIGYQGTV FPVLLAVWFM SIVEKQLRRF IPDALDLILT PFLTVVISGF IALLIIGPAG RALGDGISFV LSTLIAHAGW LAGLLFGGLY SAIVITGIHH SFHAIEAGLL GNPAIGVNFL LPIWAMANVA QGGACLAVWF KTKDTKIKAI TLPSAFSAML GITEAAIFGI NLRFVKPFIA ALIGGAAGGA WVVSVHVYMT AVGLTAIPGM AIVQPTSLVN YIIGMVIAFA VAFSLSLLLK YKTDEE
|
| |