Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_1812 |
Symbol | |
ID | 6146963 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 1832647 |
End bp | 1833939 |
Gene Length | 1293 bp |
Protein Length | 430 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641616688 |
Product | putative sugar ABC transporter, periplasmic sugar-binding protein |
Protein accession | YP_001743866 |
Protein GI | 170683434 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 48 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATTAAAT CAAAAATCGT GCTGTTATCA GCACTGGTTT CCTGCGCCCT GATTTCAGGT TGTAAAGAAG AAAATAAAAC GAATGTCTCC ATTGAGTTTA TGCATTCTTC GGTGGAACAG GAACGCCAGG CGGTAATTAG TCAGTTAATT GCGCGTTTTG AAAAAGAAAA CCCGGGCATC ACCGTTAAAC AAGTGCCCGT GGAAGAAGAC GCCTATAACA CCAAAGTCAT TACCCTTTCA CGTAGTGGTT CGCTGCCGGA AGTGATCGAA ACCAGCCATG ACTACGCCAA AGTTATGGAC AAAGAGCAGC TTATCGATCG CCTGGCGGTT GCCACGGTCA TCAGCAACGT CGGTGAAGGC GCGTTCTACG ATGGCGTTCT GCGGATTGTG CGCACCGAAG ATGGTAGCGC ATGGACCGGT GTTCCTGTCA GCGCCTGGAT TGGCGGCATC TGGTATCGCA AAGATGTGCT GGCAAAAGCC GGGCTGGAGG AGCCGAAAAA CTGGCAGCAA TTGCTGGATG TCGCGCAGAA ACTGAATGAC CCGGCGAATA AAAAATATGG CATTGCGCTG CCTACGGCAG AAAGTGTGCT GACGGAACAA TCCTTCTCCC AGTTTGCCTT ATCCAACCAG GCCAACGTCT TTAACGCCGA AGGCAAAATC ACCCTTGATA CACCAGAGAT GATGCAGGCG CTGACCTATT ACCGCGACCT TGCTGCCAAC ACCATGCCGG GTTCTAACGA CATCATGGAG GTGAAAGACG CTTTTATGAA CGGCACCGCG CCGATGGCGA TTTACTCCAC CTATATCCTT CCGGCTGTGA TTAAAGAGGG CGATCCGAAA AACGTTGGTT TCGTTGTACC CACCGAGAAA AACTCTGCGG TCTACGGCAT GTTGACCTCG CTGACCATTA CCGCCGGGCA AAAGACCGAA GAGACCGAAG CGGCTGAAAA ATTTGTCACC TTTATGGAGC AGGCAGACAA CATTGCCGAC TGGGTGATGA TGTCGCCAGG CGCGGCGCTG CCGGTGAACA AAGCGGTTGT CACTACCGCC ACCTGGAAAG ACAACGACGT TATTAAGGCG CTGGGAGAAT TGCCAAATCA GTTAATCAGC GAACTGCCAA ATATTCAGGT CTTTGGCGCA GTAGGGGATA AAAATTTTAC CCGCATGGGC GATGTGACGG GTTCTGGCGT GGTGAGCTCC ATGGTGCATA ACGTCACCGT GGGTAAAGCC GACCTCTCTA CCACGCTGCA AACGAGCCAG AAAAAGCTGG ATGAACTGAT CGAACAGCAC TAA
|
Protein sequence | MIKSKIVLLS ALVSCALISG CKEENKTNVS IEFMHSSVEQ ERQAVISQLI ARFEKENPGI TVKQVPVEED AYNTKVITLS RSGSLPEVIE TSHDYAKVMD KEQLIDRLAV ATVISNVGEG AFYDGVLRIV RTEDGSAWTG VPVSAWIGGI WYRKDVLAKA GLEEPKNWQQ LLDVAQKLND PANKKYGIAL PTAESVLTEQ SFSQFALSNQ ANVFNAEGKI TLDTPEMMQA LTYYRDLAAN TMPGSNDIME VKDAFMNGTA PMAIYSTYIL PAVIKEGDPK NVGFVVPTEK NSAVYGMLTS LTITAGQKTE ETEAAEKFVT FMEQADNIAD WVMMSPGAAL PVNKAVVTTA TWKDNDVIKA LGELPNQLIS ELPNIQVFGA VGDKNFTRMG DVTGSGVVSS MVHNVTVGKA DLSTTLQTSQ KKLDELIEQH
|
| |