Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_3714 |
Symbol | glgB |
ID | 6143963 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 3781702 |
End bp | 3783888 |
Gene Length | 2187 bp |
Protein Length | 728 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641618540 |
Product | glycogen branching enzyme |
Protein accession | YP_001745680 |
Protein GI | 170680453 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0296] 1,4-alpha-glucan branching enzyme |
TIGRFAM ID | [TIGR01515] alpha-1,4-glucan:alpha-1,4-glucan 6-glycosyltransferase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.470927 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 46 |
Fosmid unclonability p-value | 0.781278 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCGATC GTATCGATAG AGACGTGATT AACGCGCTAA TTGCAGGCCA TTTTGCGGAT CCTTTTTCCG TACTGGGGAT GCATAAAACC ACCGCGGGAC TGGAAGTCCG TGCCCTTTTA CCCGACGCTA CCGATGTGTG GGTGATTGAA CCGAAAACCG GGCGCAAACT CGCAAAACTG GAGTGTCTCG ACTCCCGTGG ATTCTTTAGC GGTGTCATTC CGCGACGTAA GAATTTTTTC CGCTATCAGT TGGCTGTTGT CTGGCATGGT CAGCAAAACC TGATAGATGA TCCTTACCGT TTTGGTCCGC TAATCCAGGA AATGGATGCC TGGCTATTAT CTGAAGGTAC TCACCTGCGC CCGTATGAAA CCTTAGGCGC GCATGCAGAT ACTATGGATG GCGTCACAGG TACGCGTTTT TCTGTCTGGG CTCCAAACGC CCGTCGGGTC TCGGTGGTTG GGCAATTCAA CTACTGGGAC GGTCGCCGTC ATCCGATGCG CCTGCGTAAA GAGAGCGGCA TCTGGGAACT GTTTATCCCT GGGGCGCATA ACGGTCAGCT CTATAAATAC GAGATGATTG ATGCCAATGG CAACTTGCGT CTGAAGTCCG ACCCTTATGC CTTCGAAGCG CAAATGCGCC CGGAAACCGC GTCTCTTATT TGCGGGCTGC CGAAAAAGGT TGTACAGACT GAAGAGCGCA AAAAAGCGAA TCAGTTTGAT GCGCCAATCT CTATTTATGA AGTTCACCTG GGCTCCTGGC GTCGCCACAC CGACAACAAT TTCTGGTTAA GCTACCGCGA GCTGGCCGAT CAACTTGTGC CTTATGCTAA ATGGATGGGC TTTACCCATC TCGAGCTACT GCCCATTAAC GAGCATCCGT TCGATGGCAG TTGGGGTTAT CAGCCAACCG GCCTGTATGC ACCGACCCGC CGTTTTGGTA CCCGCGACGA CTTCCGTTAT TTCATTGATG CCGCACACGC AGCTGGTCTG AACGTGATTC TCGACTGGGT GCCAGGCCAC TTCCCGACCG ATGACTTTGC GCTTGCCGAA TTTGATGGCA CGAACTTGTA TGAACACAGC GATCCGCGCG AAGGCTATCA TCAGGACTGG AACACGCTGA TCTACAACTA TGGTCGCCGT GAAGTCAGTA ACTTCCTTGT CGGTAACGCG CTTTACTGGA TCGAACGTTT TGGTATTGAT GCGCTGCGCG TCGATGCGGT GGCGTCAATG ATTTATCGCG ACTACAGCCG TAAAGAGGGG GAGTGGATCC CGAACGAATT TGGCGGTCGC GAGAATCTTG AAGCGATTGA ATTCTTGCGT AATACCAACC GTATTCTTGG TGAGCAGGTT TCCGGTGCAG TGACAATGGC GGAGGAGTCT ACCGATTTCC CTGGCGTTTC TCGTCCGCAG GACATGGGCG GTCTGGGCTT CTGGTACAAG TGGAACCTCG GCTGGATGCA TGACACCCTG GACTACATGA AGCTCGACCC AGTTTATCGT CAGTATCATC ACGATAAACT GACCTTCGGG ATGCTCTACA ACTACACTGA AAACTTCGTC CTGCCGTTGT CGCATGATGA AGTGGTCCAC GGTAAAAAAT CGATTCTCGA CCGGATGCCG GGCGACGCAT GGCAGAAATT CGCTAACCTG CGCGCCTACT ACGGCTGGAT GTGGGCATTC CCGGGCAAGA AATTGCTGTT CATGGGGAAC GAATTTGCCC AGGGCCGCGA GTGGAACCAT GACGCCAGCC TCGACTGGCA TCTGTTGGAA GGCGGCGATA ACTGGCACCA CGGTGTCCAG CGTCTGGTGC GCGATCTGAA CCTCACCTAC CGCCACCATA AAGCAATGCA TGAACTGGAT TTTGACCCGT ACGGCTTTGA ATGGCTGGTG GTGGATGACA AAGAACGCTC GGTGCTGATC TTTGTGCGTC GCGATAAAGA GGGTAACGAA ATCATCGTTG CCAGTAACTT TACTCCGGTG CCGCGTCATG ATTATCGCTT CGGCATTAAC CAGCCGGGTA AATGGCGTGA GATCCTCAAT ACCGATTCCA TGCACTATCA CGGCAGTAAT GCAGGCAATG GCGGCACGGT ACACAGCGAT GAGATTGCCA GCCACGGTCG TCAGCATTCA CTAAGCCTGA CGCTACCACC TCTGGCCACT ATCTGGCTGG TTCGGGAGGC AGAATGA
|
Protein sequence | MSDRIDRDVI NALIAGHFAD PFSVLGMHKT TAGLEVRALL PDATDVWVIE PKTGRKLAKL ECLDSRGFFS GVIPRRKNFF RYQLAVVWHG QQNLIDDPYR FGPLIQEMDA WLLSEGTHLR PYETLGAHAD TMDGVTGTRF SVWAPNARRV SVVGQFNYWD GRRHPMRLRK ESGIWELFIP GAHNGQLYKY EMIDANGNLR LKSDPYAFEA QMRPETASLI CGLPKKVVQT EERKKANQFD APISIYEVHL GSWRRHTDNN FWLSYRELAD QLVPYAKWMG FTHLELLPIN EHPFDGSWGY QPTGLYAPTR RFGTRDDFRY FIDAAHAAGL NVILDWVPGH FPTDDFALAE FDGTNLYEHS DPREGYHQDW NTLIYNYGRR EVSNFLVGNA LYWIERFGID ALRVDAVASM IYRDYSRKEG EWIPNEFGGR ENLEAIEFLR NTNRILGEQV SGAVTMAEES TDFPGVSRPQ DMGGLGFWYK WNLGWMHDTL DYMKLDPVYR QYHHDKLTFG MLYNYTENFV LPLSHDEVVH GKKSILDRMP GDAWQKFANL RAYYGWMWAF PGKKLLFMGN EFAQGREWNH DASLDWHLLE GGDNWHHGVQ RLVRDLNLTY RHHKAMHELD FDPYGFEWLV VDDKERSVLI FVRRDKEGNE IIVASNFTPV PRHDYRFGIN QPGKWREILN TDSMHYHGSN AGNGGTVHSD EIASHGRQHS LSLTLPPLAT IWLVREAE
|
| |