Gene EcSMS35_3714 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_3714 
SymbolglgB 
ID6143963 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp3781702 
End bp3783888 
Gene Length2187 bp 
Protein Length728 aa 
Translation table11 
GC content53% 
IMG OID641618540 
Productglycogen branching enzyme 
Protein accessionYP_001745680 
Protein GI170680453 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0296] 1,4-alpha-glucan branching enzyme 
TIGRFAM ID[TIGR01515] alpha-1,4-glucan:alpha-1,4-glucan 6-glycosyltransferase 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.470927 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones46 
Fosmid unclonability p-value0.781278 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCGATC GTATCGATAG AGACGTGATT AACGCGCTAA TTGCAGGCCA TTTTGCGGAT 
CCTTTTTCCG TACTGGGGAT GCATAAAACC ACCGCGGGAC TGGAAGTCCG TGCCCTTTTA
CCCGACGCTA CCGATGTGTG GGTGATTGAA CCGAAAACCG GGCGCAAACT CGCAAAACTG
GAGTGTCTCG ACTCCCGTGG ATTCTTTAGC GGTGTCATTC CGCGACGTAA GAATTTTTTC
CGCTATCAGT TGGCTGTTGT CTGGCATGGT CAGCAAAACC TGATAGATGA TCCTTACCGT
TTTGGTCCGC TAATCCAGGA AATGGATGCC TGGCTATTAT CTGAAGGTAC TCACCTGCGC
CCGTATGAAA CCTTAGGCGC GCATGCAGAT ACTATGGATG GCGTCACAGG TACGCGTTTT
TCTGTCTGGG CTCCAAACGC CCGTCGGGTC TCGGTGGTTG GGCAATTCAA CTACTGGGAC
GGTCGCCGTC ATCCGATGCG CCTGCGTAAA GAGAGCGGCA TCTGGGAACT GTTTATCCCT
GGGGCGCATA ACGGTCAGCT CTATAAATAC GAGATGATTG ATGCCAATGG CAACTTGCGT
CTGAAGTCCG ACCCTTATGC CTTCGAAGCG CAAATGCGCC CGGAAACCGC GTCTCTTATT
TGCGGGCTGC CGAAAAAGGT TGTACAGACT GAAGAGCGCA AAAAAGCGAA TCAGTTTGAT
GCGCCAATCT CTATTTATGA AGTTCACCTG GGCTCCTGGC GTCGCCACAC CGACAACAAT
TTCTGGTTAA GCTACCGCGA GCTGGCCGAT CAACTTGTGC CTTATGCTAA ATGGATGGGC
TTTACCCATC TCGAGCTACT GCCCATTAAC GAGCATCCGT TCGATGGCAG TTGGGGTTAT
CAGCCAACCG GCCTGTATGC ACCGACCCGC CGTTTTGGTA CCCGCGACGA CTTCCGTTAT
TTCATTGATG CCGCACACGC AGCTGGTCTG AACGTGATTC TCGACTGGGT GCCAGGCCAC
TTCCCGACCG ATGACTTTGC GCTTGCCGAA TTTGATGGCA CGAACTTGTA TGAACACAGC
GATCCGCGCG AAGGCTATCA TCAGGACTGG AACACGCTGA TCTACAACTA TGGTCGCCGT
GAAGTCAGTA ACTTCCTTGT CGGTAACGCG CTTTACTGGA TCGAACGTTT TGGTATTGAT
GCGCTGCGCG TCGATGCGGT GGCGTCAATG ATTTATCGCG ACTACAGCCG TAAAGAGGGG
GAGTGGATCC CGAACGAATT TGGCGGTCGC GAGAATCTTG AAGCGATTGA ATTCTTGCGT
AATACCAACC GTATTCTTGG TGAGCAGGTT TCCGGTGCAG TGACAATGGC GGAGGAGTCT
ACCGATTTCC CTGGCGTTTC TCGTCCGCAG GACATGGGCG GTCTGGGCTT CTGGTACAAG
TGGAACCTCG GCTGGATGCA TGACACCCTG GACTACATGA AGCTCGACCC AGTTTATCGT
CAGTATCATC ACGATAAACT GACCTTCGGG ATGCTCTACA ACTACACTGA AAACTTCGTC
CTGCCGTTGT CGCATGATGA AGTGGTCCAC GGTAAAAAAT CGATTCTCGA CCGGATGCCG
GGCGACGCAT GGCAGAAATT CGCTAACCTG CGCGCCTACT ACGGCTGGAT GTGGGCATTC
CCGGGCAAGA AATTGCTGTT CATGGGGAAC GAATTTGCCC AGGGCCGCGA GTGGAACCAT
GACGCCAGCC TCGACTGGCA TCTGTTGGAA GGCGGCGATA ACTGGCACCA CGGTGTCCAG
CGTCTGGTGC GCGATCTGAA CCTCACCTAC CGCCACCATA AAGCAATGCA TGAACTGGAT
TTTGACCCGT ACGGCTTTGA ATGGCTGGTG GTGGATGACA AAGAACGCTC GGTGCTGATC
TTTGTGCGTC GCGATAAAGA GGGTAACGAA ATCATCGTTG CCAGTAACTT TACTCCGGTG
CCGCGTCATG ATTATCGCTT CGGCATTAAC CAGCCGGGTA AATGGCGTGA GATCCTCAAT
ACCGATTCCA TGCACTATCA CGGCAGTAAT GCAGGCAATG GCGGCACGGT ACACAGCGAT
GAGATTGCCA GCCACGGTCG TCAGCATTCA CTAAGCCTGA CGCTACCACC TCTGGCCACT
ATCTGGCTGG TTCGGGAGGC AGAATGA
 
Protein sequence
MSDRIDRDVI NALIAGHFAD PFSVLGMHKT TAGLEVRALL PDATDVWVIE PKTGRKLAKL 
ECLDSRGFFS GVIPRRKNFF RYQLAVVWHG QQNLIDDPYR FGPLIQEMDA WLLSEGTHLR
PYETLGAHAD TMDGVTGTRF SVWAPNARRV SVVGQFNYWD GRRHPMRLRK ESGIWELFIP
GAHNGQLYKY EMIDANGNLR LKSDPYAFEA QMRPETASLI CGLPKKVVQT EERKKANQFD
APISIYEVHL GSWRRHTDNN FWLSYRELAD QLVPYAKWMG FTHLELLPIN EHPFDGSWGY
QPTGLYAPTR RFGTRDDFRY FIDAAHAAGL NVILDWVPGH FPTDDFALAE FDGTNLYEHS
DPREGYHQDW NTLIYNYGRR EVSNFLVGNA LYWIERFGID ALRVDAVASM IYRDYSRKEG
EWIPNEFGGR ENLEAIEFLR NTNRILGEQV SGAVTMAEES TDFPGVSRPQ DMGGLGFWYK
WNLGWMHDTL DYMKLDPVYR QYHHDKLTFG MLYNYTENFV LPLSHDEVVH GKKSILDRMP
GDAWQKFANL RAYYGWMWAF PGKKLLFMGN EFAQGREWNH DASLDWHLLE GGDNWHHGVQ
RLVRDLNLTY RHHKAMHELD FDPYGFEWLV VDDKERSVLI FVRRDKEGNE IIVASNFTPV
PRHDYRFGIN QPGKWREILN TDSMHYHGSN AGNGGTVHSD EIASHGRQHS LSLTLPPLAT
IWLVREAE