Gene EcHS_A3632 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A3632 
SymbolglgB 
ID5594225 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp3618353 
End bp3620539 
Gene Length2187 bp 
Protein Length728 aa 
Translation table11 
GC content53% 
IMG OID640922748 
Productglycogen branching enzyme 
Protein accessionYP_001460229 
Protein GI157162911 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0296] 1,4-alpha-glucan branching enzyme 
TIGRFAM ID[TIGR01515] alpha-1,4-glucan:alpha-1,4-glucan 6-glycosyltransferase 


Plasmid Coverage information

Num covering plasmid clones59 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCCGATC GTATCGATAG AGACGTGATT AACGCGCTAA TTGCAGGCCA TTTTGCGGAT 
CCTTTTTCCG TACTGGGAAT GCATAAAACC ACCGCGGGAC TGGAAGTCCG TGCCCTTTTA
CCCGACGCTA CCGATGTGTG GGTGATTGAA CCGAAAACCG GGCGCAAACT CGCAAAACTG
GAGTGTCTCG ACTCACGGGG ATTCTTTAGC GGCGTCATTC CGCGACGTAA GAATTTTTTC
CGCTATCAGT TGGCTGTTGT CTGGCATGGT CAGCAAAACC TGATTGATGA TCCTTACCGT
TTTGGTCCGC TAATCCAGGA AATGGATGCC TGGCTATTAT CTGAAGGTAC TCACCTGCGC
CCGTATGAAA CCTTAGGCGC GCATGCAGAT ACTATGGATG GCGTCACAGG TACGCGTTTC
TCTGTCTGGG CTCCAAACGC CCGTCGGGTC TCGGTGGTTG GGCAATTCAA CTACTGGGAC
GGTCGCCGTC ACCCGATGCG CCTGCGTAAA GAGAGCGGCA TCTGGGAACT GTTTATCCCT
GGGGCGCATA ACGGTCAGCT CTATAAATAC GAGATGATTG ATGCCAATGG CAACTTGCGT
CTGAAGTCCG ACCCTTATGC CTTCGAAGCG CAAATGCGCC CGGAAACCGC GTCTCTTATT
TGCGGGCTGC CGGAAAAGGT TGTACAGACT GAAGAGCGCA AAAAAGCGAA TCAGTTTGAT
GCGCCAATCT CTATTTATGA AGTTCACCTG GGTTCCTGGC GTCGCCACAC CGACAACAAT
TTCTGGTTGA GCTACCGCGA GCTGGCCGAT CAACTGGTGC CTTATGCTAA ATGGATGGGC
TTTACCCACC TCGAACTACT GCCCATTAAC GAGCATCCCT TCGATGGCAG TTGGGGTTAT
CAGCCAACCG GCCTGTATGC ACCAACCCGC CGTTTTGGTA CTCGCGACGA CTTCCGTTAT
TTCATTGATG CCGCACACGC AGCTGGTCTG AACGTGATTC TCGACTGGGT GCCAGGCCAC
TTCCCGACCG ATGACTTTGC GCTTGCCGAA TTTGATGGCA CGAACTTGTA TGAACACAGC
GATCCGCGTG AAGGCTATCA TCAGGACTGG AACACGCTGA TCTACAACTA TGGTCGCCGT
GAAGTCAGTA ACTTCCTCGT CGGTAACGCG CTTTACTGGA TTGAACGTTT TGGTATTGAT
GCGCTGCGCG TCGATGCGGT GGCGTCAATG ATTTATCGCG ACTACAGCCG TAAAGAGGGG
GAGTGGATCC CGAACGAATT TGGCGGGCGC GAGAATCTTG AAGCGATTGA ATTCTTGCGT
AATACCAACC GTATTCTTGG TGAGCAGGTT TCCGGTGCGG TGACAATGGC TGAGGAGTCT
ACCGATTTCC CTGGCGTTTC TCGTCCGCAG GATATGGGCG GTCTGGGCTT CTGGTACAAG
TGGAACCTCG GCTGGATGCA TGACACCCTG GACTACATGA AGCTCGACCC GGTTTATCGT
CAGTATCATC ACGATAAACT GACCTTCGGG ATTCTCTACA ACTACACTGA AAACTTCGTC
CTGCCGTTGT CGCATGATGA AGTGGTCCAC GGTAAAAAAT CGATTCTCGA CCGCATGCCG
GGCGACGCAT GGCAGAAATT CGCGAACCTG CGCGCCTACT ATGGCTGGAT GTGGGCATTC
CCGGGCAAGA AACTACTGTT CATGGGTAAC GAATTTGCCC AGGGCCGCGA GTGGAACCAT
GACGCCAGCC TCGACTGGCA TCTGTTGGAA GGCGGCGATA ACTGGCACCA CGGTGTCCAG
CGTCTGGTGC GCGATCTGAA CCTCACCTAC CGCCACCATA AAGCAATGCA TGAACTGGAT
TTTGACCCGT ACGGCTTTGA ATGGCTGGTG GTGGATGACA AAGAACGCTC GGTGCTGATC
TTTGTGCGTC GCGATAAAGA GGGTAACGAA ATCATCGTTG CCAGTAACTT TACGCCGGTA
CCGCGTCATG ATTATCGCTT CGGCATTAAT CAGCCGGGCA AATGGCGTGA AATCCTCAAT
ACCGATTCCA TGCACTATCA CGGCAGTAAT GCAGGCAATG GCGGCACGGT ACACAGCGAT
GAGATTGCCA GCCACGGTCG TCAGCATTCA CTAAGCCTGA CGCTACCACC GCTGGCCACT
ATCTGGCTGG TTCGGGAGGC AGAATGA
 
Protein sequence
MSDRIDRDVI NALIAGHFAD PFSVLGMHKT TAGLEVRALL PDATDVWVIE PKTGRKLAKL 
ECLDSRGFFS GVIPRRKNFF RYQLAVVWHG QQNLIDDPYR FGPLIQEMDA WLLSEGTHLR
PYETLGAHAD TMDGVTGTRF SVWAPNARRV SVVGQFNYWD GRRHPMRLRK ESGIWELFIP
GAHNGQLYKY EMIDANGNLR LKSDPYAFEA QMRPETASLI CGLPEKVVQT EERKKANQFD
APISIYEVHL GSWRRHTDNN FWLSYRELAD QLVPYAKWMG FTHLELLPIN EHPFDGSWGY
QPTGLYAPTR RFGTRDDFRY FIDAAHAAGL NVILDWVPGH FPTDDFALAE FDGTNLYEHS
DPREGYHQDW NTLIYNYGRR EVSNFLVGNA LYWIERFGID ALRVDAVASM IYRDYSRKEG
EWIPNEFGGR ENLEAIEFLR NTNRILGEQV SGAVTMAEES TDFPGVSRPQ DMGGLGFWYK
WNLGWMHDTL DYMKLDPVYR QYHHDKLTFG ILYNYTENFV LPLSHDEVVH GKKSILDRMP
GDAWQKFANL RAYYGWMWAF PGKKLLFMGN EFAQGREWNH DASLDWHLLE GGDNWHHGVQ
RLVRDLNLTY RHHKAMHELD FDPYGFEWLV VDDKERSVLI FVRRDKEGNE IIVASNFTPV
PRHDYRFGIN QPGKWREILN TDSMHYHGSN AGNGGTVHSD EIASHGRQHS LSLTLPPLAT
IWLVREAE