Gene ECD_03763 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECD_03763 
SymbolyihQ 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli BL21(DE3) 
KingdomBacteria 
Replicon accessionCP001509 
Strand
Start bp3973934 
End bp3975937 
Gene Length2004 bp 
Protein Length667 aa 
Translation table11 
GC content54% 
IMG OID 
Productalpha-glucosidase 
Protein accessionACT45556 
Protein GI253979886 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGAATCA TTTTACATAA GCCTGTAATA ACAGGAAGGC AACAACGTCT TATTTTAACC 
CATAGCAAAG ATAATCCTTG TTTATGGATT GGCTCAGGTA TAGCGGATAT CGATATGTTC
CGCGGTAATT TCAGCATTAA AGATAAACTA CAGGAGAAAA TTGCGCTTAC CGACGCCATC
GTCAGCCAGT CACCGGATGG TTGGTTAATT CATTTCAGCC GTGGTTCTGA CATTAGCGCC
ACGCTGAATA TCTCTGCCGA CGATCAGGGG CGTTTATTGC TGGAACTACA AAACGACAAC
CTTAACCACA ACCGTATCTG GCTGCGCCTT GCCGCTCAAC CAGAGGACCA TATCTACGGC
TGCGGCGAAC AGTTTTCCTA CTTCGATCTG CGTGGCAAAC CGTTCCCGCT ATGGACCAGT
GAACAAGGCG TTGGTCGCAA CAAACAAACC TATGTCACCT GGCAGGCCGA CTGCAAAGAA
AATGCGGGCG GCGACTATTA CTGGACTTTC TTCCCACAGC CTACGTTTGT CAGCACGCAG
AAGTATTACT GCCATGTTGA TAACAGTTGC TATATGAACT TCGACTTTAG TGCCCCGGAA
TACCATGAAC TGGCGCTGTG GGAAGACAAA GCAACGCTGC GTTTTGAATG TGCTGACACA
TACATTTCCC TGCTGGAAAA ATTAACCGCC CTGCTGGGAC GCCAGCCAGA ACTGCCCGAC
TGGATTTATG ACGGAGTAAC GCTCGGCATT CAGGGCGGGA CGGAAGTGTG CCAGAAGAAA
CTGGACACCA TGCGTAACGC GGGCGTGAAG GTCAACGGCA TCTGGGCGCA GGACTGGTCC
GGTATTCGTA TGACCTCTTT TGGCAAACGC GTGATGTGGA ACTGGAAGTG GAACAGCGAA
AACTACCCGC AACTGGATTC ACGCATTAAG CAGTGGAATC AGGAGGGCGT GCAGTTCCTG
GCCTATATCA ACCCGTATGT TGCCAGCGAT AAAGATCTCT GCGAAGAAGC GGCACAACAC
GGCTATCTGG CAAAAGATGC CTCTGGCGGT GACTATCTGG TGGAGTTTGG CGAGTTTTAC
GGCGGCGTTG TCGATCTCAC TAATCCAGAA GCCTACGCCT GGTTCAAGGA AGTGATCAAA
AAGAACATGA TTGAACTCGG CTGCGGCGGC TGGATGGCTG ACTTCGGCGA GTATCTGCCC
ACCGACACGT ACCTGCATAA CGGCGTCAGT GCCGAAATTA TGCATAACGC CTGGCCTGCG
CTGTGGGCGA AGTGTAACTA CGAAGCCCTT GAAGAAACGG GCAAGCTCAG CGAGATCCTT
TTCTTTATGC GCGCCGGTTC TACCGGTAGC CAGAAATACT CCACCATGAT GTGGGCGGGC
GACCAGAACG TCGACTGGAG TCTCGACGAT GGCCTGGCGT CGGTTGTCCC GGCGGCGCTG
TCGCTGGCAA TGACCGGACA TGGCCTGCAC CACAGCGACA TTGGCGGTTA CACCACCCTG
TTTGAGATGA AGCGCAGCAA AGAGCTGCTG CTGCGCTGGT GCGATTTCAG CGCCTTCACG
CCGATGATGC GCACCCACGA AGGTAACCGT CCTGGCGACA ACTGGCAGTT TGACGGCGAC
GCAGAAACCA TCGCCCATTT CGCCCGTATG ACCACCGTCT TCACCACCCT GAAACCTTAC
CTGAAAGAGG CCGTCGCGCT GAATGCGAAG TCCGGCCTGC CGGTTATGCG CCCGCTGTTC
CTGCATTACG AAGACGATGC GCACACTTAC ACCCTGAAAT ATCAGTACCT GTTAGGTCGC
GACATTCTGG TCGCTCCGGT GCATGAAGAA GGCCGTAGCG ACTGGACGCT CTATCTGCCG
GAGGATAACT GGGTCCACGC CTGGACGGGT GAAGCGTTCC GGGGCGGGGA AGTTACCGTT
AATGCGCCCA TCGGCAAGCC GCCGGTCTTT TATCGCGCCG ATAGCGAATG GGCGGCACTG
TTCGCGTCGT TAAAAAGCAT CTAA
 
Protein sequence
MRIILHKPVI TGRQQRLILT HSKDNPCLWI GSGIADIDMF RGNFSIKDKL QEKIALTDAI 
VSQSPDGWLI HFSRGSDISA TLNISADDQG RLLLELQNDN LNHNRIWLRL AAQPEDHIYG
CGEQFSYFDL RGKPFPLWTS EQGVGRNKQT YVTWQADCKE NAGGDYYWTF FPQPTFVSTQ
KYYCHVDNSC YMNFDFSAPE YHELALWEDK ATLRFECADT YISLLEKLTA LLGRQPELPD
WIYDGVTLGI QGGTEVCQKK LDTMRNAGVK VNGIWAQDWS GIRMTSFGKR VMWNWKWNSE
NYPQLDSRIK QWNQEGVQFL AYINPYVASD KDLCEEAAQH GYLAKDASGG DYLVEFGEFY
GGVVDLTNPE AYAWFKEVIK KNMIELGCGG WMADFGEYLP TDTYLHNGVS AEIMHNAWPA
LWAKCNYEAL EETGKLSEIL FFMRAGSTGS QKYSTMMWAG DQNVDWSLDD GLASVVPAAL
SLAMTGHGLH HSDIGGYTTL FEMKRSKELL LRWCDFSAFT PMMRTHEGNR PGDNWQFDGD
AETIAHFARM TTVFTTLKPY LKEAVALNAK SGLPVMRPLF LHYEDDAHTY TLKYQYLLGR
DILVAPVHEE GRSDWTLYLP EDNWVHAWTG EAFRGGEVTV NAPIGKPPVF YRADSEWAAL
FASLKSI