Gene BCG9842_B1694 details


Gene Information

Locus tag: BCG9842_B1694
Symbol:
ID: 7185352
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Bacillus cereus G9842
Kingdom: Bacteria
Replicon accession: NC_011772
Strand:
Start bp: 3440459
End bp: 3441769
Gene Length: 1311 bp
Protein Length: 436 aa
Translation table: 11
GC content: 34%
IMG OID: 643551347
Product: glycosidase, family 5
Protein accession: YP_002447017
Protein GI: 218898606
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2730] Endoglucanase
TIGRFAM ID:
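
The coordinates and lengths listed above are mutually consistent, which a little arithmetic can confirm. The sketch below assumes that the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length; the variable names are illustrative only and are not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 3440459
end_bp = 3441769

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
protein_length = gene_length // 3 - 1

print(gene_length)     # 1311
print(protein_length)  # 436
```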


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.00000000000120172
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGGAAATTT GCCATATTCC TGTAACAAAA CAAAGGTATG TTATGTGTGT TAAATATAAA 
GAGAAAGAAG GAAAAGTTGT GAAAAAAATT TTGCCAATTG TTGCGTTATT AGGCATGATG
AGTTTTGGAG TACAGGAAAT GAATGTAAGA GCTGATACAT ACCACAAGGG CGATTCGAAA
ATTAGTTTTT GGGATTCGAA AAGAAAGGGT ACTAATTTCA TGAATAGTAC GTCATTACCT
GAAAACTATA AAAGTGCAAA AGAAGCTAAT ATTGAATATG TACGTTTAGC ACCTGATAAA
TGGGCAAAAG ATAAAGATTT TCTATTTGAG GATAAACCAG ATACTTCTGG AAAGGATTTT
CTGATAGGTA ATGCAGATAA CTATCAGGGA TTAGTAAAGG AGGATTTAGA AAAATTAAAG
GCGGATTTAG ATGCCGCACA ATCACAAGGA ATGAAAGTTG TTCTTACAAT GTTATCTTTA
CCTGGTGATC GATGGCGCCA ATTTAATAAT AACAAGAATG ACGACAGAAT ATGGGAAGAA
GAGAGGTATC AAGAACAAGC AAGTCAATTT TGGAAGGACC TTGCTCTGGA ATTAAAAGAT
TATCCTGCGG TGGTGGGTTA TAATATTATA AATGAACCAC ATCCAGAAAC AGCTAAAAAT
AATAGATATA ATGATTTTTG GACAGAAGAT TACGAGAAAT GGTATGCAAA AGTGAAGGGA
ACGACAGCGG ATTTAAACAG ATTGTATCAA AAAGTAATCA ATTCCATTCG TGAAGTAGAC
CAAGAAACAC CAATTATTTT AGATTCAGGT TTATATGCTA CTCCATGGGC TTTTAAATAT
TTAAAACCAG TAAAGGATAA AAAAACGCTT TACGCATTTC ATATGTATGA ACCATATGAA
TTAACGAGTC AAGGTGAAAA GAAAAATAAA GAATATCAAT ATCCAGGATT AGTAAAAGTA
GGAGACTTAG AGAAACCTGT AATGTGGAAT AAGCAGGGAT TAGAGAAATT TTTGAAGCCA
ATCCAACAAT GGTCTAAGAA AAATCATGTA TCATCTAATC GAATTATTGC AGAGGAGTTT
GGAATTAACC GTACTGTTCC GGGAGCTACC CAATACATGC AAGATCTTAT TTCTATCTTC
AACCAAAAAG GTTGGCATAA ATCATTCTAT GCATTCCGTG AAGACACATG GACAGGGATG
AATTATGAAT TGGGAACAGG AAAAATAAAA TGGGATGAAG AGGGTAAACC GGTGCCTCAA
GATAATTCAC TCTGGGAAGT AATAAAAAAA GATTTACAAC CACATAAATA G
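
The GC content reported in the Gene Information section is the share of G and C bases in the gene sequence. A minimal sketch of that calculation follows; `gc_content` is a hypothetical helper name, and the demo runs on only the first 30 bases of the sequence above, so its result is not expected to match the figure for the whole gene.

```python
def gc_content(seq: str) -> float:
    """Return the fraction of G and C bases in seq, ignoring whitespace."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


# First 30 bases of the gene sequence above, used here only as a short demo.
prefix = "ATGGAAATTT GCCATATTCC TGTAACAAAA"
print(f"{gc_content(prefix):.0%}")  # 30%
```

Pasting the full 1311 bp sequence into `prefix` gives the value for the whole gene.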
 
Protein sequence
MEICHIPVTK QRYVMCVKYK EKEGKVVKKI LPIVALLGMM SFGVQEMNVR ADTYHKGDSK 
ISFWDSKRKG TNFMNSTSLP ENYKSAKEAN IEYVRLAPDK WAKDKDFLFE DKPDTSGKDF
LIGNADNYQG LVKEDLEKLK ADLDAAQSQG MKVVLTMLSL PGDRWRQFNN NKNDDRIWEE
ERYQEQASQF WKDLALELKD YPAVVGYNII NEPHPETAKN NRYNDFWTED YEKWYAKVKG
TTADLNRLYQ KVINSIREVD QETPIILDSG LYATPWAFKY LKPVKDKKTL YAFHMYEPYE
LTSQGEKKNK EYQYPGLVKV GDLEKPVMWN KQGLEKFLKP IQQWSKKNHV SSNRIIAEEF
GINRTVPGAT QYMQDLISIF NQKGWHKSFY AFREDTWTGM NYELGTGKIK WDEEGKPVPQ
DNSLWEVIKK DLQPHK
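
The protein sequence follows from the gene sequence by reading it codon by codon with translation table 11. The sketch below builds the codon table from the NCBI amino-acid string for that table and translates two short excerpts of the gene; `translate` is a hypothetical helper, and it does not handle the alternative start codons that table 11 permits (not needed here, since this gene starts with ATG).

```python
from itertools import product

# NCBI ordering: bases T, C, A, G, with the first codon position varying slowest.
BASES = "TCAG"
# Amino-acid string for translation table 11 ('*' marks a stop codon).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    bases = "".join(seq.split()).upper()
    if len(bases) % 3:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(bases), 3):
        aa = CODON_TABLE[bases[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 and last 12 bases of the gene sequence above.
first_bases = "ATGGAAATTT GCCATATTCC TGTAACAAAA"
last_bases = "CCACATAAATAG"
print(translate(first_bases))  # MEICHIPVTK
print(translate(last_bases))   # PHK
```

The two results match the first ten and last three residues of the protein sequence above.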