Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BCG9842_B1694 |
Symbol | |
ID | 7185352 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Bacillus cereus G9842 |
Kingdom | Bacteria |
Replicon accession | NC_011772 |
Strand | - |
Start bp | 3440459 |
End bp | 3441769 |
Gene Length | 1311 bp |
Protein Length | 436 aa |
Translation table | 11 |
GC content | 34% |
IMG OID | 643551347 |
Product | glycosidase, family 5 |
Protein accession | YP_002447017 |
Protein GI | 218898606 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2730] Endoglucanase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.00000000000120172 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGAAATTT GCCATATTCC TGTAACAAAA CAAAGGTATG TTATGTGTGT TAAATATAAA GAGAAAGAAG GAAAAGTTGT GAAAAAAATT TTGCCAATTG TTGCGTTATT AGGCATGATG AGTTTTGGAG TACAGGAAAT GAATGTAAGA GCTGATACAT ACCACAAGGG CGATTCGAAA ATTAGTTTTT GGGATTCGAA AAGAAAGGGT ACTAATTTCA TGAATAGTAC GTCATTACCT GAAAACTATA AAAGTGCAAA AGAAGCTAAT ATTGAATATG TACGTTTAGC ACCTGATAAA TGGGCAAAAG ATAAAGATTT TCTATTTGAG GATAAACCAG ATACTTCTGG AAAGGATTTT CTGATAGGTA ATGCAGATAA CTATCAGGGA TTAGTAAAGG AGGATTTAGA AAAATTAAAG GCGGATTTAG ATGCCGCACA ATCACAAGGA ATGAAAGTTG TTCTTACAAT GTTATCTTTA CCTGGTGATC GATGGCGCCA ATTTAATAAT AACAAGAATG ACGACAGAAT ATGGGAAGAA GAGAGGTATC AAGAACAAGC AAGTCAATTT TGGAAGGACC TTGCTCTGGA ATTAAAAGAT TATCCTGCGG TGGTGGGTTA TAATATTATA AATGAACCAC ATCCAGAAAC AGCTAAAAAT AATAGATATA ATGATTTTTG GACAGAAGAT TACGAGAAAT GGTATGCAAA AGTGAAGGGA ACGACAGCGG ATTTAAACAG ATTGTATCAA AAAGTAATCA ATTCCATTCG TGAAGTAGAC CAAGAAACAC CAATTATTTT AGATTCAGGT TTATATGCTA CTCCATGGGC TTTTAAATAT TTAAAACCAG TAAAGGATAA AAAAACGCTT TACGCATTTC ATATGTATGA ACCATATGAA TTAACGAGTC AAGGTGAAAA GAAAAATAAA GAATATCAAT ATCCAGGATT AGTAAAAGTA GGAGACTTAG AGAAACCTGT AATGTGGAAT AAGCAGGGAT TAGAGAAATT TTTGAAGCCA ATCCAACAAT GGTCTAAGAA AAATCATGTA TCATCTAATC GAATTATTGC AGAGGAGTTT GGAATTAACC GTACTGTTCC GGGAGCTACC CAATACATGC AAGATCTTAT TTCTATCTTC AACCAAAAAG GTTGGCATAA ATCATTCTAT GCATTCCGTG AAGACACATG GACAGGGATG AATTATGAAT TGGGAACAGG AAAAATAAAA TGGGATGAAG AGGGTAAACC GGTGCCTCAA GATAATTCAC TCTGGGAAGT AATAAAAAAA GATTTACAAC CACATAAATA G
|
Protein sequence | MEICHIPVTK QRYVMCVKYK EKEGKVVKKI LPIVALLGMM SFGVQEMNVR ADTYHKGDSK ISFWDSKRKG TNFMNSTSLP ENYKSAKEAN IEYVRLAPDK WAKDKDFLFE DKPDTSGKDF LIGNADNYQG LVKEDLEKLK ADLDAAQSQG MKVVLTMLSL PGDRWRQFNN NKNDDRIWEE ERYQEQASQF WKDLALELKD YPAVVGYNII NEPHPETAKN NRYNDFWTED YEKWYAKVKG TTADLNRLYQ KVINSIREVD QETPIILDSG LYATPWAFKY LKPVKDKKTL YAFHMYEPYE LTSQGEKKNK EYQYPGLVKV GDLEKPVMWN KQGLEKFLKP IQQWSKKNHV SSNRIIAEEF GINRTVPGAT QYMQDLISIF NQKGWHKSFY AFREDTWTGM NYELGTGKIK WDEEGKPVPQ DNSLWEVIKK DLQPHK
|
| |