Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BCG9842_B1652 |
Symbol | |
ID | 7182064 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Bacillus cereus G9842 |
Kingdom | Bacteria |
Replicon accession | NC_011772 |
Strand | - |
Start bp | 3494093 |
End bp | 3495946 |
Gene Length | 1854 bp |
Protein Length | 617 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 643551389 |
Product | putative squalene-hopene cyclase |
Protein accession | YP_002447059 |
Protein GI | 218898648 |
COG category | [I] Lipid transport and metabolism |
COG ID | [COG1657] Squalene cyclase |
TIGRFAM ID | [TIGR01507] squalene-hopene cyclase [TIGR01787] squalene/oxidosqualene cyclases |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 46 |
Fosmid unclonability p-value | 0.155844 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGTTATTAT ATGAAAAAGT GTATGAAGAA ATAGCGAGAA GAACAACTGC ACTTCAAACG ATGCAACGGC AAGATGGTAC GTGGCGGTTT TGTTTTGAAG GAGCGCCACT AACAGATTGT CATATGATTT TTTTATTAAA ATTATTAGGT AGAGATAAAG AGATAGAACC GTTTGTAAAA AGATTAGCAT CACTCCAAAC AAATGAAGGA ACATGGAAAT TGTATGAAGA TGAAGTGGGT GGTAATTTAT CTGCTACAAT TCAATCTTAT GCTGCCTTAC TTGCATCGGA AAAATATACA AAAGAAGATG CGAATATGAA GCGAGCGGAA ATGTTTATAA ATGAGCGCGG GGGGGTGGCG CGTGCTCATT TTATGACGAA GTTTTTATTA GCGATTCATG GAGAATATGA ATATCCTTCT CTCTTTCATT TGCCAACACC AATAATGTTT CTGCAGAACG ATTCCCCCCT CAGTATATTT GAATTGAGTA GCTCAGCACG TATCCATTTA ATTCCGATGA TGTTGTGTTT AAATAAACGA TTTCGAGTAG GGAAAAAGTT ATTGCCAAAT TTAAATCACA TTGCAGGCGG GGGCGGAGAA TGGTTTCGGG AGGATCGGTC TCCAGTTTTT CAAACGTTAT TAAGTGAGGT GAAGAAAATT ATAACGTATC CACTTTCTTT GCATCATAAA GGATATGAGG AAGTAGAACG TTTTATGAAA GAGCGTATTG ATGAAAATGG AACATTATAT AGTTACGCAA CTGCCTCGTT TTATATGATT TATGCTTTAC TTGCATTAGG GCATTCTATT CAATCGCCAA TTATTCAGAA GGCTATAACG GGAATCGCAT CTTATATATG GAAGATGGAG AGAGGGAGCC ATTTGCAAAA CTCTCCGTCA ACTGTATGGG ATACAGCTTT ACTCAGTTAT GCTTTGCAAG AAGCTCAAGT TCCGAAAGCA AGTAAAGTGA TTCAAAATGC ATCAGCGTAT TTACTAAGAA AACAGCAAAC GAAGAAAGTA GATTGGAGTG TACATGCACC GAATCTATTC CCAGGTGGTT GGGGCTTTTC GGATGTGAAT ACGATGATTC CAGATATTGA TGATACAACT GCTGTGTTAA GAGCACTGGC GCGAAGTAGA GGGGACGAAA ATGTAGATAA TGCTTGGAAG AGAGCGGTTA ATTGGGTTAA AGGATTGCAA AATAATGATG GTGGTTGGGG AGCTTTTGAA AAAGGGGTAA CGAGCCGTAT ATTAGCAAAT TTACCAATCG AAAATGCAAG TGATATGATT ACAGATCCTT CTACACCAGA TATTACAGGA AGAGTGCTAG AGTTTTTTGG GACATATGCG CAAAATGAAT TGCCCGAGAA ACAAAAACAA AGTGCGATAA ATTGGTTAAT GAATGTACAA GAGGAAAATG GATCATGGTA TGGGAAATGG GGAATTTGTT ATATATATGG TACATGGGCA GTGTTGACTG GTTTACGGTC ACTAGGAATA CCATCTAGCG ATCCATCATT AAAACGAGCA GCTTTATGGC TTGAACATAT ACAGCATGAA GATGGTGGCT GGGGAGAATC TTGCCAAAGT AGTGTGGAAA AAAGATTTGT TACTTTGCCG TTTAGTACAC CATCACAAAC GGCATGGGCG TTAGATGCTC TCATTTCTTA CTATGAAAAA GAAACACCAA TCATTCGAAA AGGTATTTCA TATTTGCTCT CCAACCCTTA TGTAAATGAA AAATATCCTA CTGGAACAGG TTTACCAGGT GGGTTTTATA TTCGTTATCA TAGTTATGCT CATATATATC CGTTGCTTAC TTTGGCTCAT TATACAAAAA AATATAGAAA ATAA
|
Protein sequence | MLLYEKVYEE IARRTTALQT MQRQDGTWRF CFEGAPLTDC HMIFLLKLLG RDKEIEPFVK RLASLQTNEG TWKLYEDEVG GNLSATIQSY AALLASEKYT KEDANMKRAE MFINERGGVA RAHFMTKFLL AIHGEYEYPS LFHLPTPIMF LQNDSPLSIF ELSSSARIHL IPMMLCLNKR FRVGKKLLPN LNHIAGGGGE WFREDRSPVF QTLLSEVKKI ITYPLSLHHK GYEEVERFMK ERIDENGTLY SYATASFYMI YALLALGHSI QSPIIQKAIT GIASYIWKME RGSHLQNSPS TVWDTALLSY ALQEAQVPKA SKVIQNASAY LLRKQQTKKV DWSVHAPNLF PGGWGFSDVN TMIPDIDDTT AVLRALARSR GDENVDNAWK RAVNWVKGLQ NNDGGWGAFE KGVTSRILAN LPIENASDMI TDPSTPDITG RVLEFFGTYA QNELPEKQKQ SAINWLMNVQ EENGSWYGKW GICYIYGTWA VLTGLRSLGI PSSDPSLKRA ALWLEHIQHE DGGWGESCQS SVEKRFVTLP FSTPSQTAWA LDALISYYEK ETPIIRKGIS YLLSNPYVNE KYPTGTGLPG GFYIRYHSYA HIYPLLTLAH YTKKYRK
|
| |