Gene BCG9842_B1652 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBCG9842_B1652 
Symbol 
ID7182064 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBacillus cereus G9842 
KingdomBacteria 
Replicon accessionNC_011772 
Strand
Start bp3494093 
End bp3495946 
Gene Length1854 bp 
Protein Length617 aa 
Translation table11 
GC content38% 
IMG OID643551389 
Productputative squalene-hopene cyclase 
Protein accessionYP_002447059 
Protein GI218898648 
COG category[I] Lipid transport and metabolism 
COG ID[COG1657] Squalene cyclase 
TIGRFAM ID[TIGR01507] squalene-hopene cyclase
[TIGR01787] squalene/oxidosqualene cyclases 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones46 
Fosmid unclonability p-value0.155844 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGTTATTAT ATGAAAAAGT GTATGAAGAA ATAGCGAGAA GAACAACTGC ACTTCAAACG 
ATGCAACGGC AAGATGGTAC GTGGCGGTTT TGTTTTGAAG GAGCGCCACT AACAGATTGT
CATATGATTT TTTTATTAAA ATTATTAGGT AGAGATAAAG AGATAGAACC GTTTGTAAAA
AGATTAGCAT CACTCCAAAC AAATGAAGGA ACATGGAAAT TGTATGAAGA TGAAGTGGGT
GGTAATTTAT CTGCTACAAT TCAATCTTAT GCTGCCTTAC TTGCATCGGA AAAATATACA
AAAGAAGATG CGAATATGAA GCGAGCGGAA ATGTTTATAA ATGAGCGCGG GGGGGTGGCG
CGTGCTCATT TTATGACGAA GTTTTTATTA GCGATTCATG GAGAATATGA ATATCCTTCT
CTCTTTCATT TGCCAACACC AATAATGTTT CTGCAGAACG ATTCCCCCCT CAGTATATTT
GAATTGAGTA GCTCAGCACG TATCCATTTA ATTCCGATGA TGTTGTGTTT AAATAAACGA
TTTCGAGTAG GGAAAAAGTT ATTGCCAAAT TTAAATCACA TTGCAGGCGG GGGCGGAGAA
TGGTTTCGGG AGGATCGGTC TCCAGTTTTT CAAACGTTAT TAAGTGAGGT GAAGAAAATT
ATAACGTATC CACTTTCTTT GCATCATAAA GGATATGAGG AAGTAGAACG TTTTATGAAA
GAGCGTATTG ATGAAAATGG AACATTATAT AGTTACGCAA CTGCCTCGTT TTATATGATT
TATGCTTTAC TTGCATTAGG GCATTCTATT CAATCGCCAA TTATTCAGAA GGCTATAACG
GGAATCGCAT CTTATATATG GAAGATGGAG AGAGGGAGCC ATTTGCAAAA CTCTCCGTCA
ACTGTATGGG ATACAGCTTT ACTCAGTTAT GCTTTGCAAG AAGCTCAAGT TCCGAAAGCA
AGTAAAGTGA TTCAAAATGC ATCAGCGTAT TTACTAAGAA AACAGCAAAC GAAGAAAGTA
GATTGGAGTG TACATGCACC GAATCTATTC CCAGGTGGTT GGGGCTTTTC GGATGTGAAT
ACGATGATTC CAGATATTGA TGATACAACT GCTGTGTTAA GAGCACTGGC GCGAAGTAGA
GGGGACGAAA ATGTAGATAA TGCTTGGAAG AGAGCGGTTA ATTGGGTTAA AGGATTGCAA
AATAATGATG GTGGTTGGGG AGCTTTTGAA AAAGGGGTAA CGAGCCGTAT ATTAGCAAAT
TTACCAATCG AAAATGCAAG TGATATGATT ACAGATCCTT CTACACCAGA TATTACAGGA
AGAGTGCTAG AGTTTTTTGG GACATATGCG CAAAATGAAT TGCCCGAGAA ACAAAAACAA
AGTGCGATAA ATTGGTTAAT GAATGTACAA GAGGAAAATG GATCATGGTA TGGGAAATGG
GGAATTTGTT ATATATATGG TACATGGGCA GTGTTGACTG GTTTACGGTC ACTAGGAATA
CCATCTAGCG ATCCATCATT AAAACGAGCA GCTTTATGGC TTGAACATAT ACAGCATGAA
GATGGTGGCT GGGGAGAATC TTGCCAAAGT AGTGTGGAAA AAAGATTTGT TACTTTGCCG
TTTAGTACAC CATCACAAAC GGCATGGGCG TTAGATGCTC TCATTTCTTA CTATGAAAAA
GAAACACCAA TCATTCGAAA AGGTATTTCA TATTTGCTCT CCAACCCTTA TGTAAATGAA
AAATATCCTA CTGGAACAGG TTTACCAGGT GGGTTTTATA TTCGTTATCA TAGTTATGCT
CATATATATC CGTTGCTTAC TTTGGCTCAT TATACAAAAA AATATAGAAA ATAA
 
Protein sequence
MLLYEKVYEE IARRTTALQT MQRQDGTWRF CFEGAPLTDC HMIFLLKLLG RDKEIEPFVK 
RLASLQTNEG TWKLYEDEVG GNLSATIQSY AALLASEKYT KEDANMKRAE MFINERGGVA
RAHFMTKFLL AIHGEYEYPS LFHLPTPIMF LQNDSPLSIF ELSSSARIHL IPMMLCLNKR
FRVGKKLLPN LNHIAGGGGE WFREDRSPVF QTLLSEVKKI ITYPLSLHHK GYEEVERFMK
ERIDENGTLY SYATASFYMI YALLALGHSI QSPIIQKAIT GIASYIWKME RGSHLQNSPS
TVWDTALLSY ALQEAQVPKA SKVIQNASAY LLRKQQTKKV DWSVHAPNLF PGGWGFSDVN
TMIPDIDDTT AVLRALARSR GDENVDNAWK RAVNWVKGLQ NNDGGWGAFE KGVTSRILAN
LPIENASDMI TDPSTPDITG RVLEFFGTYA QNELPEKQKQ SAINWLMNVQ EENGSWYGKW
GICYIYGTWA VLTGLRSLGI PSSDPSLKRA ALWLEHIQHE DGGWGESCQS SVEKRFVTLP
FSTPSQTAWA LDALISYYEK ETPIIRKGIS YLLSNPYVNE KYPTGTGLPG GFYIRYHSYA
HIYPLLTLAH YTKKYRK