Gene Bcer98_2232 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBcer98_2232 
Symbol 
ID5343430 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBacillus cytotoxicus NVH 391-98 
KingdomBacteria 
Replicon accessionNC_009674 
Strand
Start bp2327405 
End bp2329264 
Gene Length1860 bp 
Protein Length619 aa 
Translation table11 
GC content39% 
IMG OID640839750 
Productsqualene/oxidosqualene cyclase 
Protein accessionYP_001375476 
Protein GI152975959 
COG category[I] Lipid transport and metabolism 
COG ID[COG1657] Squalene cyclase 
TIGRFAM ID[TIGR01507] squalene-hopene cyclase
[TIGR01787] squalene/oxidosqualene cyclases 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGGTATTAT ACGGAAGAGT GTGTGCAGAA ATAGAGCGGA CAATCACTGC ACTTCATACA 
ATGCAACAGC AAGATGGTGC ATGGCGCTTT TGTTTTGAAG GATCGCCATT AACAGATTGT
CATATGATTT TTTTACTTCG ATTATTAGAA AAGGAAGAAG AAATAGAGCC GTTTGTAGCA
AGGTTAACGT CCATACAAAC AAATGAAGGA ACTTGGAAGC TATACGAAGA TGAACGTGCT
GGCAACGTAT CTACTACGAT TCAAGCATAT GCGGCGTTAC TTGCATCTGG CATGTATACG
AAAGAGGATG TCAATATGAA GCGTGCGGAA GCTTTTATTC AAGAAAGAGG AGGAATTGCA
CGTTCTCATT TTATGACGAA GTTTTTATTA GCACTGCATG GAGGATATGA ATATCCTAGA
ATGTTTTATT TTCCAACTCC TATTTTATTC TTGCCTGAAG ATTCTCCACT TAGCATATTT
GAACTTAGTA GTTCTGCGCG TATTCATCTT ATTCCAATGA TGATTTGTAT GAACAAAAGA
TTTACTGTAT CAAAAACTAT ACTTCCGAAT TTAGATCATA TTTCAGGAAG CAGTAAGTCC
GAATGGTTTC GTGAGGACCG CTCCTCATTA TTTGAAACGA TTCTTGGAGA AGTGAAAAAG
TTTGTAACGT ATCCTTTATC TCTTCATCAT AAAGGGGACA AAGAAGCAGA GCGCTTTATG
ATAGAGAGGA TTGACAGAAA TGGTACATTA TATAGCTATG CGAGCGCTAC ATTTTATATG
ATTTATGCAC TTCTGGCGCT AGGGCATCAT ATTCAATCAC CTCTTATTCA ACAAGCAGTC
GCAGGGCTCC GAACATACAA ATGGCATATG GAAGCGGGCA TTCATTTACA AAACTCTCCA
TCTACCGTAT GGGATACGGC TTTACTCAGC TATGCATTAC AAGAAGCGAA TGTAAATGAG
AGCACACCTA TGATTCAAAC AGCAACAGAA TATATATGGC AAAGACAACA TCATGAAAAG
AAAGACTGGA GCTTGCATGC ACCTACACTT TCTCCTGGAG GATGGGGGTT CTCAGATGTG
AATACAACAA TTCCAGACGT TGATGATACA ACTGCTGCTT TAAGAGCGTT AGCACGAAGC
AGAAAAAGGA ATAGAAGAAT AGAGGAGGCT TGGAAAAAAG GGGTGAACTG GGTGAAAGGA
TTACAAAATA AAGACGGAGG ATGGGCTGCT TTTGAAAAGG GGGTAACAAA TCGGTTTCTG
ACACATTTAC CATTAGAAAA TTCTGGTGAT ATGATGACGG ATCCTTCTAC TGCAGATATT
ACAGGACGGG TTTTGGAGTT TTTTGGGACG TATGCTCCTA ATGAATTACA AGATCATCAA
AAGAATCGCG CGATTACATG GCTAATGGAT GTTCAAGAGA ACAATGGATC ATGGTACGGA
AAATGGGGGG TCAGTTATAT ATATGGTACG TGGGCTGCTC TTACAGGATT GCGAGCTGTT
GGAGTTGCGA ATACACATCC AGCTTTAAAA AAGGCAGTTA TGTGGTTAGA ACGTATACAA
CATCGCGATG GAGGCTGGGG AGAATCTTGC CGAAGTAGTA TAGAAAAAAG ATTTGTTCCG
CTATCCTTTA GTACTCCTTC CCAAACTGCA TGGGCGATTG ATGCTCTTAT TTCTTATTAT
GATGAAGAAA CACCAGTCAT TCGAAAAGGA ATTTCTTATT TATTAGAGCA CGCTGCGAGT
CATCAAGAAT ATCCTACAGG AACAGGATTA CCAAATGGAT TTTATATTCG TTATCATAGT
TATTCTTATA TGTATCCATT ACTTACATTT GCACATTATA TAAACAAATA CCGAAAATAA
 
Protein sequence
MVLYGRVCAE IERTITALHT MQQQDGAWRF CFEGSPLTDC HMIFLLRLLE KEEEIEPFVA 
RLTSIQTNEG TWKLYEDERA GNVSTTIQAY AALLASGMYT KEDVNMKRAE AFIQERGGIA
RSHFMTKFLL ALHGGYEYPR MFYFPTPILF LPEDSPLSIF ELSSSARIHL IPMMICMNKR
FTVSKTILPN LDHISGSSKS EWFREDRSSL FETILGEVKK FVTYPLSLHH KGDKEAERFM
IERIDRNGTL YSYASATFYM IYALLALGHH IQSPLIQQAV AGLRTYKWHM EAGIHLQNSP
STVWDTALLS YALQEANVNE STPMIQTATE YIWQRQHHEK KDWSLHAPTL SPGGWGFSDV
NTTIPDVDDT TAALRALARS RKRNRRIEEA WKKGVNWVKG LQNKDGGWAA FEKGVTNRFL
THLPLENSGD MMTDPSTADI TGRVLEFFGT YAPNELQDHQ KNRAITWLMD VQENNGSWYG
KWGVSYIYGT WAALTGLRAV GVANTHPALK KAVMWLERIQ HRDGGWGESC RSSIEKRFVP
LSFSTPSQTA WAIDALISYY DEETPVIRKG ISYLLEHAAS HQEYPTGTGL PNGFYIRYHS
YSYMYPLLTF AHYINKYRK