Gene BAS3351 details


Gene Information

Locus tag: BAS3351
Symbol:
ID: 2848197
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Bacillus anthracis str. Sterne
Kingdom: Bacteria
Replicon accession: NC_005945
Strand:
Start bp: 3321600
End bp: 3323453
Gene Length: 1854 bp
Protein Length: 617 aa
Translation table: 11
GC content: 37%
IMG OID: 637506595
Product: squalene-hopene cyclase
Protein accession: YP_029608
Protein GI: 49186356
COG category: [I] Lipid transport and metabolism
COG ID: [COG1657] Squalene cyclase
TIGRFAM ID: [TIGR01507] squalene-hopene cyclase
            [TIGR01787] squalene/oxidosqualene cyclases
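
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based inclusive coordinates and that the gene length includes the terminal stop codon; the function names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
START_BP = 3321600
END_BP = 3323453
GENE_LENGTH_BP = 1854
PROTEIN_LENGTH_AA = 617


def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose final codon is a stop."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons hold for this record.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)      # True
    print(protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)  # True
```

Under these assumptions, 3323453 - 3321600 + 1 gives 1854 bp, and 1854 / 3 gives 618 codons, one of which is the stop codon, leaving 617 residues.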


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.869066
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTGTTATTAT ACGAAAAAGC GCATGAAGAA ATAGTGAGAA GAGCAACAGC ACTTCAAACA 
ATGCAATGGC AAGATGGTAC GTGGCGATTT TGTTTTGAAG GAGCTCCATT AACAGATTGC
CATATGATTT TTTTATTAAA ATTATTAGGT AGAGATAAAG AGATAGAACC GTTCGTAGAA
AGAGTAGCAT CACTCCAAAC AAATGAAGGA ACATGGAAAT TGCACGAAGA TGAAGTAGGA
GGTAATTTAT CAGCTACAAT TCAATCTTAT GCCGCCTTAC TTGCATCGAA AAAATATACA
AAAGAAGATG CGAATATGAA ACGAGCAGAA AATTTTATTC AGGAACGCGG TGGTGTGGCG
CGTGCTCATT TTATGACGAA GTTTTTATTA GCAATTCATG GAGAATATGA ATATCCTTCA
CTCTTTCATT TACCAACACC AATCATGTTT TTACAGAATG ATTCCCCCTT TAGTATATTT
GAATTAAGTA GCTCAGCACG TATTCATTTA ATTCCGATGA TGCTATGTTT AAATAAAAGA
TTTCGAGTAG GGAAAAAGTT ATTACCAAAT TTAAATCACA TTGCGGGCGG AGGCGGAGAA
TGGTTTCGGG AGGATCGGTC TCCAGTTTTT CAAACGTTAT TAAGTGATGT AAAACAAATT
ATATCGTATC CACTTTCGTT ACATCATAAA GGATATGAGG AAATAGAACG TTTTATGAAA
GAGCGTATTG ATGAAAATGG AACGTTATAT AGTTACGCAA CTGCCTCGTT TTATATGATT
TATGCTTTAC TTGCGTTAGG GCATTCTCTT CAATCATCAA TGATTCAAAA GGCTATAGCT
GGGATAACAT CTTATATATG GAAGATGGAA AGAGGGAATC ATTTGCAAAA CTCTCCTTCA
ACCGTGTGGG ATACAGCTTT ATTAAGCTAT GCGTTACAAG AGGCTCAAGT TTCAAAGGAT
AATAAGATGA TTCAAAATGC AACAGCGTAT TTATTAAAAA AACAGCATAC AAAAAAAGCT
GATTGGAGCG TACATGCTCC GGCGCTTACT CCTGGCGGTT GGGGTTTTTC GGATGTGAAT
ACGACAATTC CAGATATAGA TGATACAACA GCTGTGCTAA GGGCATTGGC ACGAAGTAGA
GGAAACAAAA ATATAGATAA TGCTTGGAAG AAAGGGGGCA ATTGGATTAA AGGATTACAA
AATAATGATG GTGGCTGGGG AGCATTTGAA AAAGGTGTGA CGAGCAAATT ATTAGCAAAA
TTACCAATCG AAAACGCAAG TGATATGATT ACAGATCCTT CTACGCCAGA TATTACGGGG
AGAGTGTTAG AGTTTTTCGG GACGTATGCA CAAAACGAAT TGCCTGAGAA ACAGATACAA
AGGGCAATAA ATTGGTTAAT GAATGTACAA GAGGAAAATG GATCATGGTA TGGGAAATGG
GGGATTTGTT ATCTATATGG TACGTGGGCT GTTATGACTG GTTTACGGTC ACTCGGAATT
CCGTCTAGCA ATCCTTCATT GACACGAGCA GCTTCATGGC TTGAACATAT ACAGCATGAA
GATGGTGGTT GGGGAGAATC ATGCCACAGT AGTGTGGAGA AAAGGTTCGT TACTTTACCA
TTTAGTACAC CATCCCAAAC TGCATGGGCG TTAGATGCTC TCATTTCTTA CTATGATACA
GAAACGCCAG CTATTCGAAA AGGTGTTTCA TATTTGCTTT CGAATCCTTA TGTGAATGAA
AGATATCCTA CTGGAACAGG TTTACCAGGT GCGTTTTATA TTAGGTATCA TAGCTATGCC
CATATATATC CACTACTTAC TTTGGCACAT TATATAAAAA AATATAGAAA ATAA
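
The GC content field can in principle be recomputed from the gene sequence above. This is a minimal sketch, assuming GC content means the percentage of G and C bases over the full gene sequence; how the record rounds its value is not stated, and the helper names are hypothetical.

```python
def clean_sequence(text: str) -> str:
    """Strip the spaces and line breaks used to lay the sequence out in blocks of 10."""
    return "".join(text.split()).upper()


def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(sequence)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


if __name__ == "__main__":
    # Paste the full gene sequence here to compare against the record's GC content.
    first_line = "TTGTTATTAT ACGAAAAAGC GCATGAAGAA ATAGTGAGAA GAGCAACAGC ACTTCAAACA"
    print(f"{gc_percent(first_line):.1f}% GC in the first 60 bases")
```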
 
Protein sequence
MLLYEKAHEE IVRRATALQT MQWQDGTWRF CFEGAPLTDC HMIFLLKLLG RDKEIEPFVE 
RVASLQTNEG TWKLHEDEVG GNLSATIQSY AALLASKKYT KEDANMKRAE NFIQERGGVA
RAHFMTKFLL AIHGEYEYPS LFHLPTPIMF LQNDSPFSIF ELSSSARIHL IPMMLCLNKR
FRVGKKLLPN LNHIAGGGGE WFREDRSPVF QTLLSDVKQI ISYPLSLHHK GYEEIERFMK
ERIDENGTLY SYATASFYMI YALLALGHSL QSSMIQKAIA GITSYIWKME RGNHLQNSPS
TVWDTALLSY ALQEAQVSKD NKMIQNATAY LLKKQHTKKA DWSVHAPALT PGGWGFSDVN
TTIPDIDDTT AVLRALARSR GNKNIDNAWK KGGNWIKGLQ NNDGGWGAFE KGVTSKLLAK
LPIENASDMI TDPSTPDITG RVLEFFGTYA QNELPEKQIQ RAINWLMNVQ EENGSWYGKW
GICYLYGTWA VMTGLRSLGI PSSNPSLTRA ASWLEHIQHE DGGWGESCHS SVEKRFVTLP
FSTPSQTAWA LDALISYYDT ETPAIRKGVS YLLSNPYVNE RYPTGTGLPG AFYIRYHSYA
HIYPLLTLAH YIKKYRK
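
The protein sequence above begins with M even though the gene sequence begins with TTG, which normally encodes leucine. The sketch below shows one way to reproduce this, assuming NCBI translation table 11 (the value given in the record), whose codon assignments match the standard code and which lists TTG among its alternative start codons. The function name and the handling of partial trailing codons are illustrative choices, not part of the record.

```python
BASES = "TCAG"
# Amino acids for codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...);
# translation table 11 uses the same assignments as the standard code.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
# Start codons listed for table 11; an initiating codon is read as Met.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(text: str) -> str:
    """Translate a CDS with table 11, stopping at the first stop codon.

    Whitespace is ignored and a partial trailing codon is dropped.
    Codons containing letters other than A, C, G, T raise KeyError.
    """
    seq = "".join(text.split()).upper()
    usable = len(seq) - len(seq) % 3
    residues = []
    for pos in range(0, usable, 3):
        codon = seq[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            residues.append("M")  # alternative start codon initiates with Met
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


if __name__ == "__main__":
    # First 30 bases of the gene sequence in this record.
    print(translate_cds("TTGTTATTAT ACGAAAAAGC GCATGAAGAA"))  # MLLYEKAHEE
```

The output for the first 30 bases matches the first ten residues of the protein sequence, MLLYEKAHEE.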