Gene Information |
Locus tag | BAS3351 |
Symbol | |
ID | 2848197 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Bacillus anthracis str. Sterne |
Kingdom | Bacteria |
Replicon accession | NC_005945 |
Strand | - |
Start bp | 3321600 |
End bp | 3323453 |
Gene Length | 1854 bp |
Protein Length | 617 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 637506595 |
Product | squalene-hopene cyclase |
Protein accession | YP_029608 |
Protein GI | 49186356 |
COG category | [I] Lipid transport and metabolism |
COG ID | [COG1657] Squalene cyclase |
TIGRFAM ID | [TIGR01507] squalene-hopene cyclase; [TIGR01787] squalene/oxidosqualene cyclases |
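
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the protein length excludes the stop codon. The function names are illustrative only and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the record for BAS3351.
length = gene_length_bp(3321600, 3323453)
print(length)                     # 1854
print(protein_length_aa(length))  # 617
```

Both results match the Gene Length (1854 bp) and Protein Length (617 aa) fields in the record.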
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.869066 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGTTATTAT ACGAAAAAGC GCATGAAGAA ATAGTGAGAA GAGCAACAGC ACTTCAAACA ATGCAATGGC AAGATGGTAC GTGGCGATTT TGTTTTGAAG GAGCTCCATT AACAGATTGC CATATGATTT TTTTATTAAA ATTATTAGGT AGAGATAAAG AGATAGAACC GTTCGTAGAA AGAGTAGCAT CACTCCAAAC AAATGAAGGA ACATGGAAAT TGCACGAAGA TGAAGTAGGA GGTAATTTAT CAGCTACAAT TCAATCTTAT GCCGCCTTAC TTGCATCGAA AAAATATACA AAAGAAGATG CGAATATGAA ACGAGCAGAA AATTTTATTC AGGAACGCGG TGGTGTGGCG CGTGCTCATT TTATGACGAA GTTTTTATTA GCAATTCATG GAGAATATGA ATATCCTTCA CTCTTTCATT TACCAACACC AATCATGTTT TTACAGAATG ATTCCCCCTT TAGTATATTT GAATTAAGTA GCTCAGCACG TATTCATTTA ATTCCGATGA TGCTATGTTT AAATAAAAGA TTTCGAGTAG GGAAAAAGTT ATTACCAAAT TTAAATCACA TTGCGGGCGG AGGCGGAGAA TGGTTTCGGG AGGATCGGTC TCCAGTTTTT CAAACGTTAT TAAGTGATGT AAAACAAATT ATATCGTATC CACTTTCGTT ACATCATAAA GGATATGAGG AAATAGAACG TTTTATGAAA GAGCGTATTG ATGAAAATGG AACGTTATAT AGTTACGCAA CTGCCTCGTT TTATATGATT TATGCTTTAC TTGCGTTAGG GCATTCTCTT CAATCATCAA TGATTCAAAA GGCTATAGCT GGGATAACAT CTTATATATG GAAGATGGAA AGAGGGAATC ATTTGCAAAA CTCTCCTTCA ACCGTGTGGG ATACAGCTTT ATTAAGCTAT GCGTTACAAG AGGCTCAAGT TTCAAAGGAT AATAAGATGA TTCAAAATGC AACAGCGTAT TTATTAAAAA AACAGCATAC AAAAAAAGCT GATTGGAGCG TACATGCTCC GGCGCTTACT CCTGGCGGTT GGGGTTTTTC GGATGTGAAT ACGACAATTC CAGATATAGA TGATACAACA GCTGTGCTAA GGGCATTGGC ACGAAGTAGA GGAAACAAAA ATATAGATAA TGCTTGGAAG AAAGGGGGCA ATTGGATTAA AGGATTACAA AATAATGATG GTGGCTGGGG AGCATTTGAA AAAGGTGTGA CGAGCAAATT ATTAGCAAAA TTACCAATCG AAAACGCAAG TGATATGATT ACAGATCCTT CTACGCCAGA TATTACGGGG AGAGTGTTAG AGTTTTTCGG GACGTATGCA CAAAACGAAT TGCCTGAGAA ACAGATACAA AGGGCAATAA ATTGGTTAAT GAATGTACAA GAGGAAAATG GATCATGGTA TGGGAAATGG GGGATTTGTT ATCTATATGG TACGTGGGCT GTTATGACTG GTTTACGGTC ACTCGGAATT CCGTCTAGCA ATCCTTCATT GACACGAGCA GCTTCATGGC TTGAACATAT ACAGCATGAA GATGGTGGTT GGGGAGAATC ATGCCACAGT AGTGTGGAGA AAAGGTTCGT TACTTTACCA TTTAGTACAC CATCCCAAAC TGCATGGGCG TTAGATGCTC TCATTTCTTA CTATGATACA GAAACGCCAG CTATTCGAAA AGGTGTTTCA TATTTGCTTT CGAATCCTTA TGTGAATGAA AGATATCCTA CTGGAACAGG TTTACCAGGT GCGTTTTATA TTAGGTATCA TAGCTATGCC 
CATATATATC CACTACTTAC TTTGGCACAT TATATAAAAA AATATAGAAA ATAA
|
Protein sequence | MLLYEKAHEE IVRRATALQT MQWQDGTWRF CFEGAPLTDC HMIFLLKLLG RDKEIEPFVE RVASLQTNEG TWKLHEDEVG GNLSATIQSY AALLASKKYT KEDANMKRAE NFIQERGGVA RAHFMTKFLL AIHGEYEYPS LFHLPTPIMF LQNDSPFSIF ELSSSARIHL IPMMLCLNKR FRVGKKLLPN LNHIAGGGGE WFREDRSPVF QTLLSDVKQI ISYPLSLHHK GYEEIERFMK ERIDENGTLY SYATASFYMI YALLALGHSL QSSMIQKAIA GITSYIWKME RGNHLQNSPS TVWDTALLSY ALQEAQVSKD NKMIQNATAY LLKKQHTKKA DWSVHAPALT PGGWGFSDVN TTIPDIDDTT AVLRALARSR GNKNIDNAWK KGGNWIKGLQ NNDGGWGAFE KGVTSKLLAK LPIENASDMI TDPSTPDITG RVLEFFGTYA QNELPEKQIQ RAINWLMNVQ EENGSWYGKW GICYLYGTWA VMTGLRSLGI PSSNPSLTRA ASWLEHIQHE DGGWGESCHS SVEKRFVTLP FSTPSQTAWA LDALISYYDT ETPAIRKGVS YLLSNPYVNE RYPTGTGLPG AFYIRYHSYA HIYPLLTLAH YIKKYRK
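
The gene sequence begins with TTG while the protein begins with M. This is expected under translation table 11, where TTG is one of the permitted start codons and an initiating codon is read as methionine. The sketch below illustrates this on the first 30 bases of the record. It assumes the gene sequence is shown in coding orientation (it starts with a start codon and ends with TAA, although the gene lies on the minus strand). The `translate_cds` helper is a hypothetical name written for this example, not a library function.

```python
# Codon table in the conventional TCAG ordering; table 11 uses the same
# amino acid assignments as the standard code but a wider set of start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
# Start codons listed for translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna: str) -> str:
    """Translate a coding-strand CDS, reading a valid first codon as M."""
    dna = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        codon = dna[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            protein.append("M")  # initiator codon is read as methionine
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":  # stop codon ends translation
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
print(translate_cds("TTGTTATTAT ACGAAAAAGC GCATGAAGAA"))  # MLLYEKAHEE
```

The output matches the first ten residues of the protein sequence above.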