Gene Aaci_2443 details


Gene Information

Locus tag: Aaci_2443
Symbol:
ID: 8425972
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Alicyclobacillus acidocaldarius subsp. acidocaldarius DSM 446
Kingdom: Bacteria
Replicon accession: NC_013205
Strand:
Start bp: 2506260
End bp: 2508155
Gene Length: 1896 bp
Protein Length: 631 aa
Translation table: 11
GC content: 67%
IMG OID: 645028561
Product: squalene-hopene cyclase
Protein accession: YP_003185838
Protein GI: 258512404
COG category: [I] Lipid transport and metabolism
COG ID: [COG1657] Squalene cyclase
TIGRFAM ID: [TIGR01507] squalene-hopene cyclase, [TIGR01787] squalene/oxidosqualene cyclases
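
The length fields above can be cross-checked against the coordinates. The sketch below assumes that the start and end positions are inclusive and that the unspliced CDS ends in a single stop codon. Both are assumptions, although the listed numbers agree with them. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature whose start and end coordinates are both inclusive."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_bp // 3 - 1


length = gene_length_bp(2506260, 2508155)
print(length, protein_length_aa(length))  # 1896 631
```

The result matches the listed gene length of 1896 bp and protein length of 631 aa.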


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.78759
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCTGAGC AGTTGGTGGA AGCGCCGGCC TACGCGCGGA CGCTGGATCG CGCGGTGGAG 
TATCTCCTCT CCTGCCAAAA GGACGAAGGC TACTGGTGGG GGCCGCTTCT GAGCAACGTC
ACGATGGAAG CGGAGTACGT CCTCTTGTGC CACATTCTCG ATCGCGTCGA TCGGGATCGC
ATGGAGAAGA TCCGGCGGTA CCTGTTGCAC GAGCAGCGCG AGGACGGCAC GTGGGCCCTG
TACCCGGGTG GGCCGCCGGA CCTCGACACG ACCATCGAGG CGTACGTCGC GCTCAAGTAT
ATCGGCATGT CGCGCGACGA GGAGCCGATG CAGAAGGCGC TCCGGTTCAT TCAGAGCCAG
GGCGGGATCG AGTCGTCGCG CGTGTTCACG CGGATGTGGC TGGCGCTGGT GGGAGAATAT
CCGTGGGAGA AGGTGCCCAT GGTCCCGCCG GAGATCATGT TCCTCGGCAA GCGCATGCCG
CTCAACATCT ACGAGTTTGG CTCGTGGGCT CGGGCGACCG TCGTGGCGCT CTCGATTGTG
ATGAGCCGCC AGCCGGTGTT CCCGCTGCCC GAGCGGGCGC GCGTGCCCGA GCTGTACGAG
ACCGACGTGC CTCCGCGCCG GCGCGGCGCC AAGGGAGGGG GTGGGTGGAT CTTCGACGCG
CTCGACCGGG CGCTGCACGG GTATCAGAAG CTGTCGGTGC ACCCGTTCCG CCGCGCGGCC
GAGATCCGCG CCTTGGACTG GTTGCTCGAG CGCCAGGCCG GAGACGGCAG CTGGGGCGGG
ATTCAGCCGC CTTGGTTTTA CGCGCTCATC GCGCTCAAGA TTCTCGACAT GACGCAGCAT
CCGGCGTTCA TCAAGGGCTG GGAAGGTCTA GAGCTGTACG GCGTGGAGCT GGATTACGGA
GGATGGATGT TTCAGGCTTC CATCTCGCCG GTGTGGGACA CGGGCCTCGC CGTGCTCGCG
CTGCGCGCTG CGGGGCTTCC GGCCGATCAC GACCGCTTGG TCAAGGCGGG CGAGTGGCTG
TTGGACCGGC AGATCACGGT TCCGGGCGAC TGGGCGGTGA AGCGCCCGAA CCTCAAGCCG
GGCGGGTTCG CGTTCCAGTT CGACAACGTG TACTACCCGG ACGTGGACGA CACGGCCGTC
GTGGTGTGGG CGCTCAACAC CCTGCGCTTG CCGGACGAGC GCCGCAGGCG GGACGCCATG
ACGAAGGGAT TCCGCTGGAT TGTCGGCATG CAGAGCTCGA ACGGCGGTTG GGGCGCCTAC
GACGTCGACA ACACGAGCGA TCTCCCGAAC CACATCCCGT TCTGCGACTT CGGCGAAGTG
ACCGATCCGC CGTCAGAGGA CGTCACCGCC CACGTGCTCG AGTGTTTCGG CAGCTTCGGG
TACGATGACG CCTGGAAGGT CATCCGGCGC GCGGTGGAAT ATCTCAAGCG GGAGCAGAAG
CCGGACGGCA GCTGGTTCGG TCGTTGGGGC GTCAATTACC TCTACGGCAC GGGCGCGGTG
GTGTCGGCGC TGAAGGCGGT CGGGATCGAC ACGCGCGAGC CGTACATTCA AAAGGCGCTC
GACTGGGTCG AGCAGCATCA GAACCCGGAC GGCGGCTGGG GCGAGGACTG CCGCTCGTAC
GAGGATCCGG CGTACGCGGG TAAGGGCGCG AGCACCCCGT CGCAGACGGC CTGGGCGCTG
ATGGCGCTCA TCGCGGGCGG CAGGGCGGAG TCCGAGGCCG CGCGCCGCGG CGTGCAATAC
CTCGTGGAGA CGCAGCGCCC GGACGGCGGC TGGGATGAGC CGTACTACAC CGGCACGGGC
TTCCCAGGGG ATTTCTACCT CGGCTACACC ATGTACCGCC ACGTGTTTCC GACGCTCGCG
CTCGGCCGCT ACAAGCAAGC CATCGAGCGC AGGTGA
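
A GC content figure like the one in the gene table can be computed from a sequence in this 10-base-block layout. This is a minimal sketch, not the database's own method. The helper name `gc_percent` is hypothetical, and only the first row of the gene sequence is used, so the value it prints is for that row rather than for the whole gene.

```python
def gc_percent(seq: str) -> float:
    """Percentage of G and C among the A/C/G/T characters in seq.

    Spaces and newlines from the block layout are ignored.
    """
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no nucleotides found")
    gc = sum(b in "GC" for b in bases)
    return 100.0 * gc / len(bases)


# First row of the gene sequence above, copied with its spacing.
first_row = "ATGGCTGAGC AGTTGGTGGA AGCGCCGGCC TACGCGCGGA CGCTGGATCG CGCGGTGGAG"
print(round(gc_percent(first_row), 1))
```

Passing the full 1896 bp sequence as one string, with or without the spaces and line breaks, gives the gene-level value.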
 
Protein sequence
MAEQLVEAPA YARTLDRAVE YLLSCQKDEG YWWGPLLSNV TMEAEYVLLC HILDRVDRDR 
MEKIRRYLLH EQREDGTWAL YPGGPPDLDT TIEAYVALKY IGMSRDEEPM QKALRFIQSQ
GGIESSRVFT RMWLALVGEY PWEKVPMVPP EIMFLGKRMP LNIYEFGSWA RATVVALSIV
MSRQPVFPLP ERARVPELYE TDVPPRRRGA KGGGGWIFDA LDRALHGYQK LSVHPFRRAA
EIRALDWLLE RQAGDGSWGG IQPPWFYALI ALKILDMTQH PAFIKGWEGL ELYGVELDYG
GWMFQASISP VWDTGLAVLA LRAAGLPADH DRLVKAGEWL LDRQITVPGD WAVKRPNLKP
GGFAFQFDNV YYPDVDDTAV VVWALNTLRL PDERRRRDAM TKGFRWIVGM QSSNGGWGAY
DVDNTSDLPN HIPFCDFGEV TDPPSEDVTA HVLECFGSFG YDDAWKVIRR AVEYLKREQK
PDGSWFGRWG VNYLYGTGAV VSALKAVGID TREPYIQKAL DWVEQHQNPD GGWGEDCRSY
EDPAYAGKGA STPSQTAWAL MALIAGGRAE SEAARRGVQY LVETQRPDGG WDEPYYTGTG
FPGDFYLGYT MYRHVFPTLA LGRYKQAIER R
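
The gene and protein sequences can be related by translating codons under translation table 11. The sketch below is an illustration, not the pipeline that produced this record. It builds the codon table from the NCBI T, C, A, G layout and translates only the first 30 and last 15 bases of the gene sequence. Table 11 assigns the same amino acids as the standard code and differs only in the start codons it permits, which the sketch does not model (this gene starts with ATG). The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Amino acids for codons ordered T, C, A, G at each position (NCBI layout).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop spaces and line breaks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGGCTGAGC AGTTGGTGGA AGCGCCGGCC"))  # MAEQLVEAPA
print(translate("ATCGAGCGC AGGTGA"))  # IERR
```

The two outputs match the first ten and last four residues of the protein sequence above; the final TGA codon is the stop and contributes no residue.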