Gene Caci_6664 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_6664 
Symbol 
ID8338028 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp7678067 
End bp7680001 
Gene Length1935 bp 
Protein Length644 aa 
Translation table11 
GC content71% 
IMG OID644959758 
Productsqualene-hopene cyclase 
Protein accessionYP_003117351 
Protein GI256395787 
COG category[I] Lipid transport and metabolism 
COG ID[COG1657] Squalene cyclase 
TIGRFAM ID[TIGR01507] squalene-hopene cyclase
[TIGR01787] squalene/oxidosqualene cyclases 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.303504 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACTGACG TCATCGACAA AGCCGTCGCG GCGACCGGGC CGGCCGATCC CTCCCAGGGC 
GCGGCCGCGA CGCTCCAGGC CGCCGCCGAC CACCTGCTCG GGCTGCAGGA CGACGCCGGC
TGGTGGAAGG GCGAACTCGA GACGAACGTC ACGATGGACG CCGAAGACCT TCTGCTGCGC
CAGTTTCTCG GCATCCGCAC CGAGGAGGTG ACCCGCGAAG CCGGCGACTG GATCCGCTCC
CAGCAACGCG CCGACGGCAC ATGGGCGAAC TTCTTCGACG GCCCGGCCGA CCTGTCGACG
ACCATCGAGG CCTACACGGC GCTGCGCATG GCCGGGGACG CGAAGGACGC CGAGCACATG
CGCGCGGCGC GTACCTACAT CCTGGACTCC GGCGGCATCG AGGCCAGCCG GGTCTTCACC
CGGATCTGGC TCGCGCTGTT CGGCGAATGG CAGTGGTCTG ACTTGCCGGT CATGCCGCCG
GAGCTGATCT ACCTGCCGAA GTGGTTCCCG CTCAACGTCT ACGACTGGGC GTGCTGGGCC
CGGCAGACCG TGGTGCCGTT GACGATCGTC AATGCTCTGC GTCCCGTCCG GCCGCTCGGC
TTCGACCTGA AGGAGCTGCG GACCGGTCGG CGCGCGCCGG CGCAGCGCGG CTTGTTCAGC
ACGCTCGATC GCGCGTTGCA CGTCTATGAG CGGAAGCCGC TGCGCTCGGT CCGGGACGCG
GCGCTGCGCC GCTCCGCGGA CTGGATCATC GCGCGTCAGG AGGCTGATGG TTCCTGGGGC
GGGATCCAGC CGCCGTGGGT GTATTCGCTG ATGGCGTTGA ACCTGCTCGG ATACGGCGTG
GACCACCCGG TGATGCGCAA GGGCATCGAG GGCTTGGACC GCTTCACGAT CCGCGACGAG
CGCGGGCGGC GGCTGGAGGC GTGTCAGTCC CCTGTGTGGG ACACCGTCCT GGCGATGACC
GCGCTGCGCG ACGCCGAGCT GCCCGAGAAT CATCCGGCGC TGGTGAAGGC CGCCGATTGG
GTGCTGGGGG AGGAGATCAC CAACCCCGGC GACTGGTCGG TGCGGCGTCC GCGCGTGGCG
CCCGGCGGCT GGGCGTTCGA GTTCGACAAC GACGGCTACC CGGATGTCGA CGACACCGCC
GAGGTGGTGC TCGCGCTGAA CCGCGTCGCG CATCCGGACG CCCCCGCCGC CATCCGCCGG
GGCGTCGACT GGCTGGAAGG CATGGCCTGC AAGGACGGCG GCTACGGCGC CTTCGACGCC
GACAACACCC GCACGCTGGC GCTCAAGCTG CCGTTCTGCG ACTTCGGGGC GGTCATCGAT
CCGCCGACCG CCGACGTCAC GGCGCACACG CTGGAGGCGT ACGCGGCCCT CGGGCTTGCG
AACTCGCGGG CGTCGCAACG CGCTTTGGAG TGGCTGGTGA AGGCGCAGGA GCGCGACGGC
TCGTGGTTCG GGCGCTGGGG CGCCAACCAT GTCTACGGCA CCGGCGCCGT GGTCCCGGCG
ATGGTCGCCG TCGGCGTCGA TCCCGAGGAC GAGATGATCC GCCGGGCCGT TCGCTGGCTG
GAGGAGCACC AGAACGACGA CGGCGGGTGG GGCGAGGACC TGCGCTCCTA CCGCGACAAG
AGCTGGATCG GGCGCGGGGT CTCGACGGCG TCGCAGACCG CGTGGGCGTT GCTGGCGTTG
CTGGCGGCGG GGGAGGAGCG CGGGACGGCG GTCGAGCAGG GCGTCAGGTT CCTGATCCGC
ACGCAGCGCG CGGACGGTAC GTGGGACGAG GACCACTACA CCGGCACCGG CTTCCCCGGC
GACTTCTACC TGAACTACCA CCTGTATCGG CTCGTCTTCC CGATCAGCGC CCTCGGCCGC
TATGTGCGTG CTGTGGGAGC GGCGGGAGAC GGGGGAGATG CGGGACATGC GGGACATGCG
GGGACCGTGT CATGA
 
Protein sequence
MTDVIDKAVA ATGPADPSQG AAATLQAAAD HLLGLQDDAG WWKGELETNV TMDAEDLLLR 
QFLGIRTEEV TREAGDWIRS QQRADGTWAN FFDGPADLST TIEAYTALRM AGDAKDAEHM
RAARTYILDS GGIEASRVFT RIWLALFGEW QWSDLPVMPP ELIYLPKWFP LNVYDWACWA
RQTVVPLTIV NALRPVRPLG FDLKELRTGR RAPAQRGLFS TLDRALHVYE RKPLRSVRDA
ALRRSADWII ARQEADGSWG GIQPPWVYSL MALNLLGYGV DHPVMRKGIE GLDRFTIRDE
RGRRLEACQS PVWDTVLAMT ALRDAELPEN HPALVKAADW VLGEEITNPG DWSVRRPRVA
PGGWAFEFDN DGYPDVDDTA EVVLALNRVA HPDAPAAIRR GVDWLEGMAC KDGGYGAFDA
DNTRTLALKL PFCDFGAVID PPTADVTAHT LEAYAALGLA NSRASQRALE WLVKAQERDG
SWFGRWGANH VYGTGAVVPA MVAVGVDPED EMIRRAVRWL EEHQNDDGGW GEDLRSYRDK
SWIGRGVSTA SQTAWALLAL LAAGEERGTA VEQGVRFLIR TQRADGTWDE DHYTGTGFPG
DFYLNYHLYR LVFPISALGR YVRAVGAAGD GGDAGHAGHA GTVS