Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caci_6664 |
Symbol | |
ID | 8338028 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | - |
Start bp | 7678067 |
End bp | 7680001 |
Gene Length | 1935 bp |
Protein Length | 644 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 644959758 |
Product | squalene-hopene cyclase |
Protein accession | YP_003117351 |
Protein GI | 256395787 |
COG category | [I] Lipid transport and metabolism |
COG ID | [COG1657] Squalene cyclase |
TIGRFAM ID | [TIGR01507] squalene-hopene cyclase [TIGR01787] squalene/oxidosqualene cyclases |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.303504 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 31 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGACTGACG TCATCGACAA AGCCGTCGCG GCGACCGGGC CGGCCGATCC CTCCCAGGGC GCGGCCGCGA CGCTCCAGGC CGCCGCCGAC CACCTGCTCG GGCTGCAGGA CGACGCCGGC TGGTGGAAGG GCGAACTCGA GACGAACGTC ACGATGGACG CCGAAGACCT TCTGCTGCGC CAGTTTCTCG GCATCCGCAC CGAGGAGGTG ACCCGCGAAG CCGGCGACTG GATCCGCTCC CAGCAACGCG CCGACGGCAC ATGGGCGAAC TTCTTCGACG GCCCGGCCGA CCTGTCGACG ACCATCGAGG CCTACACGGC GCTGCGCATG GCCGGGGACG CGAAGGACGC CGAGCACATG CGCGCGGCGC GTACCTACAT CCTGGACTCC GGCGGCATCG AGGCCAGCCG GGTCTTCACC CGGATCTGGC TCGCGCTGTT CGGCGAATGG CAGTGGTCTG ACTTGCCGGT CATGCCGCCG GAGCTGATCT ACCTGCCGAA GTGGTTCCCG CTCAACGTCT ACGACTGGGC GTGCTGGGCC CGGCAGACCG TGGTGCCGTT GACGATCGTC AATGCTCTGC GTCCCGTCCG GCCGCTCGGC TTCGACCTGA AGGAGCTGCG GACCGGTCGG CGCGCGCCGG CGCAGCGCGG CTTGTTCAGC ACGCTCGATC GCGCGTTGCA CGTCTATGAG CGGAAGCCGC TGCGCTCGGT CCGGGACGCG GCGCTGCGCC GCTCCGCGGA CTGGATCATC GCGCGTCAGG AGGCTGATGG TTCCTGGGGC GGGATCCAGC CGCCGTGGGT GTATTCGCTG ATGGCGTTGA ACCTGCTCGG ATACGGCGTG GACCACCCGG TGATGCGCAA GGGCATCGAG GGCTTGGACC GCTTCACGAT CCGCGACGAG CGCGGGCGGC GGCTGGAGGC GTGTCAGTCC CCTGTGTGGG ACACCGTCCT GGCGATGACC GCGCTGCGCG ACGCCGAGCT GCCCGAGAAT CATCCGGCGC TGGTGAAGGC CGCCGATTGG GTGCTGGGGG AGGAGATCAC CAACCCCGGC GACTGGTCGG TGCGGCGTCC GCGCGTGGCG CCCGGCGGCT GGGCGTTCGA GTTCGACAAC GACGGCTACC CGGATGTCGA CGACACCGCC GAGGTGGTGC TCGCGCTGAA CCGCGTCGCG CATCCGGACG CCCCCGCCGC CATCCGCCGG GGCGTCGACT GGCTGGAAGG CATGGCCTGC AAGGACGGCG GCTACGGCGC CTTCGACGCC GACAACACCC GCACGCTGGC GCTCAAGCTG CCGTTCTGCG ACTTCGGGGC GGTCATCGAT CCGCCGACCG CCGACGTCAC GGCGCACACG CTGGAGGCGT ACGCGGCCCT CGGGCTTGCG AACTCGCGGG CGTCGCAACG CGCTTTGGAG TGGCTGGTGA AGGCGCAGGA GCGCGACGGC TCGTGGTTCG GGCGCTGGGG CGCCAACCAT GTCTACGGCA CCGGCGCCGT GGTCCCGGCG ATGGTCGCCG TCGGCGTCGA TCCCGAGGAC GAGATGATCC GCCGGGCCGT TCGCTGGCTG GAGGAGCACC AGAACGACGA CGGCGGGTGG GGCGAGGACC TGCGCTCCTA CCGCGACAAG AGCTGGATCG GGCGCGGGGT CTCGACGGCG TCGCAGACCG CGTGGGCGTT GCTGGCGTTG CTGGCGGCGG GGGAGGAGCG CGGGACGGCG GTCGAGCAGG GCGTCAGGTT CCTGATCCGC ACGCAGCGCG CGGACGGTAC GTGGGACGAG GACCACTACA CCGGCACCGG CTTCCCCGGC GACTTCTACC TGAACTACCA CCTGTATCGG CTCGTCTTCC CGATCAGCGC CCTCGGCCGC TATGTGCGTG CTGTGGGAGC GGCGGGAGAC GGGGGAGATG CGGGACATGC GGGACATGCG GGGACCGTGT CATGA
|
Protein sequence | MTDVIDKAVA ATGPADPSQG AAATLQAAAD HLLGLQDDAG WWKGELETNV TMDAEDLLLR QFLGIRTEEV TREAGDWIRS QQRADGTWAN FFDGPADLST TIEAYTALRM AGDAKDAEHM RAARTYILDS GGIEASRVFT RIWLALFGEW QWSDLPVMPP ELIYLPKWFP LNVYDWACWA RQTVVPLTIV NALRPVRPLG FDLKELRTGR RAPAQRGLFS TLDRALHVYE RKPLRSVRDA ALRRSADWII ARQEADGSWG GIQPPWVYSL MALNLLGYGV DHPVMRKGIE GLDRFTIRDE RGRRLEACQS PVWDTVLAMT ALRDAELPEN HPALVKAADW VLGEEITNPG DWSVRRPRVA PGGWAFEFDN DGYPDVDDTA EVVLALNRVA HPDAPAAIRR GVDWLEGMAC KDGGYGAFDA DNTRTLALKL PFCDFGAVID PPTADVTAHT LEAYAALGLA NSRASQRALE WLVKAQERDG SWFGRWGANH VYGTGAVVPA MVAVGVDPED EMIRRAVRWL EEHQNDDGGW GEDLRSYRDK SWIGRGVSTA SQTAWALLAL LAAGEERGTA VEQGVRFLIR TQRADGTWDE DHYTGTGFPG DFYLNYHLYR LVFPISALGR YVRAVGAAGD GGDAGHAGHA GTVS
|
| |