Gene Information |
Locus tag | Acry_1685 |
Symbol | |
ID | 5162584 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Acidiphilium cryptum JF-5 |
Kingdom | Bacteria |
Replicon accession | NC_009484 |
Strand | + |
Start bp | 1861136 |
End bp | 1863085 |
Gene Length | 1950 bp |
Protein Length | 649 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 640553600 |
Product | squalene-hopene cyclase |
Protein accession | YP_001234809 |
Protein GI | 148260682 |
COG category | [I] Lipid transport and metabolism |
COG ID | [COG1657] Squalene cyclase |
TIGRFAM ID | [TIGR01507] squalene-hopene cyclase; [TIGR01787] squalene/oxidosqualene cyclases |
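The coordinate and length fields in the record above can be cross-checked with simple arithmetic. The sketch below is only an illustration and rests on two assumptions that the record does not state: that Start bp and End bp are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The variable and function names are hypothetical.

```python
# Values copied from the record above.
start_bp = 1861136
end_bp = 1863085
gene_length_bp = 1950
protein_length_aa = 649

# Assumption: coordinates are 1-based and inclusive, so the span
# covers both the start and the end position.
span = end_bp - start_bp + 1

# Assumption: the CDS is complete and its final codon is a stop codon,
# which is not translated into an amino acid.
codons = span // 3
expected_protein_length = codons - 1


def check_record():
    """Return True if the length fields agree with the coordinates."""
    return (
        span == gene_length_bp
        and span % 3 == 0
        and expected_protein_length == protein_length_aa
    )


print(span, expected_protein_length, check_record())
```

Under these assumptions the reported gene length and protein length agree with the coordinates, which is what would be expected for an unspliced, non-pseudo CDS.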
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTCGATA CGATTTCCTT CGATTTCGAT GCCCTCGACC AGGCGATCAG CCGCGCCCAC GCGCGGCTTT CGGCGGAACA GCGCGCGGAC GGCCATTACG TCTACGAACT CGAGGCCGAC GCGACGATCC CCGCCGAATA CGTCCTGCTC GAACATTTCC TCGACCGGAT CGACCCCGAG CTTGAAGCGC GCATCGGCGT GTTCCTGCGC GGCATCCAGG GCAATTCGCC GCAGAATCCC GGCGGCTGGC CGCTCTTCCA TGACGGGGCG ATGGACATTT CGGCCAGCGT GAAGGCCTAT TTCGCGCTGA AGGCGATCGG CGACGACCCG GACGCCCCGC ACATGCGCCG CGCACGCGAA GCCATCCTCG CCCGCGGCGG GGCGGCGCGC ACCAATGTGT TCACCCGCAT CCAGCTCGCG CTGTTCGGCG CCGTCCCCTG GCGGGCCTGC CCGGTCATGC CGGTCGAGAT CATGCTGCTG CCGGACTGGT TCCCGATCAC CATCTGGAAG ATCAGCTACT GGTCGCGCAC CGTCATAGCA CCTCTGCTGG TGCTGCTGAC CGAGCGCCCC ATCGCGCGCA ACCCGCGCAA CGTCCGGATC GACGAGCTGT TCGTCACCCC GCCCGATCAG GTGACCGACT ATATCCGCGG CCCCTACCGC TCGAACTGGG GCTACCTGTT CAAGGCGATC GATTCGGCGC TGCGCCCGCT CGAGCGGCAT TTCCCCGCCC GCAGCCGCAA GCGCGCGATC CAGGCGGCGA TCGACTTCAT CACCCCGCGC CTGAACGGCG AGGACGGGCT GGGCGCCATC TACCCCGCCA TGGCCAACAC GGTGATGATG TATCACACGC TCGGCTACAG CCCCGACCAC CCGGATTACG CGACGGCCTG GGCCTCGGTC CGCAAGCTGG TGACCGATGC CTCCTACCGT TTCGAGGGGG CGAGCTACGT GCAGCCCTGC CTCTCCCCGG TCTGGGACAC CTCGCTGGCC GCACACGCCC TCGCCGAGGC CGGCAGCCCC GGCGATGCCC AACTCGCCGC CGCCTGCGAC TGGCTGATCC CCCGCCAGAT CCTCGACGTG AAGGGCGACT GGGCCTATCG CAAGCCGGAC GCCCCGCCCG GCGGCTGGGC CTTCCAGTAC AACAACGCGC ACTACCCCGA CGTGGACGAT ACCGCCGTGG TCGGCATGAT CCTCGACCGC AACGGCGATC CCGCCCATCG CGAGGCGGTG GAACGCGCCC GGCAATGGAT CCTCGGCATG CAGAGCCGCT CCGGCGGCTG GGGCGCCTTC GATTCGGACA ACGAGTTCCA CTACCTCAAC CACATCCCCT TCGCCGATCA CGGCGCCCTG CTCGACCCGC CGACGGCGGA CGTCACCGCG CGCTGCATCT CCTTCCTCGC CCAGCTCGGC CACGCGGAAG ATCGCCCGGC GATCGAGCGG GGCGTCGCCT ATCTGCGCCG CGAGCAGGAA CAGGACGGCT CCTGGTTCGG CCGCTGGGGC ACGAACTACA TCTACGGCAC CTGGTCTTCG CTCTGCGCGC TGAACGCCGC CGGCGTGGCG CAGGACGACC CGATGATGGT CCGCGCCGTC GAATGGCTGC TCGCCCGCCA GCGGCCGGAT GGCGGCTGGG GCGAGGATTG CGAGACCTAC GCCCACGCGA AGCCCGGCGA GTATCACGAA AGCCTGCCCT CGCAGACCGC CTGGGCGCTG CTCGGCCTGA TGGCCGCCGG CCAGGCCGAG CACGAGGCCG TCGCCCGCGG CATCGCCTGG CTGCAATCGG TGCAGGAAGA CGACGGCTCG 
TGGACCGAAC AGCCCTATAA CGCGGTCGGT TTCCCGCGGG TGTTCTACCT GCGCTACCAC GGCTATCCAC GGTTCTTCCC GCTGCTGGCG ATGGCGCGCT ACCGCAACCT CGCCCGCGGC AACAGCCGGC AGGTGCAGTT CGGATTCTGA
|
Protein sequence | MFDTISFDFD ALDQAISRAH ARLSAEQRAD GHYVYELEAD ATIPAEYVLL EHFLDRIDPE LEARIGVFLR GIQGNSPQNP GGWPLFHDGA MDISASVKAY FALKAIGDDP DAPHMRRARE AILARGGAAR TNVFTRIQLA LFGAVPWRAC PVMPVEIMLL PDWFPITIWK ISYWSRTVIA PLLVLLTERP IARNPRNVRI DELFVTPPDQ VTDYIRGPYR SNWGYLFKAI DSALRPLERH FPARSRKRAI QAAIDFITPR LNGEDGLGAI YPAMANTVMM YHTLGYSPDH PDYATAWASV RKLVTDASYR FEGASYVQPC LSPVWDTSLA AHALAEAGSP GDAQLAAACD WLIPRQILDV KGDWAYRKPD APPGGWAFQY NNAHYPDVDD TAVVGMILDR NGDPAHREAV ERARQWILGM QSRSGGWGAF DSDNEFHYLN HIPFADHGAL LDPPTADVTA RCISFLAQLG HAEDRPAIER GVAYLRREQE QDGSWFGRWG TNYIYGTWSS LCALNAAGVA QDDPMMVRAV EWLLARQRPD GGWGEDCETY AHAKPGEYHE SLPSQTAWAL LGLMAAGQAE HEAVARGIAW LQSVQEDDGS WTEQPYNAVG FPRVFYLRYH GYPRFFPLLA MARYRNLARG NSRQVQFGF