Gene Acry_1685 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcry_1685 
Symbol 
ID5162584 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAcidiphilium cryptum JF-5 
KingdomBacteria 
Replicon accessionNC_009484 
Strand
Start bp1861136 
End bp1863085 
Gene Length1950 bp 
Protein Length649 aa 
Translation table11 
GC content69% 
IMG OID640553600 
Productsqualene-hopene cyclase 
Protein accessionYP_001234809 
Protein GI148260682 
COG category[I] Lipid transport and metabolism 
COG ID[COG1657] Squalene cyclase 
TIGRFAM ID[TIGR01507] squalene-hopene cyclase
[TIGR01787] squalene/oxidosqualene cyclases 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTCGATA CGATTTCCTT CGATTTCGAT GCCCTCGACC AGGCGATCAG CCGCGCCCAC 
GCGCGGCTTT CGGCGGAACA GCGCGCGGAC GGCCATTACG TCTACGAACT CGAGGCCGAC
GCGACGATCC CCGCCGAATA CGTCCTGCTC GAACATTTCC TCGACCGGAT CGACCCCGAG
CTTGAAGCGC GCATCGGCGT GTTCCTGCGC GGCATCCAGG GCAATTCGCC GCAGAATCCC
GGCGGCTGGC CGCTCTTCCA TGACGGGGCG ATGGACATTT CGGCCAGCGT GAAGGCCTAT
TTCGCGCTGA AGGCGATCGG CGACGACCCG GACGCCCCGC ACATGCGCCG CGCACGCGAA
GCCATCCTCG CCCGCGGCGG GGCGGCGCGC ACCAATGTGT TCACCCGCAT CCAGCTCGCG
CTGTTCGGCG CCGTCCCCTG GCGGGCCTGC CCGGTCATGC CGGTCGAGAT CATGCTGCTG
CCGGACTGGT TCCCGATCAC CATCTGGAAG ATCAGCTACT GGTCGCGCAC CGTCATAGCA
CCTCTGCTGG TGCTGCTGAC CGAGCGCCCC ATCGCGCGCA ACCCGCGCAA CGTCCGGATC
GACGAGCTGT TCGTCACCCC GCCCGATCAG GTGACCGACT ATATCCGCGG CCCCTACCGC
TCGAACTGGG GCTACCTGTT CAAGGCGATC GATTCGGCGC TGCGCCCGCT CGAGCGGCAT
TTCCCCGCCC GCAGCCGCAA GCGCGCGATC CAGGCGGCGA TCGACTTCAT CACCCCGCGC
CTGAACGGCG AGGACGGGCT GGGCGCCATC TACCCCGCCA TGGCCAACAC GGTGATGATG
TATCACACGC TCGGCTACAG CCCCGACCAC CCGGATTACG CGACGGCCTG GGCCTCGGTC
CGCAAGCTGG TGACCGATGC CTCCTACCGT TTCGAGGGGG CGAGCTACGT GCAGCCCTGC
CTCTCCCCGG TCTGGGACAC CTCGCTGGCC GCACACGCCC TCGCCGAGGC CGGCAGCCCC
GGCGATGCCC AACTCGCCGC CGCCTGCGAC TGGCTGATCC CCCGCCAGAT CCTCGACGTG
AAGGGCGACT GGGCCTATCG CAAGCCGGAC GCCCCGCCCG GCGGCTGGGC CTTCCAGTAC
AACAACGCGC ACTACCCCGA CGTGGACGAT ACCGCCGTGG TCGGCATGAT CCTCGACCGC
AACGGCGATC CCGCCCATCG CGAGGCGGTG GAACGCGCCC GGCAATGGAT CCTCGGCATG
CAGAGCCGCT CCGGCGGCTG GGGCGCCTTC GATTCGGACA ACGAGTTCCA CTACCTCAAC
CACATCCCCT TCGCCGATCA CGGCGCCCTG CTCGACCCGC CGACGGCGGA CGTCACCGCG
CGCTGCATCT CCTTCCTCGC CCAGCTCGGC CACGCGGAAG ATCGCCCGGC GATCGAGCGG
GGCGTCGCCT ATCTGCGCCG CGAGCAGGAA CAGGACGGCT CCTGGTTCGG CCGCTGGGGC
ACGAACTACA TCTACGGCAC CTGGTCTTCG CTCTGCGCGC TGAACGCCGC CGGCGTGGCG
CAGGACGACC CGATGATGGT CCGCGCCGTC GAATGGCTGC TCGCCCGCCA GCGGCCGGAT
GGCGGCTGGG GCGAGGATTG CGAGACCTAC GCCCACGCGA AGCCCGGCGA GTATCACGAA
AGCCTGCCCT CGCAGACCGC CTGGGCGCTG CTCGGCCTGA TGGCCGCCGG CCAGGCCGAG
CACGAGGCCG TCGCCCGCGG CATCGCCTGG CTGCAATCGG TGCAGGAAGA CGACGGCTCG
TGGACCGAAC AGCCCTATAA CGCGGTCGGT TTCCCGCGGG TGTTCTACCT GCGCTACCAC
GGCTATCCAC GGTTCTTCCC GCTGCTGGCG ATGGCGCGCT ACCGCAACCT CGCCCGCGGC
AACAGCCGGC AGGTGCAGTT CGGATTCTGA
 
Protein sequence
MFDTISFDFD ALDQAISRAH ARLSAEQRAD GHYVYELEAD ATIPAEYVLL EHFLDRIDPE 
LEARIGVFLR GIQGNSPQNP GGWPLFHDGA MDISASVKAY FALKAIGDDP DAPHMRRARE
AILARGGAAR TNVFTRIQLA LFGAVPWRAC PVMPVEIMLL PDWFPITIWK ISYWSRTVIA
PLLVLLTERP IARNPRNVRI DELFVTPPDQ VTDYIRGPYR SNWGYLFKAI DSALRPLERH
FPARSRKRAI QAAIDFITPR LNGEDGLGAI YPAMANTVMM YHTLGYSPDH PDYATAWASV
RKLVTDASYR FEGASYVQPC LSPVWDTSLA AHALAEAGSP GDAQLAAACD WLIPRQILDV
KGDWAYRKPD APPGGWAFQY NNAHYPDVDD TAVVGMILDR NGDPAHREAV ERARQWILGM
QSRSGGWGAF DSDNEFHYLN HIPFADHGAL LDPPTADVTA RCISFLAQLG HAEDRPAIER
GVAYLRREQE QDGSWFGRWG TNYIYGTWSS LCALNAAGVA QDDPMMVRAV EWLLARQRPD
GGWGEDCETY AHAKPGEYHE SLPSQTAWAL LGLMAAGQAE HEAVARGIAW LQSVQEDDGS
WTEQPYNAVG FPRVFYLRYH GYPRFFPLLA MARYRNLARG NSRQVQFGF