Gene Acel_1697 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcel_1697 
Symbol 
ID4484697 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAcidothermus cellulolyticus 11B 
KingdomBacteria 
Replicon accessionNC_008578 
Strand
Start bp1907344 
End bp1909245 
Gene Length1902 bp 
Protein Length633 aa 
Translation table11 
GC content68% 
IMG OID639730487 
Productsqualene-hopene cyclase 
Protein accessionYP_873455 
Protein GI117928904 
COG category[I] Lipid transport and metabolism 
COG ID[COG1657] Squalene cyclase 
TIGRFAM ID[TIGR01507] squalene-hopene cyclase
[TIGR01787] squalene/oxidosqualene cyclases 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.633147 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.000255993 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
GTGACCCAGG CGAGCGTACG AGAGGACGCG AAAGCGGCCC TTGACCGAGC CGTCGATTAC 
CTCCTCTCCT TGCAGGACGA GAAAGGTTTC TGGAAAGGCG AACTAGAAAC CAACGTCACG
ATCGAAGCGG AAGACCTGCT TCTTCGCGAA TTCCTCGGCA TCCGAACGCC GGACATCACG
GCGGAAACCG CGCGGTGGAT TCGCGCCAAG CAGCGGTCCG ACGGCACGTG GGCCACGTTT
TACGACGGGC CACCGGATTT GTCGACCTCA GTCGAAGCCT ACGTGGCGCT GAAACTCGCC
GGCGATGACC CGGCCGCGCC GCATATGGAG AAAGCCGCCG CATACATCCG CGGCGCCGGC
GGGGTGGAGC GGACTCGGGT GTTCACCCGG TTATGGTTGG CGCTCTTCGG CTTATGGCCG
TGGGACGATC TGCCGACGCT CCCGCCGGAG ATGATTTTTC TCCCGTCGTG GTTTCCGTTG
AACATTTACG ACTGGGGGTG CTGGGCCCGG CAGACCGTCG TACCGCTCAC TATTGTCAGC
GCGCTCCGGC CGGTGCGGCC GATACCGCTG TCCATCGACG AAATCCGGAC CGGCGCACCG
CCGCCGCCGC GGGATCCGGC CTGGACGATC CGCGGCTTCT TCCAGCGACT GGATGACCTG
CTGCGCGGAT ACCGGCGGGT CGCGGATCAC GGTCCGGCCC GACTGTTCCG GCGCTTGGCC
ATGCGGCGGG CGGCGGAATG GATCATCGCG CGACAGGAAG CCGACGGCTC GTGGGGCGGC
ATCCAGCCGC CATGGGTGTA TTCGTTGATT GCCTTGCATC TTCTCGGTTA TCCGCTTGAT
CATCCCGTGC TGCGCCGCGG CCTGGACGGA CTGAACGGCT TCACCATCCG GGAGGAGACC
GCTGACGGGG CGGTCCGCCG GTTGGAAGCC TGCCAGTCGC CGGTCTGGGA CACCGCGCTG
GCGGTCACCG CGCTCCGCGA CGCCGGCCTG CCCGCCGATC ATCCGAGGGT GCAGGCCGCC
GCCCGCTGGC TGGTCGGCGA AGAGGTGCGG GTCGCCGGGG ACTGGGCGGT ACGCCGTCCC
GGGCTGCCGC CAGGAGGATG GGCCTTCGAA TTCGCCAACG ACAACTACCC GGATACCGAT
GACACCGCGG AGGTGGTCCT CGCCCTCCGC CGAGTGCGCC TCGAGGACGC CGATCAGCAG
GCGCTGGAGG CTGCGGTCCG CCGCGCGACG ACGTGGGTCA TCGGCATGCA ATCCACGGAC
GGCGGCTGGG GCGCCTTCGA CGCGGACAAC ACCCGAGAGT TGGTGCTCCG CCTGCCGTTC
TGCGATTTCG GAGCCGTGAT CGATCCGCCG TCCGCGGACG TCACCGCGCA CATCGTGGAA
ATGCTCGCCG CCCTCGGCAT GCGCGACCAC CCAGCCACGG TCGCCGGGGT GCGCTGGCTG
CTCGCACACC AAGAGCCGGA CGGCTCGTGG TTCGGCCGGT GGGGTGCCAA TCACATCTAC
GGCACCGGCG CCGTGGTGCC GGCACTGATC GCCGCTGGGG TGTCGCCGGA CACGCCGCCG
ATCCGCCGGG CGATCCGCTG GCTGGAGGAG CATCAGAATC CGGACGGCGG GTGGGGCGAG
GATCTCCGGT CGTACACCGA TCCGGCGCTG TGGGTCGGCC GTGGGGTGTC CACCGCCTCA
CAGACCGCGT GGGCGCTGCT CGCGCTCCTC GCCGCCGGGG AGGAGGCGTC GCCCGCGGTG
GACCGCGGCG TGCGGTGGCT GGTCACAACG CAGCAGCCGG ACGGCGGGTG GGATGAGCCG
CACTACACGG GTACCGGATT TCCCGGCGAC TTCTACATCA ACTACCACCT GTACCGACTG
GTCTTCCCGA TCAGTGCGCT CGGACGATAC GTGAACCGAT GA
 
Protein sequence
MTQASVREDA KAALDRAVDY LLSLQDEKGF WKGELETNVT IEAEDLLLRE FLGIRTPDIT 
AETARWIRAK QRSDGTWATF YDGPPDLSTS VEAYVALKLA GDDPAAPHME KAAAYIRGAG
GVERTRVFTR LWLALFGLWP WDDLPTLPPE MIFLPSWFPL NIYDWGCWAR QTVVPLTIVS
ALRPVRPIPL SIDEIRTGAP PPPRDPAWTI RGFFQRLDDL LRGYRRVADH GPARLFRRLA
MRRAAEWIIA RQEADGSWGG IQPPWVYSLI ALHLLGYPLD HPVLRRGLDG LNGFTIREET
ADGAVRRLEA CQSPVWDTAL AVTALRDAGL PADHPRVQAA ARWLVGEEVR VAGDWAVRRP
GLPPGGWAFE FANDNYPDTD DTAEVVLALR RVRLEDADQQ ALEAAVRRAT TWVIGMQSTD
GGWGAFDADN TRELVLRLPF CDFGAVIDPP SADVTAHIVE MLAALGMRDH PATVAGVRWL
LAHQEPDGSW FGRWGANHIY GTGAVVPALI AAGVSPDTPP IRRAIRWLEE HQNPDGGWGE
DLRSYTDPAL WVGRGVSTAS QTAWALLALL AAGEEASPAV DRGVRWLVTT QQPDGGWDEP
HYTGTGFPGD FYINYHLYRL VFPISALGRY VNR