Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acel_1697 |
Symbol | |
ID | 4484697 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Acidothermus cellulolyticus 11B |
Kingdom | Bacteria |
Replicon accession | NC_008578 |
Strand | + |
Start bp | 1907344 |
End bp | 1909245 |
Gene Length | 1902 bp |
Protein Length | 633 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 639730487 |
Product | squalene-hopene cyclase |
Protein accession | YP_873455 |
Protein GI | 117928904 |
COG category | [I] Lipid transport and metabolism |
COG ID | [COG1657] Squalene cyclase |
TIGRFAM ID | [TIGR01507] squalene-hopene cyclase [TIGR01787] squalene/oxidosqualene cyclases |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.633147 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.000255993 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | GTGACCCAGG CGAGCGTACG AGAGGACGCG AAAGCGGCCC TTGACCGAGC CGTCGATTAC CTCCTCTCCT TGCAGGACGA GAAAGGTTTC TGGAAAGGCG AACTAGAAAC CAACGTCACG ATCGAAGCGG AAGACCTGCT TCTTCGCGAA TTCCTCGGCA TCCGAACGCC GGACATCACG GCGGAAACCG CGCGGTGGAT TCGCGCCAAG CAGCGGTCCG ACGGCACGTG GGCCACGTTT TACGACGGGC CACCGGATTT GTCGACCTCA GTCGAAGCCT ACGTGGCGCT GAAACTCGCC GGCGATGACC CGGCCGCGCC GCATATGGAG AAAGCCGCCG CATACATCCG CGGCGCCGGC GGGGTGGAGC GGACTCGGGT GTTCACCCGG TTATGGTTGG CGCTCTTCGG CTTATGGCCG TGGGACGATC TGCCGACGCT CCCGCCGGAG ATGATTTTTC TCCCGTCGTG GTTTCCGTTG AACATTTACG ACTGGGGGTG CTGGGCCCGG CAGACCGTCG TACCGCTCAC TATTGTCAGC GCGCTCCGGC CGGTGCGGCC GATACCGCTG TCCATCGACG AAATCCGGAC CGGCGCACCG CCGCCGCCGC GGGATCCGGC CTGGACGATC CGCGGCTTCT TCCAGCGACT GGATGACCTG CTGCGCGGAT ACCGGCGGGT CGCGGATCAC GGTCCGGCCC GACTGTTCCG GCGCTTGGCC ATGCGGCGGG CGGCGGAATG GATCATCGCG CGACAGGAAG CCGACGGCTC GTGGGGCGGC ATCCAGCCGC CATGGGTGTA TTCGTTGATT GCCTTGCATC TTCTCGGTTA TCCGCTTGAT CATCCCGTGC TGCGCCGCGG CCTGGACGGA CTGAACGGCT TCACCATCCG GGAGGAGACC GCTGACGGGG CGGTCCGCCG GTTGGAAGCC TGCCAGTCGC CGGTCTGGGA CACCGCGCTG GCGGTCACCG CGCTCCGCGA CGCCGGCCTG CCCGCCGATC ATCCGAGGGT GCAGGCCGCC GCCCGCTGGC TGGTCGGCGA AGAGGTGCGG GTCGCCGGGG ACTGGGCGGT ACGCCGTCCC GGGCTGCCGC CAGGAGGATG GGCCTTCGAA TTCGCCAACG ACAACTACCC GGATACCGAT GACACCGCGG AGGTGGTCCT CGCCCTCCGC CGAGTGCGCC TCGAGGACGC CGATCAGCAG GCGCTGGAGG CTGCGGTCCG CCGCGCGACG ACGTGGGTCA TCGGCATGCA ATCCACGGAC GGCGGCTGGG GCGCCTTCGA CGCGGACAAC ACCCGAGAGT TGGTGCTCCG CCTGCCGTTC TGCGATTTCG GAGCCGTGAT CGATCCGCCG TCCGCGGACG TCACCGCGCA CATCGTGGAA ATGCTCGCCG CCCTCGGCAT GCGCGACCAC CCAGCCACGG TCGCCGGGGT GCGCTGGCTG CTCGCACACC AAGAGCCGGA CGGCTCGTGG TTCGGCCGGT GGGGTGCCAA TCACATCTAC GGCACCGGCG CCGTGGTGCC GGCACTGATC GCCGCTGGGG TGTCGCCGGA CACGCCGCCG ATCCGCCGGG CGATCCGCTG GCTGGAGGAG CATCAGAATC CGGACGGCGG GTGGGGCGAG GATCTCCGGT CGTACACCGA TCCGGCGCTG TGGGTCGGCC GTGGGGTGTC CACCGCCTCA CAGACCGCGT GGGCGCTGCT CGCGCTCCTC GCCGCCGGGG AGGAGGCGTC GCCCGCGGTG GACCGCGGCG TGCGGTGGCT GGTCACAACG CAGCAGCCGG ACGGCGGGTG GGATGAGCCG CACTACACGG GTACCGGATT TCCCGGCGAC TTCTACATCA ACTACCACCT GTACCGACTG GTCTTCCCGA TCAGTGCGCT CGGACGATAC GTGAACCGAT GA
|
Protein sequence | MTQASVREDA KAALDRAVDY LLSLQDEKGF WKGELETNVT IEAEDLLLRE FLGIRTPDIT AETARWIRAK QRSDGTWATF YDGPPDLSTS VEAYVALKLA GDDPAAPHME KAAAYIRGAG GVERTRVFTR LWLALFGLWP WDDLPTLPPE MIFLPSWFPL NIYDWGCWAR QTVVPLTIVS ALRPVRPIPL SIDEIRTGAP PPPRDPAWTI RGFFQRLDDL LRGYRRVADH GPARLFRRLA MRRAAEWIIA RQEADGSWGG IQPPWVYSLI ALHLLGYPLD HPVLRRGLDG LNGFTIREET ADGAVRRLEA CQSPVWDTAL AVTALRDAGL PADHPRVQAA ARWLVGEEVR VAGDWAVRRP GLPPGGWAFE FANDNYPDTD DTAEVVLALR RVRLEDADQQ ALEAAVRRAT TWVIGMQSTD GGWGAFDADN TRELVLRLPF CDFGAVIDPP SADVTAHIVE MLAALGMRDH PATVAGVRWL LAHQEPDGSW FGRWGANHIY GTGAVVPALI AAGVSPDTPP IRRAIRWLEE HQNPDGGWGE DLRSYTDPAL WVGRGVSTAS QTAWALLALL AAGEEASPAV DRGVRWLVTT QQPDGGWDEP HYTGTGFPGD FYINYHLYRL VFPISALGRY VNR
|
| |