Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mchl_5491 |
Symbol | |
ID | 7119274 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium chloromethanicum CM4 |
Kingdom | Bacteria |
Replicon accession | NC_011758 |
Strand | - |
Start bp | 116385 |
End bp | 118322 |
Gene Length | 1938 bp |
Protein Length | 645 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 643528164 |
Product | squalene-hopene cyclase |
Protein accession | YP_002424160 |
Protein GI | 218533345 |
COG category | [I] Lipid transport and metabolism |
COG ID | [COG1657] Squalene cyclase |
TIGRFAM ID | [TIGR01507] squalene-hopene cyclase [TIGR01787] squalene/oxidosqualene cyclases |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.264934 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 33 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACACGG AGCCTCGCTT CTCCGCGCCC GAGACCCTGC GCGCTATCGC CGGCGCGGGG CGTGCCCTGG GCCGCCACCA GCGCCGGGAC GGCCACTGGG TCTTCGAGTT GGAGGCGGAC GCGACCATCC CGGCCGAGTA CGTGCTCCTG GAGCACTACA TGGACCGCAT CACGCCCGAG CGGCAGGCCC GGATCGGAGC CTACCTCCGG CGCATCCAGG GCGAGCATGG CGGCTGGCCC ATGTTCCACG CGGGCGAGTT CAACATCTCG GCCAGCGTGA AGGCCTACTG CGCGCTGAAG GCCATCGGCG ACGATCCGCA AGCGCCGCAT ATGGTCCGGG CCCGCCAGGC CATCCTCGGC CATGGGGGCG CGGAGCGCGC CAACGTCTTC ACGCGCATCC AACTCGCCCT GTTCGGCGCC ATCCCCTGGC GCGGCGTGCC GGTGATGCCG GTCGAGATCA TGCACCTGCC CAAGTGGTTC TTCTTCAACA TCTGGGCGAT GTCCTACTGG GCCCGCACCT GCGTGGTGCC GCTCCTCGTG CTGCAGGCGC GGAAGCCCCG TGCCCGCAAC CCGCGCCAGG TGAGCTTCGA CGAGATCTTC CGGACCGAGC CGGACGAGGT CCGAGACTGG ATCCGCGGCC CCTACCGCTC ACGCTGGGGC GTGGTGTTCA AGCACATCGA CACGGTGCTG CGCTGGACCG AGCCCCTGTT CTCGAAGGTC GCGCGCGAGA GCGCCATCTT CAAGGCCGTC GACTTCGTGG AGGAGCGTCT GAACGGCGAG GACGGGCTCG GCGCGATCTA CCCGGCCATG GCCTACGCGC TGATGATGTA CGACGTGCTC GGCTACCCCG AGGACGACCC GCGCTGCGTC ACGATCTGGA AGGCCATCGA CAAGCTTCTC ATCGAGACGG ACGAGGAGGT TTACTGCCAG CCCTGCGTCT CGCCCGTATG GGACACGAGC CTGTCCGGGC ATGCCATGAT CGAGGCGGCG CGCACCGGGG GCATCGAGGC CCAAGCGGAG CTCGACGCCG CGTGCGACTG GCTGGTGGCG CGCCAGGTCA AGGACGTGCG GGGCGACTGG GCCGAGACGC GGCCGGACGC CGAGCCCGGC GGCTGGGCCT TCCAGTACCG CAACGACCAC TACCCCGACG TCGACGACAC GGCGGTGGTC GCCATGCTGC TCCACCGCAA CGGCCGGCCC GAGCACGCGG AGGCAATCGA GAAGGCGCGC CGCTGGGTCG TCGGCGTGCA GAGCCGCAAT GGAGGCTGGG GTGCCTTCGA CGCCGACAAC GACCGCGAGT TCCTCAACCA CATCCCGTTC TCGGACCACG GCGCGCTGCT CGACCCGCCG ACCGCCGACG TGACCGGCCG CTGCATCTCC TTCCTGTCCC AGCTCGGGCA CGAGGAGGAC CGGCCGGTGA TCGAGCGCGC CTTGGCCTAC CTTCGGGCCG AACAGGAGCG CGACGGCAGT TGGTACGGGC GCTGGGGCAC CAACTACGTC TACGGCACCT GGACGGTCCT GTGCGGCCTG AACGCGGCCG GCATCCCGCA CGACGACCCG ATGGTGCGCC GGGCCGTGGA CTGGCTGGTC TCGATCCAGC GCGCGGACGG CGGCTGGGGC GAGGACGAGC GCAGCTACGA CGTCGGCCAC TACGTCGAGA ACGCCGAGAG CCTGCCTTCG CAGACGGCCT GGGCGATGCT CGGCCTGATG TCGGTCGGCC AGGCCGACCA CCCCGCCGTC CTACGCGGTG CGGCCTACCT GCAGCGCACG CAAGGGCCGG ACGGCGAGTG GCAGGAGCGG GCCTACAACG CCGTCGGCTT CCCGCGCGTG TTCTACCTCA AGTATCACGG CTACCGGCTG TTCTTCCCAC TGTTCGCCCT CTCGCGCCTT CACAACCTAC AACGGGGCAA CAGCCGGGAG GTCAGCTTCG GCTTTTGA
|
Protein sequence | MNTEPRFSAP ETLRAIAGAG RALGRHQRRD GHWVFELEAD ATIPAEYVLL EHYMDRITPE RQARIGAYLR RIQGEHGGWP MFHAGEFNIS ASVKAYCALK AIGDDPQAPH MVRARQAILG HGGAERANVF TRIQLALFGA IPWRGVPVMP VEIMHLPKWF FFNIWAMSYW ARTCVVPLLV LQARKPRARN PRQVSFDEIF RTEPDEVRDW IRGPYRSRWG VVFKHIDTVL RWTEPLFSKV ARESAIFKAV DFVEERLNGE DGLGAIYPAM AYALMMYDVL GYPEDDPRCV TIWKAIDKLL IETDEEVYCQ PCVSPVWDTS LSGHAMIEAA RTGGIEAQAE LDAACDWLVA RQVKDVRGDW AETRPDAEPG GWAFQYRNDH YPDVDDTAVV AMLLHRNGRP EHAEAIEKAR RWVVGVQSRN GGWGAFDADN DREFLNHIPF SDHGALLDPP TADVTGRCIS FLSQLGHEED RPVIERALAY LRAEQERDGS WYGRWGTNYV YGTWTVLCGL NAAGIPHDDP MVRRAVDWLV SIQRADGGWG EDERSYDVGH YVENAESLPS QTAWAMLGLM SVGQADHPAV LRGAAYLQRT QGPDGEWQER AYNAVGFPRV FYLKYHGYRL FFPLFALSRL HNLQRGNSRE VSFGF
|
| |