Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | M446_6347 |
Symbol | |
ID | 6134918 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium sp. 4-46 |
Kingdom | Bacteria |
Replicon accession | NC_010511 |
Strand | - |
Start bp | 6973249 |
End bp | 6975240 |
Gene Length | 1992 bp |
Protein Length | 663 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641646441 |
Product | squalene-hopene cyclase |
Protein accession | YP_001773045 |
Protein GI | 170744390 |
COG category | [I] Lipid transport and metabolism |
COG ID | [COG1657] Squalene cyclase |
TIGRFAM ID | [TIGR01507] squalene-hopene cyclase [TIGR01787] squalene/oxidosqualene cyclases |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.323444 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.123319 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGGCAAGG TCGAGACGCT CCACCGCACG AGCACGCAGG ACATCACGCT CGACGACGTC GAGCGGCGCG TCACGCTCGC GTCGAAGGCT CTCATGCGGC TCGCGAACGC GGACGGGCAC TGGTGCTTCG AGCTGGAGGC CGACGCCACC ATTCCGTCCG AGTACATCCT CTACCATCAT TTCCGCGGCT CGATCCCGAC GGCCGAATTG GAGGGGAAGA TCGCCGCCTA CCTGCGCCGC ACCCAGAGCG CGCAGCACGA CGGCTGGGCC CTGATCCATG ACGGCCCCTT CGACATGAGC GCGACCGTCA AGGCCTACTT CGCCCTCAAG ATGGTCGGCG ACCCGATCGA CGCGCCCCAC ATGCGCCGGG CCCGCGACGC GATCCTGCGC CGGGGCGGCG CCGCCCACGC CAACGTCTTC ACCCGGATCA TGCTCGCCCT CTACGGCGAG GTGCCGTGGA CCGCCGTGCC GGTGATGCCG GTCGAGGTGA TGCTGCTGCC GCGGTGGTTC CCCTTCCACC TCGACAAGGT CTCCTACTGG GCCCGCACCG TGATGGTGCC GCTCTTCGTC CTGCAGGCCA AGAAGCCGCG GGCCCGCAAC CCGCGCGGCA TCGGCATCCG CGAACTCTTC GTCGAGGCGC CCGAGCGCGT GAAGCGCTGG CCGGCCGGCC CGCAGGAATC CTCGCCCTGG CGCCCGGTCT TCGCGGCCAT CGACAAGGTG CTGCAGAAGG TCGAGGGCTT CTTCCCGGCC GGCTCGCGGG CGCGGGCGAT CGACAAGGCG GTGGCCTTCG TCAGCGAGCG CCTGAACGGC GAGGACGGGC TCGGCGCGAT CTTCCCAGCC ATGGTCAACA CCGTGCTGAT GTTCGAGGCG CTCGGCTACC CGGACGACCA CCCCTTCGCG GTCACGGCCC GCTCTTCGGT CGAGAAGCTC GTCACCGTCA AGGAGCACGA GGCCTACGTC CAGCCCTGCC TGTCCCCGGT CTGGGACACG GCGCTCGCCG CCCACGCCCT GATGGAAGCC GGCGGGACCG AGGCGGAGCG CCACGCCAAG CGCGCCATGG ACTGGCTGAA GCCCCTGCAG GTGCTCGACA TCAAGGGCGA CTGGGCGGCC TCCAAGCCGG ACGTGCGGCC GGGCGGCTGG GCCTTCCAGT ACGCCAACCC GCACTACCCG GACCTCGACG ACACCGCGGT CGTGGTGATG GCCATGGACC GGGTGCAGAG CCGCCGCAGC CCCGGGCCCG ACGCGGCCGA TTACGGGCTC TCGATCGCCC GCGCCCGCGA ATGGGTCGAG GGCCTGCAGA GCCGCGACGG CGGCTGGGCG GCCTTCGACG CGGACAACAC CTACCACTAC CTCAACTACA TTCCGTTCTC GGATCACGGG GCGCTGCTCG ACCCGCCGAC CGCCGACGTG ACGGCGCGCT GCGTCTCGAT GCTGTCCCAG CTCGGCGAGA CCCGGGAGAC CTGCCCGCCC CTCGACCGCG GCGTCGCCTA CCTGCTCGCC GATCAGGAGG CGGATGGCAG CTGGTACGGC CGCTGGGGCA TGAACTACAT CTACGGGACG TGGTCGGTGC TCTGCGCCCT CAACGCGGCC GGGATCGACC CCGCCTGCGA GCCGGTGCGG CGGGCGGTGA CCTGGCTCAC CGCGATCCAG AACCCCGACG GCGGCTGGGG CGAGGACGCG TCGAGCTACA AGCTCGAATA TCGCGGCTAC GAGCGGGCGC CGAGCACGGC CTCGCAGACC GCCTGGGCGC TGCTCGCGCT GATGGCGGCC GGCGAGGCGG ACAACCCGGC CGTGGCGCGC GGCATCAACT ACCTGACCCG CACCCAGGGG GCGGACGGGC TCTGGGCCGA GGACCGCTAC ACGGCGACCG GGTTCCCGCG CGTCTTCTAC CTGCGCTACC ACGGCTACGC GAAGTTCTTC CCCCTCTGGG CGCTGGCCCG CTACCGCAAC CTCCAGCGGG GCAACAGCCT CAAGGTGGCG GTGGGGATGT GA
|
Protein sequence | MGKVETLHRT STQDITLDDV ERRVTLASKA LMRLANADGH WCFELEADAT IPSEYILYHH FRGSIPTAEL EGKIAAYLRR TQSAQHDGWA LIHDGPFDMS ATVKAYFALK MVGDPIDAPH MRRARDAILR RGGAAHANVF TRIMLALYGE VPWTAVPVMP VEVMLLPRWF PFHLDKVSYW ARTVMVPLFV LQAKKPRARN PRGIGIRELF VEAPERVKRW PAGPQESSPW RPVFAAIDKV LQKVEGFFPA GSRARAIDKA VAFVSERLNG EDGLGAIFPA MVNTVLMFEA LGYPDDHPFA VTARSSVEKL VTVKEHEAYV QPCLSPVWDT ALAAHALMEA GGTEAERHAK RAMDWLKPLQ VLDIKGDWAA SKPDVRPGGW AFQYANPHYP DLDDTAVVVM AMDRVQSRRS PGPDAADYGL SIARAREWVE GLQSRDGGWA AFDADNTYHY LNYIPFSDHG ALLDPPTADV TARCVSMLSQ LGETRETCPP LDRGVAYLLA DQEADGSWYG RWGMNYIYGT WSVLCALNAA GIDPACEPVR RAVTWLTAIQ NPDGGWGEDA SSYKLEYRGY ERAPSTASQT AWALLALMAA GEADNPAVAR GINYLTRTQG ADGLWAEDRY TATGFPRVFY LRYHGYAKFF PLWALARYRN LQRGNSLKVA VGM
|
| |