Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPC_1720 |
Symbol | |
ID | 3972373 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB18 |
Kingdom | Bacteria |
Replicon accession | NC_007925 |
Strand | + |
Start bp | 1866578 |
End bp | 1868542 |
Gene Length | 1965 bp |
Protein Length | 654 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 637924833 |
Product | squalene cyclase |
Protein accession | YP_531598 |
Protein GI | 90423228 |
COG category | [I] Lipid transport and metabolism |
COG ID | [COG1657] Squalene cyclase |
TIGRFAM ID | [TIGR01507] squalene-hopene cyclase [TIGR01787] squalene/oxidosqualene cyclases |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.0718096 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAGTCCG GGAACAACAA GCAGCCCGCG GCGGCAATCG GCGCTCTCGA TGCGAGCATC GAGAGCGCGA CCAACGCCTT GCTGGGCTAT CGGCAGCCCG ACGGGCACTG GGTGTTCGAA CTTGAGGCCG ACTGCACCAT TCCTGCGGAA TACGTGCTGC TGCGGCATTA CCTCGGCGAG CCGGTCGACG CCGCGTTGGA GGCCAAGATC GCCAACTATC TGCGCCGCGT GCAGGGCGCC CATGGCGGCT GGCCGCTGGT GCACGACGGC GGCTTCGACA TGAGCGCCAG CGTCAAGGGC TACTTCGCGC TGAAGATGAT CGGTGACGAC ATCGACGCGC CGCACATGGC GAAGGCGCGC GAGGCGATCC GCTCGCGCGG CGGCGCGATC CACAGCAACG TGTTCACCCG CTTCCTGCTG TCGATGTTCG GCATCACCAC CTGGCGCAGC GTGCCGGTGC TGCCGGTCGA GATCATGCTG CTGCCGATGT GGTCGCCGTT CCATCTCAAC AAGATCTCCT ATTGGGCGCG CACCACCATC GTGCCGCTGA TGGTGCTGGC GGCCTTGAAG CCGCGCGCGG TCAACCGGCT CGACATCGGA CTCGACGAAC TGTTCTTGCA GGATCCGAAG TCGATCAAGA TGCCGGCCAA GGCGCCGCAT CAGAGCTGGG CGCTGTTCAA GCTGTTCGCC GGCATCGATG CGGTGTTGCG CACGATCGAG CCGTTGTTCC CGAAGCGGCT GCGCGATCAT GCGATCAAGC TCGCGGTGGA TTTCGTCGAG GAGCGGCTGA ACGGCGAGGA CGGGCTCGGC GCGATCTATC CGCCGATGGC CAACACCGTG ATGATGTACA AGGTGCTGGG CTTTCCCGAG GATCATCCGC CGCGCGCGAT CACCCGGCGC GGCATCGACA AGCTGTTGGT GATCGGCGAG GACGAAGCCT ATTGCCAGCC TTGCGTGTCG CCGGTGTGGG ACACCGCGCT GACCTGCCAC GCGCTGCTCG AAGTCGGCGG CGAGGCGGCG GTGCCGCCGG CCAAGCGCGG TATGGACTGG CTGCTGCCCA AGCAGGTGCT CGACCTCAAG GGCGACTGGG CGGTGAAGCG GCCGAACCTG CGGCCCGGCG GCTGGGCGTT CCAGTACAAC AACGCGCACT ATCCAGACCT CGACGACACC GCGGTGGTGG TGATGGCGAT GGACCGCTCG CGCCGCGCCA CCGGCAGCCG CGAATATGAC GAGGCGATCG CCCGGGCCCG GGAGTGGATC GAGGGCATGC AGTCCGACGA CGGCGGCTGG GCGGCGTTCG ACGTCAACAA TCTGGAATAT TACCTCAACA ACATCCCGTT CTCCGACCAC GGCGCGATGC TCGACCCGCC GACCGAGGAC GTCACCGCGC GCTGTGTTTC GATGCTGTCA CAGCTCGGCG AGACCGCGGC GAGCAGCAAG GCGGTCGCCG ACGGCGTCGA ATATCTGCGC AGGACTCAGC TGCCGGACGG CTCCTGGTAC GGCCGCTGGG GGCTGAATTA CATCTACGGC ACCTGGTCGG TGCTGTGCGC GCTGAACGCC GCCGGGGTCG ATCATCAGGA TCCGGTGATT CGCAAGGCGG TGACCTGGCT GGCTTCGGTC CAGAACCCCG ACGGCGGTTG GGGCGAGGGT GCCGAGAGCT ACCGGCTGAA TTACACGCGA TACGAGCAGG CGCCGACCAC CGCCTCGCAG ACCTCATGGG CTTTGCTCGG CCTGATGGCG GCCGGTGAGG TGGATTCCCC CGTAGTTGCC CGCGGCGTGG AGTACCTAAA AAGCACACAG ACCGGAAAAG GGCTCTGGGA CGAGCAGCGA TACACCGCGA CGGGCTTTCC GCGGGTGTTT TATTTGCGTT ATCATGGCTA TGCGAAGTTC TTTCCGCTGT GGGCGCTGGC GCGGTATCGA AACCTGAGGA GCACCAACAG TAAGGTGGTA GGGGTCGGGA TGTGA
|
Protein sequence | MESGNNKQPA AAIGALDASI ESATNALLGY RQPDGHWVFE LEADCTIPAE YVLLRHYLGE PVDAALEAKI ANYLRRVQGA HGGWPLVHDG GFDMSASVKG YFALKMIGDD IDAPHMAKAR EAIRSRGGAI HSNVFTRFLL SMFGITTWRS VPVLPVEIML LPMWSPFHLN KISYWARTTI VPLMVLAALK PRAVNRLDIG LDELFLQDPK SIKMPAKAPH QSWALFKLFA GIDAVLRTIE PLFPKRLRDH AIKLAVDFVE ERLNGEDGLG AIYPPMANTV MMYKVLGFPE DHPPRAITRR GIDKLLVIGE DEAYCQPCVS PVWDTALTCH ALLEVGGEAA VPPAKRGMDW LLPKQVLDLK GDWAVKRPNL RPGGWAFQYN NAHYPDLDDT AVVVMAMDRS RRATGSREYD EAIARAREWI EGMQSDDGGW AAFDVNNLEY YLNNIPFSDH GAMLDPPTED VTARCVSMLS QLGETAASSK AVADGVEYLR RTQLPDGSWY GRWGLNYIYG TWSVLCALNA AGVDHQDPVI RKAVTWLASV QNPDGGWGEG AESYRLNYTR YEQAPTTASQ TSWALLGLMA AGEVDSPVVA RGVEYLKSTQ TGKGLWDEQR YTATGFPRVF YLRYHGYAKF FPLWALARYR NLRSTNSKVV GVGM
|
| |