Gene RPC_1720 details

Gene Information

Locus tag: RPC_1720
Symbol:
ID: 3972373
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris BisB18
Kingdom: Bacteria
Replicon accession: NC_007925
Strand:
Start bp: 1866578
End bp: 1868542
Gene Length: 1965 bp
Protein Length: 654 aa
Translation table: 11
GC content: 66%
IMG OID: 637924833
Product: squalene cyclase
Protein accession: YP_531598
Protein GI: 90423228
COG category: [I] Lipid transport and metabolism
COG ID: [COG1657] Squalene cyclase
TIGRFAM ID: [TIGR01507] squalene-hopene cyclase
            [TIGR01787] squalene/oxidosqualene cyclases
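
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only. It assumes the Start bp and End bp values are 1-based and inclusive, and that Gene Length counts the terminal stop codon while Protein Length does not. The function names are hypothetical and not part of any database tool.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(1866578, 1868542)
print(length)                     # 1965
print(protein_length_aa(length))  # 654
```

Under these assumptions the computed values agree with the Gene Length (1965 bp) and Protein Length (654 aa) fields of the record.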


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.0718096
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGAGTCCG GGAACAACAA GCAGCCCGCG GCGGCAATCG GCGCTCTCGA TGCGAGCATC 
GAGAGCGCGA CCAACGCCTT GCTGGGCTAT CGGCAGCCCG ACGGGCACTG GGTGTTCGAA
CTTGAGGCCG ACTGCACCAT TCCTGCGGAA TACGTGCTGC TGCGGCATTA CCTCGGCGAG
CCGGTCGACG CCGCGTTGGA GGCCAAGATC GCCAACTATC TGCGCCGCGT GCAGGGCGCC
CATGGCGGCT GGCCGCTGGT GCACGACGGC GGCTTCGACA TGAGCGCCAG CGTCAAGGGC
TACTTCGCGC TGAAGATGAT CGGTGACGAC ATCGACGCGC CGCACATGGC GAAGGCGCGC
GAGGCGATCC GCTCGCGCGG CGGCGCGATC CACAGCAACG TGTTCACCCG CTTCCTGCTG
TCGATGTTCG GCATCACCAC CTGGCGCAGC GTGCCGGTGC TGCCGGTCGA GATCATGCTG
CTGCCGATGT GGTCGCCGTT CCATCTCAAC AAGATCTCCT ATTGGGCGCG CACCACCATC
GTGCCGCTGA TGGTGCTGGC GGCCTTGAAG CCGCGCGCGG TCAACCGGCT CGACATCGGA
CTCGACGAAC TGTTCTTGCA GGATCCGAAG TCGATCAAGA TGCCGGCCAA GGCGCCGCAT
CAGAGCTGGG CGCTGTTCAA GCTGTTCGCC GGCATCGATG CGGTGTTGCG CACGATCGAG
CCGTTGTTCC CGAAGCGGCT GCGCGATCAT GCGATCAAGC TCGCGGTGGA TTTCGTCGAG
GAGCGGCTGA ACGGCGAGGA CGGGCTCGGC GCGATCTATC CGCCGATGGC CAACACCGTG
ATGATGTACA AGGTGCTGGG CTTTCCCGAG GATCATCCGC CGCGCGCGAT CACCCGGCGC
GGCATCGACA AGCTGTTGGT GATCGGCGAG GACGAAGCCT ATTGCCAGCC TTGCGTGTCG
CCGGTGTGGG ACACCGCGCT GACCTGCCAC GCGCTGCTCG AAGTCGGCGG CGAGGCGGCG
GTGCCGCCGG CCAAGCGCGG TATGGACTGG CTGCTGCCCA AGCAGGTGCT CGACCTCAAG
GGCGACTGGG CGGTGAAGCG GCCGAACCTG CGGCCCGGCG GCTGGGCGTT CCAGTACAAC
AACGCGCACT ATCCAGACCT CGACGACACC GCGGTGGTGG TGATGGCGAT GGACCGCTCG
CGCCGCGCCA CCGGCAGCCG CGAATATGAC GAGGCGATCG CCCGGGCCCG GGAGTGGATC
GAGGGCATGC AGTCCGACGA CGGCGGCTGG GCGGCGTTCG ACGTCAACAA TCTGGAATAT
TACCTCAACA ACATCCCGTT CTCCGACCAC GGCGCGATGC TCGACCCGCC GACCGAGGAC
GTCACCGCGC GCTGTGTTTC GATGCTGTCA CAGCTCGGCG AGACCGCGGC GAGCAGCAAG
GCGGTCGCCG ACGGCGTCGA ATATCTGCGC AGGACTCAGC TGCCGGACGG CTCCTGGTAC
GGCCGCTGGG GGCTGAATTA CATCTACGGC ACCTGGTCGG TGCTGTGCGC GCTGAACGCC
GCCGGGGTCG ATCATCAGGA TCCGGTGATT CGCAAGGCGG TGACCTGGCT GGCTTCGGTC
CAGAACCCCG ACGGCGGTTG GGGCGAGGGT GCCGAGAGCT ACCGGCTGAA TTACACGCGA
TACGAGCAGG CGCCGACCAC CGCCTCGCAG ACCTCATGGG CTTTGCTCGG CCTGATGGCG
GCCGGTGAGG TGGATTCCCC CGTAGTTGCC CGCGGCGTGG AGTACCTAAA AAGCACACAG
ACCGGAAAAG GGCTCTGGGA CGAGCAGCGA TACACCGCGA CGGGCTTTCC GCGGGTGTTT
TATTTGCGTT ATCATGGCTA TGCGAAGTTC TTTCCGCTGT GGGCGCTGGC GCGGTATCGA
AACCTGAGGA GCACCAACAG TAAGGTGGTA GGGGTCGGGA TGTGA
 
Protein sequence
MESGNNKQPA AAIGALDASI ESATNALLGY RQPDGHWVFE LEADCTIPAE YVLLRHYLGE 
PVDAALEAKI ANYLRRVQGA HGGWPLVHDG GFDMSASVKG YFALKMIGDD IDAPHMAKAR
EAIRSRGGAI HSNVFTRFLL SMFGITTWRS VPVLPVEIML LPMWSPFHLN KISYWARTTI
VPLMVLAALK PRAVNRLDIG LDELFLQDPK SIKMPAKAPH QSWALFKLFA GIDAVLRTIE
PLFPKRLRDH AIKLAVDFVE ERLNGEDGLG AIYPPMANTV MMYKVLGFPE DHPPRAITRR
GIDKLLVIGE DEAYCQPCVS PVWDTALTCH ALLEVGGEAA VPPAKRGMDW LLPKQVLDLK
GDWAVKRPNL RPGGWAFQYN NAHYPDLDDT AVVVMAMDRS RRATGSREYD EAIARAREWI
EGMQSDDGGW AAFDVNNLEY YLNNIPFSDH GAMLDPPTED VTARCVSMLS QLGETAASSK
AVADGVEYLR RTQLPDGSWY GRWGLNYIYG TWSVLCALNA AGVDHQDPVI RKAVTWLASV
QNPDGGWGEG AESYRLNYTR YEQAPTTASQ TSWALLGLMA AGEVDSPVVA RGVEYLKSTQ
TGKGLWDEQR YTATGFPRVF YLRYHGYAKF FPLWALARYR NLRSTNSKVV GVGM
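
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It is a minimal example, not the method used to produce this record. It assumes translation table 11 assigns the same amino acids to codons as the standard code (the two differ only in which codons may act as start codons), and the `translate` helper is a hypothetical name. The input is the first 60 bases of the gene sequence listed above, with the 10-base grouping spaces left in.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, ignoring whitespace and stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


first_line = "ATGGAGTCCG GGAACAACAA GCAGCCCGCG GCGGCAATCG GCGCTCTCGA TGCGAGCATC"
print(translate(first_line))  # MESGNNKQPAAAIGALDASI
```

The output matches the first 20 residues of the protein sequence. A gene that begins with an alternative start codon would need its first residue set to M separately, which this sketch does not handle.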