Gene RPB_1726 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_1726 
Symbol 
ID3908251 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp1967570 
End bp1969534 
Gene Length1965 bp 
Protein Length654 aa 
Translation table11 
GC content66% 
IMG OID637883620 
Productsqualene cyclase 
Protein accessionYP_485345 
Protein GI86748849 
COG category[I] Lipid transport and metabolism 
COG ID[COG1657] Squalene cyclase 
TIGRFAM ID[TIGR01507] squalene-hopene cyclase
[TIGR01787] squalene/oxidosqualene cyclases 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00651785 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.158685 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTTCCG GTACGACCAT TCTGGGCGCA GAGCGCGGCA GGACGCTCGA CGCCTCGATC 
GATGCGGCGC GTGCCGCGCT GCTCGGTTAT CGCCGCGACG ACGGCCATTG GGTGTTCGAG
CTTGAGGCCG ATTGTACCAT TCCGGCCGAA TACGTGCTGC TGCGGCACTA TCTCGGCGAG
CCGGTCGATG CCGCGCTGGA GGCCAAGATC GCGGTCTACC TGCGCCGGAC CCAGGGCGCG
CATGGCGGCT GGCCGCTGGT GCACGACGGC GAGTTCGATG TCAGCGCGAC GGTAAAGGCC
TACTTCGCCC TCAAGATGAT CGGCGACAGC ATCGACGCGC CGCATATGGC CAAGGCCCGC
GAGGCGATCC TGGCGCGCGG CGGCGCGATC CACGTCAACG TGTTCACCCG CTTCCTTCTG
TCGATGTTCG GCATTCTGAC CTGGCGCAGC GTGCCGGTGC TGCCGGTCGA GATCATGCTG
CTGCCGATGT GGGCGCCGTT CCACCTCAAC AAGATCTCCT ATTGGGCGCG CACCACGATC
GTGCCGCTGA TGGTGCTGGC GGCGCTGAAG CCGCGCGCGG TCAACAAGCT CGACATCGGC
CTCGATGAAT TGTTCCTGCA GGACCCGCAA TCGATCGGCA TGCCGGCCAA GGCGCCGCAT
CAGAGCTGGG GCCTGTTCAC GCTGTTCGGT TCGATCGACG CGGTGCTGCG GGTGATCGAG
CCGCTGATCC CGAAAAAGCT GCGCAGTTAT GCGATCGGCC GCGCGGTCGC CTTCATCGAG
GAGCGGCTGA ACGGCGAGGA CGGGCTCGGT GCGATCTATC CGCCGATGGC CAACACGGTG
ATGATGTACA AGGTGCTGGG CTATGGTGAG GACCATCCGC CGCGCGCCAT CACCCGACGC
GGCATCGACC TGCTGCTGGT GGTCGGCGAG GAGGAGGCCT ACTGCCAGCC CTGCGTCTCG
CCGATCTGGG ACACCTCGCT GACCTGCCAC GCGCTGCTGG AGGCGGGCGG CGCCGAGGCC
GCGCTGCCGG TGCGCAAGGG GCTGGACTGG CTGATTCCGA AGCAGGTGCT CGACCTCAAG
GGCGACTGGG CGGTGAAGGC GCCCAACGTC CGCCCCGGGG GCTGGGCGTT CCAGTACAAC
AATGCCCATT ACCCCGATCT GGACGACACC GCCGTGGTGG TTATGGCGCT CGACCGCGCC
CGCCGCGATC AGCCGAGTGC GGCCTACGAC AATGCGATCG CGCGCGGGCG CGAGTGGATC
GAGGGGATGC AGAGCGACGA TGGCGGCTGG GCTGCCTTCG ACGTGAACAA CACCGAGTAT
TATTTGAACA ACATCCCGTT CTCGGATCAC GGCGCGCTGC TCGACCCGCC GACCGAGGAC
GTCACTGCGC GCTGCGTCTC GATGCTGGCG CAGCTCGGCG AGACCGCGGA GACCAGCTCG
GCGCTGGCCC GCGGCGTCGC CTATCTGCGC AAGACCCAAC TCGCCGAAGG CTCGTGGTAC
GGCCGCTGGG GCCTGAATTA CATCTACGGA ACCTGGTCGG TGCTGTGCGC GCTGAATGCG
GCCGGGGTCG CCCATCAGGA TCCGGCGATG CGCAAGGCGG TGGCCTGGCT GGCATCGATC
CAGAATGCCG ATGGCGGCTG GGGCGAGGAT GCGGTCAGCT ATCGCCTGGA CTATCGGGGC
TACGAAAGTG CACCGTCCAC GGCGTCCCAA ACGGCATGGG CCTTGCTTGC CTTGATGGCT
GCCGGGGAAG TCGATCATCC GGCGGTGGCG CGCGGGGTTG AGTACCTAAA AGGCACACAG
ACCGAAAAAG GCGTGTGGGA CGAGCAGCGC TACACCGCTA CAGGCTTTCC GCGGGTGTTT
TATCTGCGGT ACCATGGCTA TTCAAAGTTC TTTCCGCTCT GGGCGCTGGC GCGGTATCGA
AATTTGAGAG CCACGAACAG CAAGGTCGTA GGGGTCGGAA TGTGA
 
Protein sequence
MTSGTTILGA ERGRTLDASI DAARAALLGY RRDDGHWVFE LEADCTIPAE YVLLRHYLGE 
PVDAALEAKI AVYLRRTQGA HGGWPLVHDG EFDVSATVKA YFALKMIGDS IDAPHMAKAR
EAILARGGAI HVNVFTRFLL SMFGILTWRS VPVLPVEIML LPMWAPFHLN KISYWARTTI
VPLMVLAALK PRAVNKLDIG LDELFLQDPQ SIGMPAKAPH QSWGLFTLFG SIDAVLRVIE
PLIPKKLRSY AIGRAVAFIE ERLNGEDGLG AIYPPMANTV MMYKVLGYGE DHPPRAITRR
GIDLLLVVGE EEAYCQPCVS PIWDTSLTCH ALLEAGGAEA ALPVRKGLDW LIPKQVLDLK
GDWAVKAPNV RPGGWAFQYN NAHYPDLDDT AVVVMALDRA RRDQPSAAYD NAIARGREWI
EGMQSDDGGW AAFDVNNTEY YLNNIPFSDH GALLDPPTED VTARCVSMLA QLGETAETSS
ALARGVAYLR KTQLAEGSWY GRWGLNYIYG TWSVLCALNA AGVAHQDPAM RKAVAWLASI
QNADGGWGED AVSYRLDYRG YESAPSTASQ TAWALLALMA AGEVDHPAVA RGVEYLKGTQ
TEKGVWDEQR YTATGFPRVF YLRYHGYSKF FPLWALARYR NLRATNSKVV GVGM