Gene RPD_3573 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_3573 
Symbol 
ID4024087 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp3977754 
End bp3979718 
Gene Length1965 bp 
Protein Length654 aa 
Translation table11 
GC content65% 
IMG OID637963777 
Productsqualene cyclase 
Protein accessionYP_570697 
Protein GI91978038 
COG category[I] Lipid transport and metabolism 
COG ID[COG1657] Squalene cyclase 
TIGRFAM ID[TIGR01507] squalene-hopene cyclase
[TIGR01787] squalene/oxidosqualene cyclases 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.191218 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATTCCG GAACTTTCAA TCCGGGTGGA GAGCGCGGCA ACACGCTCGA CGCCTCGATC 
GACGCGGCGC GCGCCGCGCT GCTGGGTTAT CGTCGTGACG ACGGCCATTG GGTGTTCGAA
CTCGAGGCCG ACTGCACCAT TCCGGCCGAG TACGTGCTGC TCCGGCACTA TCTTGGTGAA
CCGATCGACG CCGCGCTGGA AGCCAAGATC GCCGTTTATC TGCGCCGGAC CCAGGGCGCA
CATGGCGGCT GGCCGCTGGT GTATGACGGC GAATTCGACA TGAGCGCCAC CGTGAAGGGC
TATTTCGCGC TCAAGATGAT CGGCGACAGC ATCGACGCGC CGCATATGGC CAAGGCGCGC
GAGGCGATCC TGTCGCGCGG CGGCGCGGTC CACGCCAACG TGTTCACGCG ATTCCTGCTG
GCGATGTTCG GCATCCTGAC CTGGCGCGCC GTTCCGGTGC TGCCGGTCGA GATCATGCTG
CTGCCGATGT GGTCGCCGTT CCATCTCAAC AAGATCTCGT ATTGGGCGCG CACCACGATC
GTGCCGCTGA TGGTGCTGGC GGCGCTGAAG CCGCGCGCGG TCAACCGGCT CGGCGTCGGG
CTCGACGAGC TGTTCCTGCA GGACCCGAAA TCGATCGGGA TGCCCGCCAG GGCGCCGCAT
CAGAATCGCG GCCTGTTCGC GCTGTTCGGT GCGATCGACG CGGTGCTGCG GGTGATCGAA
CCACTGATCC CGAAGAAGCT GCGGAAACAC GCGATCGACC GCGCCGTCGC CTTCGTCGAG
GAGCGGCTGA ACGGCGAGGA CGGTCTCGGC GCGATCTATC CGCCGATGGC CAACACCGTG
ATGATGTACA AGGTGCTCGG CTATCCCGAG GACCATCCGC CGCGGGCGAT CACCCGGCGC
GGCATCGATC TGCTGCTGGT GATCGGTGAG GAGGAGGCCT ATTGCCAGCC CTGCGTCTCG
CCGATCTGGG ACACTTCGCT GACCTGCCAC GCGCTGATCG AGGCGGGCGG CGCCGAGGCC
GCGCAGCCGG TGCGCGAGGG CTTGGACTGG CTGCTGCCGA AGCAGGTGCT CGACCTCAAG
GGCGACTGGG CGGTGAAGGC CCCCAATGTC CGCCCCGGCG GCTGGGCGTT CCAGTACAAC
AACGCCCATT ATCCCGATCT CGACGACACC GCGGTGGTGG TGATGGCGCT CGACCGCGCC
CGCCGCGATC AGCCGAGCGC GGCCTACGAC AATGCCATCG CCCGCGGCCG CGAATGGATC
GAGGGGATGC AGAGCGACGA CGGCGGCTGG GCTGCCTTCG ATGTGAACAA CACCGAATAT
TATTTGAACA ACATCCCGTT CTCGGACCAC GGCGCGATGC TCGATCCGCC GACCGAGGAC
GTAACCGCGC GTTGCGTTTC GATGTTGGCG CAACTCGGCG AGACCGAGCA GACCAGCAAG
GCGGTGGCGC GGGGCGTTGC CTATCTGCGC AAGACCCAGC TTCCGGATGG CTCGTGGTAC
GGCCGATGGG GCATGAACTA CATCTATGGC ACCTGGGCGG TGCTGTGCGC GCTGAACGCC
GCCGGCGTCG ATCATCAGGA CCCGGCGATC CGCAAGGCGG TCGCCTGGCT GGCGTCGATT
CAGAACGCCG ATGGCGGCTG GGGCGAGGAC GGGGTCAGCT ACCGGTTGGA CTACCGGGGC
TACGAAACTG CGCCGTCCAC GGCGTCGCAA ACGGCATGGG CCTTGCTTTC AATCATGGCT
GCAGGGGAAG TCGATCATCC GGCGGTGGCG CGCGGGATTG AGTACCTAAA AGGCACACAG
ACCGAAAAAG GACTGTGGGA CGAGCAGCGC CACACCGCTA CAGGCTTTCC GCGCGTGTTT
TATCTGCGGT ATCATGGCTA CTCAAAGTTC TTTCCGCTCT GGGCGCTGGC GCGGTATCGA
AATTTGAGAG CCACGAACAG CAAGGTCGTA GGGGTCGGAA TGTGA
 
Protein sequence
MDSGTFNPGG ERGNTLDASI DAARAALLGY RRDDGHWVFE LEADCTIPAE YVLLRHYLGE 
PIDAALEAKI AVYLRRTQGA HGGWPLVYDG EFDMSATVKG YFALKMIGDS IDAPHMAKAR
EAILSRGGAV HANVFTRFLL AMFGILTWRA VPVLPVEIML LPMWSPFHLN KISYWARTTI
VPLMVLAALK PRAVNRLGVG LDELFLQDPK SIGMPARAPH QNRGLFALFG AIDAVLRVIE
PLIPKKLRKH AIDRAVAFVE ERLNGEDGLG AIYPPMANTV MMYKVLGYPE DHPPRAITRR
GIDLLLVIGE EEAYCQPCVS PIWDTSLTCH ALIEAGGAEA AQPVREGLDW LLPKQVLDLK
GDWAVKAPNV RPGGWAFQYN NAHYPDLDDT AVVVMALDRA RRDQPSAAYD NAIARGREWI
EGMQSDDGGW AAFDVNNTEY YLNNIPFSDH GAMLDPPTED VTARCVSMLA QLGETEQTSK
AVARGVAYLR KTQLPDGSWY GRWGMNYIYG TWAVLCALNA AGVDHQDPAI RKAVAWLASI
QNADGGWGED GVSYRLDYRG YETAPSTASQ TAWALLSIMA AGEVDHPAVA RGIEYLKGTQ
TEKGLWDEQR HTATGFPRVF YLRYHGYSKF FPLWALARYR NLRATNSKVV GVGM