Gene Rpal_4261 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRpal_4261 
Symbol 
ID6411945 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris TIE-1 
KingdomBacteria 
Replicon accessionNC_011004 
Strand
Start bp4581257 
End bp4583218 
Gene Length1962 bp 
Protein Length653 aa 
Translation table11 
GC content65% 
IMG OID642714143 
Productsqualene-hopene cyclase 
Protein accessionYP_001993232 
Protein GI192292627 
COG category[I] Lipid transport and metabolism 
COG ID[COG1657] Squalene cyclase 
TIGRFAM ID[TIGR01507] squalene-hopene cyclase
[TIGR01787] squalene/oxidosqualene cyclases 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.942083 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGATTCCG GCAGCTACAC GACTGGTGTG GAGCGCAACG CGCTCGAAGC TTCGATCGAT 
GCGGCGCGCA GCGCGCTGCT GAATTATCGT CGCGACGATG GCCATTGGGT GTTCGAACTC
GAGGCCGATT GCACCATTCC TGCCGAATAC GTGCTGCTGC GGCATTACCT CGGCGAGCCG
GTCGATGCCG AGCTCGAAGC CAAGATCGCG GTTTATCTGC GCCGCATCCA GGGTGCCCAT
GGCGGCTGGC CGCTGGTGCA CGACGGCGAC TTCGACATGA GCGCCAGCGT GAAGGGTTAC
TTCGCGCTGA AGATGATCGG CGACAGCATC GATGCCCCGC ATATGGTGCG GGCGCGCGAG
GCGATCCGTT CGCGCGGCGG CGCGATCCAC TCCAACGTCT TCACCCGGTT TCTGCTCACG
TTGTACGGCG TTACGACCTG GCGCGCGGTT CCGGTACTGC CGGTCGAGAT CATGCTGCTG
CCGAGCTGGT CGCCGTTCAC ACTGACCAAG ATCTCGTATT GGGCGCGTAC CACGATGGTG
CCGCTGCTCG TGCTGTGCGC GCTGAAGCCG CAGGCCAAGA ATCCGAAGGG CGTCGGCATC
GACGAACTAT TCCTTCAGGA CCCGAAGACG ATCGGGATGC CGGTCAAGGC GCCGCATCAG
AACTGGGCGC TGTTCAAGCT GTTCGGATCG ATCGACGCGG TGCTGCGCGT GATCGAGCCT
GTGATGCCCA AAGGCATCCG CAAGCGCGCG ATCGACAAGG CGCTCGCCTT CATCGAGGAG
CGGCTCAACG GCGAGGACGG CATGGGCGCG ATCTTCCCGC CGATGGCCAA CGCCGTGATG
ATGTACGAGG CGCTCGGCTA TCCCGAGGAC TATCCGCCGC GCGCCAGCCA GCGCCGCGGC
ATTGATCTCT TGCTGGTCGA TCGCGGCGAC GAAGCCTACT GCCAGCCCTG CGTGTCGCCG
GTGTGGGACA CCGCGCTCGC CAGCCATGCG GTGCTCGAGG CGGACGGTCA CGAGGGCGCC
AAGTCGGTGC GGCCGGCGCT CGACTGGCTG CTCCCGCGCC AGGTGCTCGA CGTCAAGGGC
GACTGGGCCG TCAAGGCCCC GAACGTCCGC CCCGGCGGCT GGGCGTTCCA GTACAACAAC
GCCCACTATC CGGATCTCGA CGATACCGCG GTGGTGGTGA TGGCGCTCGA CCGCGCCCGC
AAGGACCAGC CGAATCCCGC CTACGATGCC GCGATTGCCC GCGCCCGCGA GTGGATCGAG
GGGATGCAGA GCGACGATGG CGGCTGGGGT GCCTTCGACA TCAACAACAC TGAGTATTAT
TTGAACAACA TCCCGTTCTC GGACCATGGC GCGATGCTCG ATCCGCCGAC CGAGGACGTC
ACCGCGCGCT GCGTCTCGAT GCTGGCTCAG CTCGGTGAGA CCATGGACAG CAGCCCGGCG
CTGGCCCGCG CCGTCGGCTA TCTGCGCGAC ACCCAGCTCG CCGAGGGCTC CTGGTACGGC
CGCTGGGGCA TGAATTACAT CTACGGCACC TGGTCGGTGC TGTGCGCCCT CAACGCCGCC
GGCGTTCCCC ATGCCGATCC GATGATCCGC AAGGCGGTCG CCTGGCTGGA GTCGGTGCAG
AATCGCGACG GCGGCTGGGG CGAGGACGCG GTCAGCTACC GACTGGATTA CCGCGGCTAC
GAAAGTGCAC CTTCGACCGC CTCTCAGACG GCATGGGCTT TGCTTGCTCT GATGGCTGCG
GGTGAGGTCG ATCATCCCGC CGTGGCACGG GGCATCGAGT ACCTGAAAAG CACACAGACC
GAAAAAGGAC TGTGGGACGA GCAGCGTTAC ACGGCGACGG GCTTCCCGCG GGTGTTTTAT
CTGCGGTATC ATGGCTATTC GAAGTTCTTC CCACTCTGGG CGCTCGCCCG GTATCGGAAC
TTGCAGGCCA CGAACAGCAA GGTGGTAGGG GTCGGAATGT GA
 
Protein sequence
MDSGSYTTGV ERNALEASID AARSALLNYR RDDGHWVFEL EADCTIPAEY VLLRHYLGEP 
VDAELEAKIA VYLRRIQGAH GGWPLVHDGD FDMSASVKGY FALKMIGDSI DAPHMVRARE
AIRSRGGAIH SNVFTRFLLT LYGVTTWRAV PVLPVEIMLL PSWSPFTLTK ISYWARTTMV
PLLVLCALKP QAKNPKGVGI DELFLQDPKT IGMPVKAPHQ NWALFKLFGS IDAVLRVIEP
VMPKGIRKRA IDKALAFIEE RLNGEDGMGA IFPPMANAVM MYEALGYPED YPPRASQRRG
IDLLLVDRGD EAYCQPCVSP VWDTALASHA VLEADGHEGA KSVRPALDWL LPRQVLDVKG
DWAVKAPNVR PGGWAFQYNN AHYPDLDDTA VVVMALDRAR KDQPNPAYDA AIARAREWIE
GMQSDDGGWG AFDINNTEYY LNNIPFSDHG AMLDPPTEDV TARCVSMLAQ LGETMDSSPA
LARAVGYLRD TQLAEGSWYG RWGMNYIYGT WSVLCALNAA GVPHADPMIR KAVAWLESVQ
NRDGGWGEDA VSYRLDYRGY ESAPSTASQT AWALLALMAA GEVDHPAVAR GIEYLKSTQT
EKGLWDEQRY TATGFPRVFY LRYHGYSKFF PLWALARYRN LQATNSKVVG VGM