Gene Information |
Locus tag | RPD_3573 |
Symbol | |
ID | 4024087 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB5 |
Kingdom | Bacteria |
Replicon accession | NC_007958 |
Strand | - |
Start bp | 3977754 |
End bp | 3979718 |
Gene Length | 1965 bp |
Protein Length | 654 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 637963777 |
Product | squalene cyclase |
Protein accession | YP_570697 |
Protein GI | 91978038 |
COG category | [I] Lipid transport and metabolism |
COG ID | [COG1657] Squalene cyclase |
TIGRFAM ID | [TIGR01507] squalene-hopene cyclase [TIGR01787] squalene/oxidosqualene cyclases |
| |
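The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the Gene Length counts the stop codon (reasonable for an unspliced bacterial CDS, but not stated in the record). The function names are hypothetical and only serve the illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len: int) -> int:
    """Residues encoded by an unspliced CDS whose length includes the stop codon (assumed)."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon is the stop codon and does not encode a residue.
    return gene_len // 3 - 1


if __name__ == "__main__":
    n = gene_length(3977754, 3979718)
    print(n, protein_length(n))  # prints "1965 654"
```

Under these assumptions the result matches the listed Gene Length (1965 bp) and Protein Length (654 aa).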
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.191218 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGATTCCG GAACTTTCAA TCCGGGTGGA GAGCGCGGCA ACACGCTCGA CGCCTCGATC GACGCGGCGC GCGCCGCGCT GCTGGGTTAT CGTCGTGACG ACGGCCATTG GGTGTTCGAA CTCGAGGCCG ACTGCACCAT TCCGGCCGAG TACGTGCTGC TCCGGCACTA TCTTGGTGAA CCGATCGACG CCGCGCTGGA AGCCAAGATC GCCGTTTATC TGCGCCGGAC CCAGGGCGCA CATGGCGGCT GGCCGCTGGT GTATGACGGC GAATTCGACA TGAGCGCCAC CGTGAAGGGC TATTTCGCGC TCAAGATGAT CGGCGACAGC ATCGACGCGC CGCATATGGC CAAGGCGCGC GAGGCGATCC TGTCGCGCGG CGGCGCGGTC CACGCCAACG TGTTCACGCG ATTCCTGCTG GCGATGTTCG GCATCCTGAC CTGGCGCGCC GTTCCGGTGC TGCCGGTCGA GATCATGCTG CTGCCGATGT GGTCGCCGTT CCATCTCAAC AAGATCTCGT ATTGGGCGCG CACCACGATC GTGCCGCTGA TGGTGCTGGC GGCGCTGAAG CCGCGCGCGG TCAACCGGCT CGGCGTCGGG CTCGACGAGC TGTTCCTGCA GGACCCGAAA TCGATCGGGA TGCCCGCCAG GGCGCCGCAT CAGAATCGCG GCCTGTTCGC GCTGTTCGGT GCGATCGACG CGGTGCTGCG GGTGATCGAA CCACTGATCC CGAAGAAGCT GCGGAAACAC GCGATCGACC GCGCCGTCGC CTTCGTCGAG GAGCGGCTGA ACGGCGAGGA CGGTCTCGGC GCGATCTATC CGCCGATGGC CAACACCGTG ATGATGTACA AGGTGCTCGG CTATCCCGAG GACCATCCGC CGCGGGCGAT CACCCGGCGC GGCATCGATC TGCTGCTGGT GATCGGTGAG GAGGAGGCCT ATTGCCAGCC CTGCGTCTCG CCGATCTGGG ACACTTCGCT GACCTGCCAC GCGCTGATCG AGGCGGGCGG CGCCGAGGCC GCGCAGCCGG TGCGCGAGGG CTTGGACTGG CTGCTGCCGA AGCAGGTGCT CGACCTCAAG GGCGACTGGG CGGTGAAGGC CCCCAATGTC CGCCCCGGCG GCTGGGCGTT CCAGTACAAC AACGCCCATT ATCCCGATCT CGACGACACC GCGGTGGTGG TGATGGCGCT CGACCGCGCC CGCCGCGATC AGCCGAGCGC GGCCTACGAC AATGCCATCG CCCGCGGCCG CGAATGGATC GAGGGGATGC AGAGCGACGA CGGCGGCTGG GCTGCCTTCG ATGTGAACAA CACCGAATAT TATTTGAACA ACATCCCGTT CTCGGACCAC GGCGCGATGC TCGATCCGCC GACCGAGGAC GTAACCGCGC GTTGCGTTTC GATGTTGGCG CAACTCGGCG AGACCGAGCA GACCAGCAAG GCGGTGGCGC GGGGCGTTGC CTATCTGCGC AAGACCCAGC TTCCGGATGG CTCGTGGTAC GGCCGATGGG GCATGAACTA CATCTATGGC ACCTGGGCGG TGCTGTGCGC GCTGAACGCC GCCGGCGTCG ATCATCAGGA CCCGGCGATC CGCAAGGCGG TCGCCTGGCT GGCGTCGATT CAGAACGCCG ATGGCGGCTG GGGCGAGGAC GGGGTCAGCT ACCGGTTGGA CTACCGGGGC TACGAAACTG CGCCGTCCAC GGCGTCGCAA ACGGCATGGG CCTTGCTTTC AATCATGGCT GCAGGGGAAG TCGATCATCC GGCGGTGGCG CGCGGGATTG AGTACCTAAA AGGCACACAG 
ACCGAAAAAG GACTGTGGGA CGAGCAGCGC CACACCGCTA CAGGCTTTCC GCGCGTGTTT TATCTGCGGT ATCATGGCTA CTCAAAGTTC TTTCCGCTCT GGGCGCTGGC GCGGTATCGA AATTTGAGAG CCACGAACAG CAAGGTCGTA GGGGTCGGAA TGTGA
|
Protein sequence | MDSGTFNPGG ERGNTLDASI DAARAALLGY RRDDGHWVFE LEADCTIPAE YVLLRHYLGE PIDAALEAKI AVYLRRTQGA HGGWPLVYDG EFDMSATVKG YFALKMIGDS IDAPHMAKAR EAILSRGGAV HANVFTRFLL AMFGILTWRA VPVLPVEIML LPMWSPFHLN KISYWARTTI VPLMVLAALK PRAVNRLGVG LDELFLQDPK SIGMPARAPH QNRGLFALFG AIDAVLRVIE PLIPKKLRKH AIDRAVAFVE ERLNGEDGLG AIYPPMANTV MMYKVLGYPE DHPPRAITRR GIDLLLVIGE EEAYCQPCVS PIWDTSLTCH ALIEAGGAEA AQPVREGLDW LLPKQVLDLK GDWAVKAPNV RPGGWAFQYN NAHYPDLDDT AVVVMALDRA RRDQPSAAYD NAIARGREWI EGMQSDDGGW AAFDVNNTEY YLNNIPFSDH GAMLDPPTED VTARCVSMLA QLGETEQTSK AVARGVAYLR KTQLPDGSWY GRWGMNYIYG TWAVLCALNA AGVDHQDPAI RKAVAWLASI QNADGGWGED GVSYRLDYRG YETAPSTASQ TAWALLSIMA AGEVDHPAVA RGIEYLKGTQ TEKGLWDEQR HTATGFPRVF YLRYHGYSKF FPLWALARYR NLRATNSKVV GVGM
|
| |
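The gene and protein sequences above can be cross-checked against each other and against the GC content field. The following is a minimal sketch in plain Python, not the database's own method. It assumes the gene sequence is given 5' to 3' on the coding strand (it begins with ATG and ends with TGA, so no reverse complement is applied), and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard table. The helper names `clean`, `gc_fraction` and `translate` are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order; table 11 uses the same
# codon-to-residue assignments as the standard table.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence in blocks of 10."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


if __name__ == "__main__":
    # First 30 bases of the gene sequence listed above.
    prefix = "ATGGATTCCG GAACTTTCAA TCCGGGTGGA"
    print(translate(prefix))  # prints "MDSGTFNPGG"
```

Passing the full gene sequence to `translate` and comparing the result with the listed protein sequence (after removing its spaces) is one way to confirm the record is internally consistent; `gc_fraction` on the same input can be compared with the listed GC content of 65%.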