Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPB_1726 |
Symbol | |
ID | 3908251 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | + |
Start bp | 1967570 |
End bp | 1969534 |
Gene Length | 1965 bp |
Protein Length | 654 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 637883620 |
Product | squalene cyclase |
Protein accession | YP_485345 |
Protein GI | 86748849 |
COG category | [I] Lipid transport and metabolism |
COG ID | [COG1657] Squalene cyclase |
TIGRFAM ID | [TIGR01507] squalene-hopene cyclase [TIGR01787] squalene/oxidosqualene cyclases |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.00651785 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.158685 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACTTCCG GTACGACCAT TCTGGGCGCA GAGCGCGGCA GGACGCTCGA CGCCTCGATC GATGCGGCGC GTGCCGCGCT GCTCGGTTAT CGCCGCGACG ACGGCCATTG GGTGTTCGAG CTTGAGGCCG ATTGTACCAT TCCGGCCGAA TACGTGCTGC TGCGGCACTA TCTCGGCGAG CCGGTCGATG CCGCGCTGGA GGCCAAGATC GCGGTCTACC TGCGCCGGAC CCAGGGCGCG CATGGCGGCT GGCCGCTGGT GCACGACGGC GAGTTCGATG TCAGCGCGAC GGTAAAGGCC TACTTCGCCC TCAAGATGAT CGGCGACAGC ATCGACGCGC CGCATATGGC CAAGGCCCGC GAGGCGATCC TGGCGCGCGG CGGCGCGATC CACGTCAACG TGTTCACCCG CTTCCTTCTG TCGATGTTCG GCATTCTGAC CTGGCGCAGC GTGCCGGTGC TGCCGGTCGA GATCATGCTG CTGCCGATGT GGGCGCCGTT CCACCTCAAC AAGATCTCCT ATTGGGCGCG CACCACGATC GTGCCGCTGA TGGTGCTGGC GGCGCTGAAG CCGCGCGCGG TCAACAAGCT CGACATCGGC CTCGATGAAT TGTTCCTGCA GGACCCGCAA TCGATCGGCA TGCCGGCCAA GGCGCCGCAT CAGAGCTGGG GCCTGTTCAC GCTGTTCGGT TCGATCGACG CGGTGCTGCG GGTGATCGAG CCGCTGATCC CGAAAAAGCT GCGCAGTTAT GCGATCGGCC GCGCGGTCGC CTTCATCGAG GAGCGGCTGA ACGGCGAGGA CGGGCTCGGT GCGATCTATC CGCCGATGGC CAACACGGTG ATGATGTACA AGGTGCTGGG CTATGGTGAG GACCATCCGC CGCGCGCCAT CACCCGACGC GGCATCGACC TGCTGCTGGT GGTCGGCGAG GAGGAGGCCT ACTGCCAGCC CTGCGTCTCG CCGATCTGGG ACACCTCGCT GACCTGCCAC GCGCTGCTGG AGGCGGGCGG CGCCGAGGCC GCGCTGCCGG TGCGCAAGGG GCTGGACTGG CTGATTCCGA AGCAGGTGCT CGACCTCAAG GGCGACTGGG CGGTGAAGGC GCCCAACGTC CGCCCCGGGG GCTGGGCGTT CCAGTACAAC AATGCCCATT ACCCCGATCT GGACGACACC GCCGTGGTGG TTATGGCGCT CGACCGCGCC CGCCGCGATC AGCCGAGTGC GGCCTACGAC AATGCGATCG CGCGCGGGCG CGAGTGGATC GAGGGGATGC AGAGCGACGA TGGCGGCTGG GCTGCCTTCG ACGTGAACAA CACCGAGTAT TATTTGAACA ACATCCCGTT CTCGGATCAC GGCGCGCTGC TCGACCCGCC GACCGAGGAC GTCACTGCGC GCTGCGTCTC GATGCTGGCG CAGCTCGGCG AGACCGCGGA GACCAGCTCG GCGCTGGCCC GCGGCGTCGC CTATCTGCGC AAGACCCAAC TCGCCGAAGG CTCGTGGTAC GGCCGCTGGG GCCTGAATTA CATCTACGGA ACCTGGTCGG TGCTGTGCGC GCTGAATGCG GCCGGGGTCG CCCATCAGGA TCCGGCGATG CGCAAGGCGG TGGCCTGGCT GGCATCGATC CAGAATGCCG ATGGCGGCTG GGGCGAGGAT GCGGTCAGCT ATCGCCTGGA CTATCGGGGC TACGAAAGTG CACCGTCCAC GGCGTCCCAA ACGGCATGGG CCTTGCTTGC CTTGATGGCT GCCGGGGAAG TCGATCATCC GGCGGTGGCG CGCGGGGTTG AGTACCTAAA AGGCACACAG ACCGAAAAAG GCGTGTGGGA CGAGCAGCGC TACACCGCTA CAGGCTTTCC GCGGGTGTTT TATCTGCGGT ACCATGGCTA TTCAAAGTTC TTTCCGCTCT GGGCGCTGGC GCGGTATCGA AATTTGAGAG CCACGAACAG CAAGGTCGTA GGGGTCGGAA TGTGA
|
Protein sequence | MTSGTTILGA ERGRTLDASI DAARAALLGY RRDDGHWVFE LEADCTIPAE YVLLRHYLGE PVDAALEAKI AVYLRRTQGA HGGWPLVHDG EFDVSATVKA YFALKMIGDS IDAPHMAKAR EAILARGGAI HVNVFTRFLL SMFGILTWRS VPVLPVEIML LPMWAPFHLN KISYWARTTI VPLMVLAALK PRAVNKLDIG LDELFLQDPQ SIGMPAKAPH QSWGLFTLFG SIDAVLRVIE PLIPKKLRSY AIGRAVAFIE ERLNGEDGLG AIYPPMANTV MMYKVLGYGE DHPPRAITRR GIDLLLVVGE EEAYCQPCVS PIWDTSLTCH ALLEAGGAEA ALPVRKGLDW LIPKQVLDLK GDWAVKAPNV RPGGWAFQYN NAHYPDLDDT AVVVMALDRA RRDQPSAAYD NAIARGREWI EGMQSDDGGW AAFDVNNTEY YLNNIPFSDH GALLDPPTED VTARCVSMLA QLGETAETSS ALARGVAYLR KTQLAEGSWY GRWGLNYIYG TWSVLCALNA AGVAHQDPAM RKAVAWLASI QNADGGWGED AVSYRLDYRG YESAPSTASQ TAWALLALMA AGEVDHPAVA RGVEYLKGTQ TEKGVWDEQR YTATGFPRVF YLRYHGYSKF FPLWALARYR NLRATNSKVV GVGM
|
| |