Gene Information |
Locus tag | Rpal_4261 |
Symbol | |
ID | 6411945 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris TIE-1 |
Kingdom | Bacteria |
Replicon accession | NC_011004 |
Strand | - |
Start bp | 4581257 |
End bp | 4583218 |
Gene Length | 1962 bp |
Protein Length | 653 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 642714143 |
Product | squalene-hopene cyclase |
Protein accession | YP_001993232 |
Protein GI | 192292627 |
COG category | [I] Lipid transport and metabolism |
COG ID | [COG1657] Squalene cyclase |
TIGRFAM ID | [TIGR01507] squalene-hopene cyclase; [TIGR01787] squalene/oxidosqualene cyclases |
|
|
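The coordinate and length fields in the table above can be cross-checked with a little arithmetic. The sketch below is only an illustration: the variable names are made up for this example, and it assumes that the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon.

```python
# Values copied from the Gene Information table.
start_bp = 4581257
end_bp = 4583218
gene_length_bp = 1962
protein_length_aa = 653

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# An unspliced CDS has one codon per residue plus (assumed) one terminal stop codon.
codons = gene_length_bp // 3
expected_protein_aa = codons - 1

print(span_bp)              # 1962
print(gene_length_bp % 3)   # 0, the length is a whole number of codons
print(expected_protein_aa)  # 653
```

Under these assumptions the three fields agree: the span equals the reported gene length, and 654 codons minus the stop codon give the reported 653 aa.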
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.942083 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGATTCCG GCAGCTACAC GACTGGTGTG GAGCGCAACG CGCTCGAAGC TTCGATCGAT GCGGCGCGCA GCGCGCTGCT GAATTATCGT CGCGACGATG GCCATTGGGT GTTCGAACTC GAGGCCGATT GCACCATTCC TGCCGAATAC GTGCTGCTGC GGCATTACCT CGGCGAGCCG GTCGATGCCG AGCTCGAAGC CAAGATCGCG GTTTATCTGC GCCGCATCCA GGGTGCCCAT GGCGGCTGGC CGCTGGTGCA CGACGGCGAC TTCGACATGA GCGCCAGCGT GAAGGGTTAC TTCGCGCTGA AGATGATCGG CGACAGCATC GATGCCCCGC ATATGGTGCG GGCGCGCGAG GCGATCCGTT CGCGCGGCGG CGCGATCCAC TCCAACGTCT TCACCCGGTT TCTGCTCACG TTGTACGGCG TTACGACCTG GCGCGCGGTT CCGGTACTGC CGGTCGAGAT CATGCTGCTG CCGAGCTGGT CGCCGTTCAC ACTGACCAAG ATCTCGTATT GGGCGCGTAC CACGATGGTG CCGCTGCTCG TGCTGTGCGC GCTGAAGCCG CAGGCCAAGA ATCCGAAGGG CGTCGGCATC GACGAACTAT TCCTTCAGGA CCCGAAGACG ATCGGGATGC CGGTCAAGGC GCCGCATCAG AACTGGGCGC TGTTCAAGCT GTTCGGATCG ATCGACGCGG TGCTGCGCGT GATCGAGCCT GTGATGCCCA AAGGCATCCG CAAGCGCGCG ATCGACAAGG CGCTCGCCTT CATCGAGGAG CGGCTCAACG GCGAGGACGG CATGGGCGCG ATCTTCCCGC CGATGGCCAA CGCCGTGATG ATGTACGAGG CGCTCGGCTA TCCCGAGGAC TATCCGCCGC GCGCCAGCCA GCGCCGCGGC ATTGATCTCT TGCTGGTCGA TCGCGGCGAC GAAGCCTACT GCCAGCCCTG CGTGTCGCCG GTGTGGGACA CCGCGCTCGC CAGCCATGCG GTGCTCGAGG CGGACGGTCA CGAGGGCGCC AAGTCGGTGC GGCCGGCGCT CGACTGGCTG CTCCCGCGCC AGGTGCTCGA CGTCAAGGGC GACTGGGCCG TCAAGGCCCC GAACGTCCGC CCCGGCGGCT GGGCGTTCCA GTACAACAAC GCCCACTATC CGGATCTCGA CGATACCGCG GTGGTGGTGA TGGCGCTCGA CCGCGCCCGC AAGGACCAGC CGAATCCCGC CTACGATGCC GCGATTGCCC GCGCCCGCGA GTGGATCGAG GGGATGCAGA GCGACGATGG CGGCTGGGGT GCCTTCGACA TCAACAACAC TGAGTATTAT TTGAACAACA TCCCGTTCTC GGACCATGGC GCGATGCTCG ATCCGCCGAC CGAGGACGTC ACCGCGCGCT GCGTCTCGAT GCTGGCTCAG CTCGGTGAGA CCATGGACAG CAGCCCGGCG CTGGCCCGCG CCGTCGGCTA TCTGCGCGAC ACCCAGCTCG CCGAGGGCTC CTGGTACGGC CGCTGGGGCA TGAATTACAT CTACGGCACC TGGTCGGTGC TGTGCGCCCT CAACGCCGCC GGCGTTCCCC ATGCCGATCC GATGATCCGC AAGGCGGTCG CCTGGCTGGA GTCGGTGCAG AATCGCGACG GCGGCTGGGG CGAGGACGCG GTCAGCTACC GACTGGATTA CCGCGGCTAC GAAAGTGCAC CTTCGACCGC CTCTCAGACG GCATGGGCTT TGCTTGCTCT GATGGCTGCG GGTGAGGTCG ATCATCCCGC CGTGGCACGG GGCATCGAGT ACCTGAAAAG CACACAGACC 
GAAAAAGGAC TGTGGGACGA GCAGCGTTAC ACGGCGACGG GCTTCCCGCG GGTGTTTTAT CTGCGGTATC ATGGCTATTC GAAGTTCTTC CCACTCTGGG CGCTCGCCCG GTATCGGAAC TTGCAGGCCA CGAACAGCAA GGTGGTAGGG GTCGGAATGT GA
|
Protein sequence | MDSGSYTTGV ERNALEASID AARSALLNYR RDDGHWVFEL EADCTIPAEY VLLRHYLGEP VDAELEAKIA VYLRRIQGAH GGWPLVHDGD FDMSASVKGY FALKMIGDSI DAPHMVRARE AIRSRGGAIH SNVFTRFLLT LYGVTTWRAV PVLPVEIMLL PSWSPFTLTK ISYWARTTMV PLLVLCALKP QAKNPKGVGI DELFLQDPKT IGMPVKAPHQ NWALFKLFGS IDAVLRVIEP VMPKGIRKRA IDKALAFIEE RLNGEDGMGA IFPPMANAVM MYEALGYPED YPPRASQRRG IDLLLVDRGD EAYCQPCVSP VWDTALASHA VLEADGHEGA KSVRPALDWL LPRQVLDVKG DWAVKAPNVR PGGWAFQYNN AHYPDLDDTA VVVMALDRAR KDQPNPAYDA AIARAREWIE GMQSDDGGWG AFDINNTEYY LNNIPFSDHG AMLDPPTEDV TARCVSMLAQ LGETMDSSPA LARAVGYLRD TQLAEGSWYG RWGMNYIYGT WSVLCALNAA GVPHADPMIR KAVAWLESVQ NRDGGWGEDA VSYRLDYRGY ESAPSTASQT AWALLALMAA GEVDHPAVAR GIEYLKSTQT EKGLWDEQRY TATGFPRVFY LRYHGYSKFF PLWALARYRN LQATNSKVVG VGM
|
| |
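The relationship between the gene sequence and the protein sequence can be sketched with a small translation routine. This is a minimal illustration, not the database's own method: `translate` and `CODON_TABLE` are hypothetical names, and the code assumes the gene sequence is listed in coding orientation (it starts with ATG and ends with TGA even though the gene lies on the minus strand). Translation table 11 assigns the same amino acids to codons as the standard code and differs only in its permitted start codons, so the standard codon assignments are used here.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering (first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in the record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 12 bases of the gene sequence in the record.
first_block = "ATGGATTCCG GCAGCTACAC GACTGGTGTG"
last_block = "GTCGGAATGT GA"

print(translate(first_block))  # MDSGSYTTGV
print(translate(last_block))   # VGM
```

Both fragments match the corresponding ends of the protein sequence in the record: it begins with MDSGSYTTGV and ends with VGM, followed by the TGA stop codon.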