Gene Information |
Locus tag | Rru_A0062 |
Symbol | |
ID | 3834104 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodospirillum rubrum ATCC 11170 |
Kingdom | Bacteria |
Replicon accession | NC_007643 |
Strand | - |
Start bp | 69560 |
End bp | 71530 |
Gene Length | 1971 bp |
Protein Length | 656 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 637824132 |
Product | terpene synthase, squalene cyclase |
Protein accession | YP_425154 |
Protein GI | 83591402 |
COG category | [I] Lipid transport and metabolism |
COG ID | [COG1657] Squalene cyclase |
TIGRFAM ID | [TIGR01507] squalene-hopene cyclase; [TIGR01787] squalene/oxidosqualene cyclases |
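The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are hypothetical and not part of any database API.

```python
# Values copied from the gene record above.
start_bp = 69560
end_bp = 71530
gene_length_bp = 1971
protein_length_aa = 656


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons in the CDS minus one terminal stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Compare the derived values with the ones reported in the record.
coords_consistent = span_length(start_bp, end_bp) == gene_length_bp
protein_consistent = expected_protein_length(gene_length_bp) == protein_length_aa
print(coords_consistent, protein_consistent)  # True True
```

Under these assumptions, 71530 - 69560 + 1 gives 1971 bp, and 1971 / 3 - 1 gives 656 aa, matching the reported lengths.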
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGGACGCGA CCGCACCCCT TCGGGATCCC GGCGCGCCGA GCGCGGAAAA CTGTTCCGTC GATCGGCGGG AACTTGACGA TGTCATCGGC GAATCCTGCC GCTGGCTTGG CGAGCGCCAG AATCAGGACG GCCATTGGGT CTTCGAGCTG GAGGCCGATG CAACCATCCC TGCGGAATAT ATTCTGCTCA ACCATTTCCT TGACGAGATC GACGATGCGC GCGAAGCCCG CATCGCCAGC TATCTGCGCG CCATCCAGGG CAAGCATGGC GGCTGGCCGT TGTTCCACGA CGGCGATTTC GACATGTCGG CCACCGTCAA AGCCTATTAC GCCCTGAAGC TGACGGGCGA TGGCGTTGAC GAGCCCCATA TGGTGCGCGC CCGTCAGGCG ATTCTCGAGC ACGGCGGCGC CGAGCGGACC AACGTCTTCA CTCGCTTCAC CCTGGCGATG TTCGATCAGG TTCCCTGGCG CGCCTGCCCG GTGACGCCGG TCGAAGCGCT TCTGTTGCCG CGATTCGCCC CCTTCCACTG GAGCAAGGTC AGCTACTGGT CGCGCACGGT GATGACGCCG CTGATGATCC TCTATTCGCG CCGCGCCCGC GCCGTCAATC CGCGCGGCAT CGGCGTGCGC GAGCTGTTCC GCCGCGACCC CGAGGTCATC CGCGACTGGC TGAAGAACCC CACCGGCCAT TGGATCGGCG ATGCGCTGAT CCAGATCGAC AAGGTGCTGC GGGTCATCGA GCCGGCGATC CATTGGGCCT TCCGCGATCG CGCCGAGAAA TGGGCGCTCG ATTTCATCGA AGAGCGGCTG AACGGCCGCG ATGGCCTGGG CGGCATCTAT CCGGCCATCG CCAATACCCT GATGGCCTAT CACACCCTGG GCTACGCCAA GGACCATCCG GGCTATCGCA TCGCCCGCGA GGCGGTGGAC GGCCTTTGCA CCCCCCATGC CAAGGGCGAA TACGTTCAGC CCTGCCTGTC GCCGGTCTGG GACACCTGCC TTGCCAGCCA CGCCATCCAG GAAGCCGGGC AGAGCGCCGG CGACCGGGCG GTGGACCAGT CCAACGCCTG GCTGCGCGAG CGTCAGGTGC TTGACGTGGT CGGCGACTGG AAAAGCAACC GCGGCCATCT GCGCCCGGGC GGCTGGGCCT TCCAGTACAA CAACCCCCAT TACCCCGATG TCGACGATAC GGCCGTGGTG GTGATGGCCT TGGCGCGTTC GAAGGAAGAC GAGGCCAACC GCGAGGCCAT CGCCCGGGCC GAGGAATGGA TCATCGGCAT GCAGTCGTCC AACGGCGGCT GGGGCGCCTT CGACGCCGAG AACGAACACG ACTTCCTCAA CCATGTTCCC TTCGCCGATC ACGGCGCCCT GCTTGATCCG CCGACCGTCG ACGTCTCGGC CCGCTGCCTG GGCATGCTGG CCCAGCTTGG CCGGCCCAAG ACCGATCCGG TGGTGGCACG CGGTCTGGAT TATTTGTGGC GCGAACAGGA GGCCGACGGC TCGTGGTTCG GCCGTTGGGG CACCAATTAC ATCTATGGAA CGTGGTCGGC CCTCAACGCC TTCAACGCCG TCGAGTGGGA CATGACCGAC CCGCGCATCT GTAAGGCGGT GGATTGGCTG AAAAGCCGCC AGCGCGACGA CGGCGGCTGG GGCGAGGATT GCGCCACCTA TTGGAAGGAG CGGCGCTCGG TCAGCAAGGC CAGCACGCCC AGTCAAACCG CCTGGGCGGT GCTTGGCCTG ATGGCGGCCG GCGAAGTCGA CAGCCCCGAG GTCGAACGCG GCATCCGCTA TCTGCTCGAG 
GCGCCGCGCG ACGGCGGCAA ATGGGAAGAA GAGCTTTATA ACGCGGTGGG CTTCCCCCGG ATCTTCTACC TGCGCTATCA TGGCTACTCG GCTTACTTCC CGCTCTGGGC CCTGGCGCGC TATCGCAACC TGACCAGCGG CAACTGCAAG CGGACCATCC ACGGCATGTG A
Protein sequence | MDATAPLRDP GAPSAENCSV DRRELDDVIG ESCRWLGERQ NQDGHWVFEL EADATIPAEY ILLNHFLDEI DDAREARIAS YLRAIQGKHG GWPLFHDGDF DMSATVKAYY ALKLTGDGVD EPHMVRARQA ILEHGGAERT NVFTRFTLAM FDQVPWRACP VTPVEALLLP RFAPFHWSKV SYWSRTVMTP LMILYSRRAR AVNPRGIGVR ELFRRDPEVI RDWLKNPTGH WIGDALIQID KVLRVIEPAI HWAFRDRAEK WALDFIEERL NGRDGLGGIY PAIANTLMAY HTLGYAKDHP GYRIAREAVD GLCTPHAKGE YVQPCLSPVW DTCLASHAIQ EAGQSAGDRA VDQSNAWLRE RQVLDVVGDW KSNRGHLRPG GWAFQYNNPH YPDVDDTAVV VMALARSKED EANREAIARA EEWIIGMQSS NGGWGAFDAE NEHDFLNHVP FADHGALLDP PTVDVSARCL GMLAQLGRPK TDPVVARGLD YLWREQEADG SWFGRWGTNY IYGTWSALNA FNAVEWDMTD PRICKAVDWL KSRQRDDGGW GEDCATYWKE RRSVSKASTP SQTAWAVLGL MAAGEVDSPE VERGIRYLLE APRDGGKWEE ELYNAVGFPR IFYLRYHGYS AYFPLWALAR YRNLTSGNCK RTIHGM
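The sequences above are printed in space-separated blocks of ten characters, so they need to be cleaned before any computation. The sketch below is one possible way to do that and to compute a GC fraction; the helper names are hypothetical, and the excerpt is only the first two blocks of the gene sequence, so its GC fraction is not expected to match the 66% reported for the full gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace (block separators, line breaks) and uppercase."""
    return "".join(raw.split()).upper()


def gc_fraction(raw: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First two ten-base blocks of the gene sequence in the record above.
excerpt = "ATGGACGCGA CCGCACCCCT"
print(len(clean_sequence(excerpt)))  # 20
print(gc_fraction(excerpt))          # 0.7
```

Passing the full gene sequence string to `gc_fraction` would give a value to compare against the GC content field in the record.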