Gene Rru_A0062 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRru_A0062 
Symbol 
ID3834104 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodospirillum rubrum ATCC 11170 
KingdomBacteria 
Replicon accessionNC_007643 
Strand
Start bp69560 
End bp71530 
Gene Length1971 bp 
Protein Length656 aa 
Translation table11 
GC content66% 
IMG OID637824132 
Productterpene synthase, squalene cyclase 
Protein accessionYP_425154 
Protein GI83591402 
COG category[I] Lipid transport and metabolism 
COG ID[COG1657] Squalene cyclase 
TIGRFAM ID[TIGR01507] squalene-hopene cyclase
[TIGR01787] squalene/oxidosqualene cyclases 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGACGCGA CCGCACCCCT TCGGGATCCC GGCGCGCCGA GCGCGGAAAA CTGTTCCGTC 
GATCGGCGGG AACTTGACGA TGTCATCGGC GAATCCTGCC GCTGGCTTGG CGAGCGCCAG
AATCAGGACG GCCATTGGGT CTTCGAGCTG GAGGCCGATG CAACCATCCC TGCGGAATAT
ATTCTGCTCA ACCATTTCCT TGACGAGATC GACGATGCGC GCGAAGCCCG CATCGCCAGC
TATCTGCGCG CCATCCAGGG CAAGCATGGC GGCTGGCCGT TGTTCCACGA CGGCGATTTC
GACATGTCGG CCACCGTCAA AGCCTATTAC GCCCTGAAGC TGACGGGCGA TGGCGTTGAC
GAGCCCCATA TGGTGCGCGC CCGTCAGGCG ATTCTCGAGC ACGGCGGCGC CGAGCGGACC
AACGTCTTCA CTCGCTTCAC CCTGGCGATG TTCGATCAGG TTCCCTGGCG CGCCTGCCCG
GTGACGCCGG TCGAAGCGCT TCTGTTGCCG CGATTCGCCC CCTTCCACTG GAGCAAGGTC
AGCTACTGGT CGCGCACGGT GATGACGCCG CTGATGATCC TCTATTCGCG CCGCGCCCGC
GCCGTCAATC CGCGCGGCAT CGGCGTGCGC GAGCTGTTCC GCCGCGACCC CGAGGTCATC
CGCGACTGGC TGAAGAACCC CACCGGCCAT TGGATCGGCG ATGCGCTGAT CCAGATCGAC
AAGGTGCTGC GGGTCATCGA GCCGGCGATC CATTGGGCCT TCCGCGATCG CGCCGAGAAA
TGGGCGCTCG ATTTCATCGA AGAGCGGCTG AACGGCCGCG ATGGCCTGGG CGGCATCTAT
CCGGCCATCG CCAATACCCT GATGGCCTAT CACACCCTGG GCTACGCCAA GGACCATCCG
GGCTATCGCA TCGCCCGCGA GGCGGTGGAC GGCCTTTGCA CCCCCCATGC CAAGGGCGAA
TACGTTCAGC CCTGCCTGTC GCCGGTCTGG GACACCTGCC TTGCCAGCCA CGCCATCCAG
GAAGCCGGGC AGAGCGCCGG CGACCGGGCG GTGGACCAGT CCAACGCCTG GCTGCGCGAG
CGTCAGGTGC TTGACGTGGT CGGCGACTGG AAAAGCAACC GCGGCCATCT GCGCCCGGGC
GGCTGGGCCT TCCAGTACAA CAACCCCCAT TACCCCGATG TCGACGATAC GGCCGTGGTG
GTGATGGCCT TGGCGCGTTC GAAGGAAGAC GAGGCCAACC GCGAGGCCAT CGCCCGGGCC
GAGGAATGGA TCATCGGCAT GCAGTCGTCC AACGGCGGCT GGGGCGCCTT CGACGCCGAG
AACGAACACG ACTTCCTCAA CCATGTTCCC TTCGCCGATC ACGGCGCCCT GCTTGATCCG
CCGACCGTCG ACGTCTCGGC CCGCTGCCTG GGCATGCTGG CCCAGCTTGG CCGGCCCAAG
ACCGATCCGG TGGTGGCACG CGGTCTGGAT TATTTGTGGC GCGAACAGGA GGCCGACGGC
TCGTGGTTCG GCCGTTGGGG CACCAATTAC ATCTATGGAA CGTGGTCGGC CCTCAACGCC
TTCAACGCCG TCGAGTGGGA CATGACCGAC CCGCGCATCT GTAAGGCGGT GGATTGGCTG
AAAAGCCGCC AGCGCGACGA CGGCGGCTGG GGCGAGGATT GCGCCACCTA TTGGAAGGAG
CGGCGCTCGG TCAGCAAGGC CAGCACGCCC AGTCAAACCG CCTGGGCGGT GCTTGGCCTG
ATGGCGGCCG GCGAAGTCGA CAGCCCCGAG GTCGAACGCG GCATCCGCTA TCTGCTCGAG
GCGCCGCGCG ACGGCGGCAA ATGGGAAGAA GAGCTTTATA ACGCGGTGGG CTTCCCCCGG
ATCTTCTACC TGCGCTATCA TGGCTACTCG GCTTACTTCC CGCTCTGGGC CCTGGCGCGC
TATCGCAACC TGACCAGCGG CAACTGCAAG CGGACCATCC ACGGCATGTG A
 
Protein sequence
MDATAPLRDP GAPSAENCSV DRRELDDVIG ESCRWLGERQ NQDGHWVFEL EADATIPAEY 
ILLNHFLDEI DDAREARIAS YLRAIQGKHG GWPLFHDGDF DMSATVKAYY ALKLTGDGVD
EPHMVRARQA ILEHGGAERT NVFTRFTLAM FDQVPWRACP VTPVEALLLP RFAPFHWSKV
SYWSRTVMTP LMILYSRRAR AVNPRGIGVR ELFRRDPEVI RDWLKNPTGH WIGDALIQID
KVLRVIEPAI HWAFRDRAEK WALDFIEERL NGRDGLGGIY PAIANTLMAY HTLGYAKDHP
GYRIAREAVD GLCTPHAKGE YVQPCLSPVW DTCLASHAIQ EAGQSAGDRA VDQSNAWLRE
RQVLDVVGDW KSNRGHLRPG GWAFQYNNPH YPDVDDTAVV VMALARSKED EANREAIARA
EEWIIGMQSS NGGWGAFDAE NEHDFLNHVP FADHGALLDP PTVDVSARCL GMLAQLGRPK
TDPVVARGLD YLWREQEADG SWFGRWGTNY IYGTWSALNA FNAVEWDMTD PRICKAVDWL
KSRQRDDGGW GEDCATYWKE RRSVSKASTP SQTAWAVLGL MAAGEVDSPE VERGIRYLLE
APRDGGKWEE ELYNAVGFPR IFYLRYHGYS AYFPLWALAR YRNLTSGNCK RTIHGM