Gene Sala_3135 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSala_3135 
Symbol 
ID4082391 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSphingopyxis alaskensis RB2256 
KingdomBacteria 
Replicon accessionNC_008048 
Strand
Start bp3286985 
End bp3288142 
Gene Length1158 bp 
Protein Length385 aa 
Translation table11 
GC content68% 
IMG OID638011520 
Productlycopene cyclase 
Protein accessionYP_618171 
Protein GI103488610 
COG category 
COG ID 
TIGRFAM ID[TIGR01789] lycopene cyclase
[TIGR01790] lycopene cyclase family protein 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTGGGGC CGAACATCGA TAAATGCGAC ATCGCGATCG TCGGCGGCGG GCTTGCGGGC 
GGACTTGCGG CGCTGGCGCT GGCGGCCAAG CGGCCCGATC TCGACATACG ACTGATCGAG
CCGGGGCCGG TCGGGGGCAA TCATATCTGG TCCTTTTTCG ACAGCGACAT CGCAAAGAAG
GACCGCTGGC TCGTCGCGCC GCTCGTCCGC CACCACTGGC CGCGCTATGA CGTGCGCTTT
CCCGCGCACG CGCGGACGCT GCACATGGGA TATAAAAGCA TCACCGGCGA AGCGCTGGCC
GAAGCGGTGG CGGCGGCGCT GCCGGACGGC CATATCATCG CCGATCGCGC GAAACATGTC
GCGCCCGACC ATCTGCTGCT GGCGCGCGGC GGGCGATTGT CGGCGAAGCA TGTCCTCGAC
GCGCGCGGCG CGGGCAAGTT TCCGACACTG GACTGCGGCT GGCAGAAGTT TGTCGGGCAG
GCGCTCACCG TGAAGGGCGG GCACGGCGTC GAACAGCCGG TGGTGATGGA CGCGACGGTG
GAGCAATTGG ACGGCTATCG CTTTGTCTAT CTCCTCCCCT TCGACGCCGA AACCCTGTTC
GTCGAGGACA CCTATTACAG CGACGACGCC GACCTCGACG AAATGGTGGT GCGCGAACGC
ATTGCCGCCT ATGCCGCGGC GCAGGGCTGG CAGGTGACGG CGACGATGCG CGAGGAGAGC
GGCGTGTTAC CAGTGGTGAT CGCGGGCGAT TTCGACCGGT TGTGGCCGGA ATCGGACCGC
ACGTCACGAA TCGGCGTGCG CGCAGGGATG TTCCACGCGA CGACGGGTTA TTCGCTGCCG
CACGCCGTAC GCACCGCGGC GGCGCTGCCC GCGCTGGTCG GTCGCGCCGA CCTGCCCGCG
CTGCTGCGCG CGCGCGCGCA GTCGGCGTGG CGGCGCCAGC GCTTTTACCG GATGCTGGAC
GCCATGCTGT TCCGCGCCGC CGATCCCGAT AGGCGTTACC GCATTTTCGA GCGATTCTAT
CGCCTGTCGC CGCGGCTCGT CGCGCGCTTC TATGCCGGGC GGTCGACCGC GGCGGACCGG
CTGCGCCTGC TTGCGGGAAA GCCGCCGGTG CCGGTCGGCC GCGCGCTGTC GGCGCTTGCA
AAACTGGATT GGAAATGA
 
Protein sequence
MVGPNIDKCD IAIVGGGLAG GLAALALAAK RPDLDIRLIE PGPVGGNHIW SFFDSDIAKK 
DRWLVAPLVR HHWPRYDVRF PAHARTLHMG YKSITGEALA EAVAAALPDG HIIADRAKHV
APDHLLLARG GRLSAKHVLD ARGAGKFPTL DCGWQKFVGQ ALTVKGGHGV EQPVVMDATV
EQLDGYRFVY LLPFDAETLF VEDTYYSDDA DLDEMVVRER IAAYAAAQGW QVTATMREES
GVLPVVIAGD FDRLWPESDR TSRIGVRAGM FHATTGYSLP HAVRTAAALP ALVGRADLPA
LLRARAQSAW RRQRFYRMLD AMLFRAADPD RRYRIFERFY RLSPRLVARF YAGRSTAADR
LRLLAGKPPV PVGRALSALA KLDWK