Gene Swit_3200 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSwit_3200 
Symbol 
ID5198204 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSphingomonas wittichii RW1 
KingdomBacteria 
Replicon accessionNC_009511 
Strand
Start bp3516709 
End bp3517851 
Gene Length1143 bp 
Protein Length380 aa 
Translation table11 
GC content71% 
IMG OID640582746 
Productlycopene cyclase 
Protein accessionYP_001263685 
Protein GI148556103 
COG category[C] Energy production and conversion 
COG ID[COG0644] Dehydrogenases (flavoproteins) 
TIGRFAM ID[TIGR01789] lycopene cyclase
[TIGR01790] lycopene cyclase family protein 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value0.696424 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.730117 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGGCG TCATGTGCGA TCTCGCCATC ATCGGCGGCG GCCTTTCGGG CGGACTGATC 
GCATTGGCCG CGCGCCGGGC GCGGCCCGAT CGCCGCGTCC TGCTGATCGA GGCGGCGCCG
ACGCTGGGCG GCGCGCACCT CTGGTCCTTC CTCGACAGCG ACGTGGAAGC GGAGGACCGG
CCGCTGCTCG AACCGCTGGT CAACTATGGC TGGCGCGAGT TCGCGGTGGT CTTCCCGCTC
TACAAGCGGG CGCTGCCCTT CCCCTTCTAC AGCATCCGTT CCGACCGGTT CGACGCGGCG
CTGCGCGAGG CGATGCCGGC CGAAAGCATC CTGACCGGGC GGCGCGCGCT GGGCGTCGCC
CCCGACGGGG TGGTGCTGGA GGACGGGACG CTGATCGGGG CGGGCGGCGT GATCGACGCG
CGCGGGCCGG GCGACCTGTC GCTGCTAGAC CTCCATTACC GCAAGTTCGT CGGCTATGAG
CTGACCCTCG ACGGCCCGCA CAGCGTGCGC CGCGCGACGC TGATCGACGC CGCCGCCGAT
CCGGCCGAGG GCTTCCAATA TATGGAGATG CTGCCGATCG AGAACGACCG GCTGTTCATC
GAGGACGTCC GCTACGAAAG CTGGCCGGAG ATGGACATGG CCGACCATGG CAACCGCATC
GTCAACCATG CGGCGCGCTT CGGCTGGCGC ATCCGCAGCT CGGCGCGCGG CCAGGCGGGC
GTCCTGCCGA TCGCGCTGGG CGGCGACTTC GCCGACTATT GGCGATCGGG CGGCGAGGGC
GTCGCCAAGG CGGGCCTGCG CGCCGGGCTG TTCCACCCGG TCACCGGCAA TTCGCTGGCC
GATGCCGCGC GGGTCGCGCG GCTGGTCGCC GAAGCCGACG ACTGGTCGGG CGCGGCGCTC
CACGCCATGC TCCACGACCA TGCCGCGCGC GCCTGGGCGA AGCGGGCCTA TTATCGCCGC
TTCGCCACGC GGATGCTGCG CGATACTCCG CCGGCGGAAG GCTACAGGAT GCTCGAAAGC
CTCTATGCGA TGGACGCCGA CCTGATCGCG CGCTTCCACG CGATGCGCCT CGGCTTTGCC
GACCGCATGG CGCTCAGCCT CGGCGAGGGG CCGATGCCGG TCGGGGCGAT GTTCCGGCGA
TGA
 
Protein sequence
MSGVMCDLAI IGGGLSGGLI ALAARRARPD RRVLLIEAAP TLGGAHLWSF LDSDVEAEDR 
PLLEPLVNYG WREFAVVFPL YKRALPFPFY SIRSDRFDAA LREAMPAESI LTGRRALGVA
PDGVVLEDGT LIGAGGVIDA RGPGDLSLLD LHYRKFVGYE LTLDGPHSVR RATLIDAAAD
PAEGFQYMEM LPIENDRLFI EDVRYESWPE MDMADHGNRI VNHAARFGWR IRSSARGQAG
VLPIALGGDF ADYWRSGGEG VAKAGLRAGL FHPVTGNSLA DAARVARLVA EADDWSGAAL
HAMLHDHAAR AWAKRAYYRR FATRMLRDTP PAEGYRMLES LYAMDADLIA RFHAMRLGFA
DRMALSLGEG PMPVGAMFRR