Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Swit_3200 |
Symbol | |
ID | 5198204 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sphingomonas wittichii RW1 |
Kingdom | Bacteria |
Replicon accession | NC_009511 |
Strand | - |
Start bp | 3516709 |
End bp | 3517851 |
Gene Length | 1143 bp |
Protein Length | 380 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 640582746 |
Product | lycopene cyclase |
Protein accession | YP_001263685 |
Protein GI | 148556103 |
COG category | [C] Energy production and conversion |
COG ID | [COG0644] Dehydrogenases (flavoproteins) |
TIGRFAM ID | [TIGR01789] lycopene cyclase [TIGR01790] lycopene cyclase family protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 0.696424 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.730117 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCGGCG TCATGTGCGA TCTCGCCATC ATCGGCGGCG GCCTTTCGGG CGGACTGATC GCATTGGCCG CGCGCCGGGC GCGGCCCGAT CGCCGCGTCC TGCTGATCGA GGCGGCGCCG ACGCTGGGCG GCGCGCACCT CTGGTCCTTC CTCGACAGCG ACGTGGAAGC GGAGGACCGG CCGCTGCTCG AACCGCTGGT CAACTATGGC TGGCGCGAGT TCGCGGTGGT CTTCCCGCTC TACAAGCGGG CGCTGCCCTT CCCCTTCTAC AGCATCCGTT CCGACCGGTT CGACGCGGCG CTGCGCGAGG CGATGCCGGC CGAAAGCATC CTGACCGGGC GGCGCGCGCT GGGCGTCGCC CCCGACGGGG TGGTGCTGGA GGACGGGACG CTGATCGGGG CGGGCGGCGT GATCGACGCG CGCGGGCCGG GCGACCTGTC GCTGCTAGAC CTCCATTACC GCAAGTTCGT CGGCTATGAG CTGACCCTCG ACGGCCCGCA CAGCGTGCGC CGCGCGACGC TGATCGACGC CGCCGCCGAT CCGGCCGAGG GCTTCCAATA TATGGAGATG CTGCCGATCG AGAACGACCG GCTGTTCATC GAGGACGTCC GCTACGAAAG CTGGCCGGAG ATGGACATGG CCGACCATGG CAACCGCATC GTCAACCATG CGGCGCGCTT CGGCTGGCGC ATCCGCAGCT CGGCGCGCGG CCAGGCGGGC GTCCTGCCGA TCGCGCTGGG CGGCGACTTC GCCGACTATT GGCGATCGGG CGGCGAGGGC GTCGCCAAGG CGGGCCTGCG CGCCGGGCTG TTCCACCCGG TCACCGGCAA TTCGCTGGCC GATGCCGCGC GGGTCGCGCG GCTGGTCGCC GAAGCCGACG ACTGGTCGGG CGCGGCGCTC CACGCCATGC TCCACGACCA TGCCGCGCGC GCCTGGGCGA AGCGGGCCTA TTATCGCCGC TTCGCCACGC GGATGCTGCG CGATACTCCG CCGGCGGAAG GCTACAGGAT GCTCGAAAGC CTCTATGCGA TGGACGCCGA CCTGATCGCG CGCTTCCACG CGATGCGCCT CGGCTTTGCC GACCGCATGG CGCTCAGCCT CGGCGAGGGG CCGATGCCGG TCGGGGCGAT GTTCCGGCGA TGA
|
Protein sequence | MSGVMCDLAI IGGGLSGGLI ALAARRARPD RRVLLIEAAP TLGGAHLWSF LDSDVEAEDR PLLEPLVNYG WREFAVVFPL YKRALPFPFY SIRSDRFDAA LREAMPAESI LTGRRALGVA PDGVVLEDGT LIGAGGVIDA RGPGDLSLLD LHYRKFVGYE LTLDGPHSVR RATLIDAAAD PAEGFQYMEM LPIENDRLFI EDVRYESWPE MDMADHGNRI VNHAARFGWR IRSSARGQAG VLPIALGGDF ADYWRSGGEG VAKAGLRAGL FHPVTGNSLA DAARVARLVA EADDWSGAAL HAMLHDHAAR AWAKRAYYRR FATRMLRDTP PAEGYRMLES LYAMDADLIA RFHAMRLGFA DRMALSLGEG PMPVGAMFRR
|
| |