Gene Information |
Locus tag | Swit_1543 |
Symbol | |
ID | 5198875 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sphingomonas wittichii RW1 |
Kingdom | Bacteria |
Replicon accession | NC_009511 |
Strand | - |
Start bp | 1717567 |
End bp | 1718667 |
Gene Length | 1101 bp |
Protein Length | 366 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 640581091 |
Product | vanillate monooxygenase |
Protein accession | YP_001262044 |
Protein GI | 148554462 |
COG category | [P] Inorganic ion transport and metabolism [R] General function prediction only |
COG ID | [COG4638] Phenylpropionate dioxygenase and related ring-hydroxylating dioxygenases, large terminal subunit |
TIGRFAM ID | |
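The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is only illustrative: the function names are hypothetical, and it assumes the start and end positions are 1-based and inclusive and that the reported gene length includes the stop codon (the listed gene sequence does end in TAG).

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the stop codon
    is counted in the gene length but not in the protein."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length(1717567, 1718667)
print(length, protein_length(length))  # prints "1101 366"
```

Both results match the Gene Length (1101 bp) and Protein Length (366 aa) fields.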
| |
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 0.111008 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 4 |
Fosmid unclonability p-value | 0.00152746 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | TTGAGCGAAG CCGAGGTTGC GAAGCTATCC GACCGTTTCG AAATATCCCG GCGCAGCGTC GCGAACGCGG AAAGCGGGTT CATCCTGAAC GCTTGGTATA TCGCCGGCCT GTCCAGCGAT TTCGACCGGA AGCTCTCGAC GCGCACGATA TGCGGAAAGT CGATCCTGTT CTTCCGCGAC GCGGACGGCG GCGTCCATGC CATGCAGAAC CGCTGCGGGC ACCGCTCCTA CCCTCTCTCC GCCGGCACGC TCGACGGCAA TGAGATCGTC TGCGGCTATC ATGGCCTCAG ATACAATGCG CAGGGCATCT GCGTGCAGCT TCCCGATGGC GGCTGCCCGC GCGGCAGGTT CGGGGTGCGG AGCTATCCGC TGGTCGAGCG GGCCTCGTTC GTCTGGATCT GGGTCGGCGA TGCCGAACTG GCCAAGTCCA CGCCCGTGCC GCATCCCGAA TGGCTGGGCG GGGAAGGCTG GTCCTTCGTC ACCGGCTACA GCCATGCCGA AGGCAGCTAT GTCCATCTGC ACGAGAATCT GCTCGACCTG TCGCACCTGA CCTATCTGCA TGCGGCGACG TTCGGCACGC CCGAGTTCGC GCTCGCCCCG ATCGAGACCG AGATCAAGGA TGACGACATC CAGGTCTGGC GGAACGTCGA ATGCCAGTTG CCGCCGATCT ATTCGATCCC GCTGGGATGG GAAGGCCAGC GCGCGCTGCG AAGGTCCGGG TCACAGCTCG TTTCGCCCGG CCTCCATGTG AACACCGGCA TTTTCGAGAA TCTCGAACTG GCCGAGCAGC CCGTGCCGAA GCCGATGGTG AAGGTGGCGC AGCTCATCAC GCCGGAAACC CAGCATTCGA TTCATTATCA TTATGCGGTG GCCCGCAACT TCGCGCTCGA CGATGACGCC GTCGGCGATC ATCTGCTCAA GGGATCGCAG GCGGCGTTCA GGGAGGACAT CGAGGCGCTG CGCCGGATCA CGCAGATGCA CGCCGAGGCG GGCCCCGGGG GCGAAGTGTT CGAGTTCGAC ATCCCGACCG ACAGGGCGGG GCTCGAGATG CGGCGCCGCT TCAAGAAGTT GATCGATCTC GAGGAAGCAG CGGACGCCTA G
|
Protein sequence | MSEAEVAKLS DRFEISRRSV ANAESGFILN AWYIAGLSSD FDRKLSTRTI CGKSILFFRD ADGGVHAMQN RCGHRSYPLS AGTLDGNEIV CGYHGLRYNA QGICVQLPDG GCPRGRFGVR SYPLVERASF VWIWVGDAEL AKSTPVPHPE WLGGEGWSFV TGYSHAEGSY VHLHENLLDL SHLTYLHAAT FGTPEFALAP IETEIKDDDI QVWRNVECQL PPIYSIPLGW EGQRALRRSG SQLVSPGLHV NTGIFENLEL AEQPVPKPMV KVAQLITPET QHSIHYHYAV ARNFALDDDA VGDHLLKGSQ AAFREDIEAL RRITQMHAEA GPGGEVFEFD IPTDRAGLEM RRRFKKLIDL EEAADA
|
| |
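The sequence fields are printed in space-separated blocks of ten characters, so they need cleaning before any downstream use. The following is a minimal sketch, with hypothetical helper names, of stripping that grouping and computing a GC fraction; it is not the method the database used to derive the GC content field.

```python
def clean_sequence(field: str) -> str:
    """Remove the whitespace grouping from a sequence field and uppercase it."""
    return "".join(field.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First two blocks of the gene sequence, used here as a short example.
example = "TTGAGCGAAG CCGAGGTTGC"
seq = clean_sequence(example)
print(len(seq), gc_fraction(seq))
```

Passing the full Gene sequence field through `clean_sequence` should give a string whose length can be compared against the Gene Length field, and whose GC fraction can be compared against the GC content field.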