Gene Information |
Locus tag | Swit_3396 |
Symbol | |
ID | 5201010 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sphingomonas wittichii RW1 |
Kingdom | Bacteria |
Replicon accession | NC_009511 |
Strand | + |
Start bp | 3726429 |
End bp | 3727475 |
Gene Length | 1047 bp |
Protein Length | 348 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 640582943 |
Product | vanillate monooxygenase |
Protein accession | YP_001263880 |
Protein GI | 148556298 |
COG category | [P] Inorganic ion transport and metabolism; [R] General function prediction only |
COG ID | [COG4638] Phenylpropionate dioxygenase and related ring-hydroxylating dioxygenases, large terminal subunit |
TIGRFAM ID | |
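The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the CDS length includes the stop codon; the record does not state either convention, and the function names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 3726429
end_bp = 3727475
gene_length_bp = 1047
protein_length_aa = 348


def span_length(start, end):
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Residues encoded by an unspliced CDS, assuming the final codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both checks hold for this record under the stated assumptions.
coordinates_consistent = span_length(start_bp, end_bp) == gene_length_bp
protein_consistent = expected_protein_length(gene_length_bp) == protein_length_aa
```

Under these assumptions, 3727475 - 3726429 + 1 gives 1047 bp, and 1047 / 3 - 1 gives 348 aa, matching the Gene Length and Protein Length fields.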
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.00111775 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.0140003 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCTTTTG TCAGAAACTG CTGGTACGTC GCCTTGTGGA GCGAGGACCT CCACAGCGGC GCGATCGAAA ACCGTATCAT ATTGTCGGAG CCGGTAGTGT TCTACCGCGA CCAGCAGGGG CAGCCTGTCG CCCTGTTCGA TATCTGTCCG CATCGCTTCG CGCCGCTCAG CAAGGGACGA TTGCTGCCGG GCGGTGGCAT CGCCTGCGGC TATCACGGGC TTGAATTCGG TGCCGACGGC ACCTGCATTC GCAACCCGCA TGGCGATCGC ATCCCCAATC GCGCGCAGGT GAAACGCTAT CCGCTGGTCG AACAAGACAG CCTGATCTGG ATCTGGATGG GGGATCAGAC GCCTGATCCC GCCCTCATTC CGCGTTATGA TTTTCTCGAT CCGGTCAACG GCTACACCGT CACCAGCCGC GAAACGCTGA CCATGCCGTG CGACTATCGG CTGATCGTCG ATAATCTGCT CGACCTCAGC CACATCAACT TCCTGCATGA CGGCATTCTT GGCCATGCCG ATGCGCCACG CGCGGACATC AGCGTCACGA CCGGCGACCG CTGGGTGGAG GTCACGCGCA TTTCACGCAA CCTTCCGGTT CCCGCGCTGT TCGCGATGCT GCTCAATGAT GGATCGGAGC GTGGCGACCT ATGGAACACG ATCCGCTGGG ATCTTGCCGG CTGCCTCAAA AACGATGCGC TTGTCTGTCC GGTCGGCGCC GATCGCAACG ACGGCGACGG CATTTTCGGC GCCCATCTGC TGACGCCGGT CACCGAAGAG ACCACGCTCT ACCATATCGC CGCCGCCCGC CAGAAGACCG CGCGGGCCGG GTCGGAAACG AGCGAAGAGG TGCGGCAGCG CCTGGGCGAG CTTCGCCGGT TTGCCTTTGA AATGCAGGAT CAGCCGATGA TCGCCGCACA AGCGGAGATC ATGCGCAAAT ATCCCGAAGC GACCCACCAC CCGGTCTTTC TGGAGATCGA CACCGGCCCG GCGCAGGCGC AGGCGATCAT CAAACGCCAC ATCGCAGCGG AAAACAGCCC GGCCTGA
|
Protein sequence | MPFVRNCWYV ALWSEDLHSG AIENRIILSE PVVFYRDQQG QPVALFDICP HRFAPLSKGR LLPGGGIACG YHGLEFGADG TCIRNPHGDR IPNRAQVKRY PLVEQDSLIW IWMGDQTPDP ALIPRYDFLD PVNGYTVTSR ETLTMPCDYR LIVDNLLDLS HINFLHDGIL GHADAPRADI SVTTGDRWVE VTRISRNLPV PALFAMLLND GSERGDLWNT IRWDLAGCLK NDALVCPVGA DRNDGDGIFG AHLLTPVTEE TTLYHIAAAR QKTARAGSET SEEVRQRLGE LRRFAFEMQD QPMIAAQAEI MRKYPEATHH PVFLEIDTGP AQAQAIIKRH IAAENSPA
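The gene and protein sequences above are displayed in space-separated blocks of 10 characters. A minimal sketch for joining such a field into one contiguous string before further analysis follows; the helper name `ungroup` is hypothetical, and only the first three blocks of each field are used to keep the example short.

```python
def ungroup(sequence_field):
    """Remove the display whitespace between blocks of a sequence field."""
    return "".join(sequence_field.split())


# First three blocks of the Gene sequence and Protein sequence fields.
gene_prefix = ungroup("ATGCCTTTTG TCAGAAACTG CTGGTACGTC")
protein_prefix = ungroup("MPFVRNCWYV ALWSEDLHSG AIENRIILSE")

# 30 nucleotides and 30 residues once the spacing is removed.
gene_prefix_length = len(gene_prefix)
protein_prefix_length = len(protein_prefix)
```

Applying the same helper to the full fields should yield strings whose lengths can be compared against the Gene Length and Protein Length values listed in the record.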