Gene Information |
Locus tag | Caul_3246 |
Symbol | |
ID | 5900701 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | - |
Start bp | 3509529 |
End bp | 3510572 |
Gene Length | 1044 bp |
Protein Length | 347 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 641563751 |
Product | vanillate monooxygenase |
Protein accession | YP_001684871 |
Protein GI | 167647208 |
COG category | [P] Inorganic ion transport and metabolism; [R] General function prediction only |
COG ID | [COG4638] Phenylpropionate dioxygenase and related ring-hydroxylating dioxygenases, large terminal subunit |
TIGRFAM ID | |
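The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions on the replicon, and that Protein Length excludes the stop codon. Neither convention is stated in the record, but both agree with its numbers. The function names are illustrative only.

```python
# Values copied from the record above.
START_BP = 3509529
END_BP = 3510572
GENE_LENGTH_BP = 1044
PROTEIN_LENGTH_AA = 347


def span_length(start: int, end: int) -> int:
    """Length of a 1-based, inclusive interval (assumed convention)."""
    return end - start + 1


def coding_residues(nt_length: int) -> int:
    """Residues encoded by a CDS of nt_length bases, excluding the stop codon."""
    if nt_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return nt_length // 3 - 1


print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)        # True
print(coding_residues(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)   # True
```

Because the gene lies on the minus strand, the coding sequence corresponds to the reverse complement of replicon positions 3509529 to 3510572 under the same assumed convention.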
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 0.546586 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.604076 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCGAAC GCTACCTTCA GGACGCCTGG TATCAAGCCG CCTTCGCGCG CGAAATCGGC GACAAGCCCC TGGCGCGGAC CCTGCTCGAC GTCCCGATCG TCCTTTATCG CTCGGCCGGG AAGGTCGTGG CCCTGCATGA TCGCTGCCCG CACCGCTTTG CGCCCCTGTC GATGGGCGCG TTGCGCGACG GACAGATCAT CTGCGGCTAC CACGGCGTCG GCTTCGGCGA CGACGGCCGC TGCGTGCATA ATCCCCACGG CGCGCTCCCC AGGGCGATGA GGGTTCGTGC CTATCCAGTT GTCGAGCGCC ACACGGTCGT CTGGATCTGG ATGGGTGATA CGCGCAAGGC CGATCCTGCC CTGATACCCG ACCTGTCGTT CATCGACGAG ACGCCCGAAA CCGCGCGGAT CTGCTTCTAC ATCCCCACCG CGGCCAACTA CCGGCTGGTC GTCGACAATC TGATGGACCT CAGCCACGCC GATTACCTTC ACCCCACTTC GCTTGGCGGC GTGATGACCG GCGCGGAGGC CAGCACCGAG CTGGCCGCCG ACGGGGTCGT CAACACCTGG ATCAACCGGA ACTGCCTGGC TCCGGCGCGC TTTCACGCCA GGGTGCCGCC GCCGCAACGC GCCGACGCCT GGACTGAAGC GACCTGGCGA GCGCCGGCGA TCATGGTCAT CGGCACGGCC TTGGCGCCGG CCGGCGAGCC GCGGCGACGG GAGGACGAGA TCTGGGCCCT GCACAGCATG ACGCCCGAGA CCGCCTCGAC CACGCACTAC TTCGTCTGCG GCACCCGGGG GGAGCGCCTG GACGACGTGG AGTATTCCGA ACGCCTGCGA GGAATGCTGG CCAACGCCTT CATCAACGAG GACAAGCCCA TGCTCGAGGC GCAGCAGGCG CGGATGGGCG GTGCGTCGCT CTCGAGCCTG CGACCCGTCT TGCTCGCGGT CGACGCCGGT GCGATGCAGG CTCGCGCCCA ATTGGACAGG ATGATCGCCC TGGAGCAGGA CCCGTCCGCG CCCCCAGCCC GAGACGTCGC TTGA
|
Protein sequence | MSERYLQDAW YQAAFAREIG DKPLARTLLD VPIVLYRSAG KVVALHDRCP HRFAPLSMGA LRDGQIICGY HGVGFGDDGR CVHNPHGALP RAMRVRAYPV VERHTVVWIW MGDTRKADPA LIPDLSFIDE TPETARICFY IPTAANYRLV VDNLMDLSHA DYLHPTSLGG VMTGAEASTE LAADGVVNTW INRNCLAPAR FHARVPPPQR ADAWTEATWR APAIMVIGTA LAPAGEPRRR EDEIWALHSM TPETASTTHY FVCGTRGERL DDVEYSERLR GMLANAFINE DKPMLEAQQA RMGGASLSSL RPVLLAVDAG AMQARAQLDR MIALEQDPSA PPARDVA
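The relationship between the gene and protein sequences can be illustrated with a minimal translation sketch. It uses the standard codon assignments, which translation table 11 shares for internal codons. Table 11's alternative start codons are not modelled here, which is an acceptable simplification for this gene because it begins with ATG. The helper names are hypothetical, and only the first and last few codons of the gene are used to keep the example short.

```python
# Standard codon assignments in NCBI order (first, second, third base over T, C, A, G).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spacing used in the record and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 and last 24 bases of the gene sequence above.
head = "ATGTCCGAAC GCTACCTTCA GGACGCCTGG"
tail = "CCCCCAGCCC GAGACGTCGC TTGA"
print(translate(head))  # MSERYLQDAW
print(translate(tail))  # PPARDVA
```

The two outputs match the first ten and last seven residues of the protein sequence in the record. Passing the full gene sequence to `gc_content` gives a value that can be compared with the GC content field.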