Gene Caul_3246 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_3246 
Symbol 
ID5900701 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp3509529 
End bp3510572 
Gene Length1044 bp 
Protein Length347 aa 
Translation table11 
GC content68% 
IMG OID641563751 
Productvanillate monooxygenase 
Protein accessionYP_001684871 
Protein GI167647208 
COG category[P] Inorganic ion transport and metabolism
[R] General function prediction only 
COG ID[COG4638] Phenylpropionate dioxygenase and related ring-hydroxylating dioxygenases, large terminal subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value0.546586 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.604076 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCGAAC GCTACCTTCA GGACGCCTGG TATCAAGCCG CCTTCGCGCG CGAAATCGGC 
GACAAGCCCC TGGCGCGGAC CCTGCTCGAC GTCCCGATCG TCCTTTATCG CTCGGCCGGG
AAGGTCGTGG CCCTGCATGA TCGCTGCCCG CACCGCTTTG CGCCCCTGTC GATGGGCGCG
TTGCGCGACG GACAGATCAT CTGCGGCTAC CACGGCGTCG GCTTCGGCGA CGACGGCCGC
TGCGTGCATA ATCCCCACGG CGCGCTCCCC AGGGCGATGA GGGTTCGTGC CTATCCAGTT
GTCGAGCGCC ACACGGTCGT CTGGATCTGG ATGGGTGATA CGCGCAAGGC CGATCCTGCC
CTGATACCCG ACCTGTCGTT CATCGACGAG ACGCCCGAAA CCGCGCGGAT CTGCTTCTAC
ATCCCCACCG CGGCCAACTA CCGGCTGGTC GTCGACAATC TGATGGACCT CAGCCACGCC
GATTACCTTC ACCCCACTTC GCTTGGCGGC GTGATGACCG GCGCGGAGGC CAGCACCGAG
CTGGCCGCCG ACGGGGTCGT CAACACCTGG ATCAACCGGA ACTGCCTGGC TCCGGCGCGC
TTTCACGCCA GGGTGCCGCC GCCGCAACGC GCCGACGCCT GGACTGAAGC GACCTGGCGA
GCGCCGGCGA TCATGGTCAT CGGCACGGCC TTGGCGCCGG CCGGCGAGCC GCGGCGACGG
GAGGACGAGA TCTGGGCCCT GCACAGCATG ACGCCCGAGA CCGCCTCGAC CACGCACTAC
TTCGTCTGCG GCACCCGGGG GGAGCGCCTG GACGACGTGG AGTATTCCGA ACGCCTGCGA
GGAATGCTGG CCAACGCCTT CATCAACGAG GACAAGCCCA TGCTCGAGGC GCAGCAGGCG
CGGATGGGCG GTGCGTCGCT CTCGAGCCTG CGACCCGTCT TGCTCGCGGT CGACGCCGGT
GCGATGCAGG CTCGCGCCCA ATTGGACAGG ATGATCGCCC TGGAGCAGGA CCCGTCCGCG
CCCCCAGCCC GAGACGTCGC TTGA
 
Protein sequence
MSERYLQDAW YQAAFAREIG DKPLARTLLD VPIVLYRSAG KVVALHDRCP HRFAPLSMGA 
LRDGQIICGY HGVGFGDDGR CVHNPHGALP RAMRVRAYPV VERHTVVWIW MGDTRKADPA
LIPDLSFIDE TPETARICFY IPTAANYRLV VDNLMDLSHA DYLHPTSLGG VMTGAEASTE
LAADGVVNTW INRNCLAPAR FHARVPPPQR ADAWTEATWR APAIMVIGTA LAPAGEPRRR
EDEIWALHSM TPETASTTHY FVCGTRGERL DDVEYSERLR GMLANAFINE DKPMLEAQQA
RMGGASLSSL RPVLLAVDAG AMQARAQLDR MIALEQDPSA PPARDVA