Gene Caci_4895 details

Gene Information

Locus tag: Caci_4895
Symbol:
ID: 8336249
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Catenulispora acidiphila DSM 44928
Kingdom: Bacteria
Replicon accession: NC_013131
Strand:
Start bp: 5577701
End bp: 5578711
Gene Length: 1011 bp
Protein Length: 336 aa
Translation table: 11
GC content: 68%
IMG OID: 644957994
Product: Vanillate monooxygenase
Protein accession: YP_003115596
Protein GI: 256394032
COG category: [P] Inorganic ion transport and metabolism; [R] General function prediction only
COG ID: [COG4638] Phenylpropionate dioxygenase and related ring-hydroxylating dioxygenases, large terminal subunit
TIGRFAM ID:
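
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The following is a minimal Python sketch, assuming the start and end positions are 1-based and inclusive (the record does not state the convention) and that the CDS ends in a single stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the record.
start_bp = 5577701
end_bp = 5578711
gene_length_bp = 1011
protein_length_aa = 336

# Assumption: coordinates are 1-based and inclusive.
span = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not part of the protein.
codons = span // 3
expected_protein_length = codons - 1

print(span, codons, expected_protein_length)
```

If the assumptions hold, the span should equal the listed gene length, and the codon count minus one should equal the listed protein length.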


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.802026
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 43
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCCACAG CTTTCGCCCG CAACCAGTGG TACGTCGCCG CCTACGCCAG CGAAGTCGGC 
CGCACCTTCC TGGCGCGGAC CATCCTCGGC GAACCGATCG TCTTCTATCG CACCGGCCAG
GACGGCCGCG CCGTGGCACT CGCAGACCGC TGCGTCCACC GCCGCTACCC GCTGTCCGAG
AGCCGTCTGG ACGGAGACAC CATCGTCTGC GGGTACCACG GCTTCACCTA CGACACGTCC
GGGACCTGTG TCTTCGTCCC GGGGCAACAG CGCATCCCGC GCACGGCGCG TGTCGCGTCC
TATCCCGTCG CGGAGCTGGA CTCGTTCGTG TGGGTGTGGA TCGGCGATCC CGAACTCGCC
GACGACAAGC TCATACCGCG GGCTCCGCAC ATGGCCGACC CGGAGTTCGT CACGGTCTCG
GGGATGGAGC CCATCGACTG CGATTACGGC CTGCTCGTCG ACAACCTCCT CGACCTCTCC
CACGAGACCT ACCTGCACGG CGGCTACATC GGGACCCCCG AGGTCGCCGA CACGCCGATC
ACCACCGACG CCGACGAGCA GGCCGGGATC GTCCGCGTCG CCCGGCACAT GGTGGACGCG
GCGTGCCCGC CCTTCTACGC GAAGTCGACC GGCATCCAGG GCCGTATCAC GCGCTTGCAG
GACATCGAGT ACTTCGCCCC TTGCCTGTAC CTCCTGCACA GCCGCATCAC GCCGGCCGGC
GAGCAGAACC CGTTGTTCCG TACCGAGATC ACGTACGCGA TCACGCCTTC CGCGCCCGGA
CAGGTCTACG ACTTCTGGGC TGTCTCGCGG AACTTCGCCA CCGACGACCC GGCGGTCACC
GAGTTCCTGC GCGACTTCAA CCACCAGGTG GTGATGCAGG ATGTCGTCGC GCTGAACATC
CTGCAGAAGG CCCTGGACTC TGAGTCGGCG GGCTACCAGG AGCTCAGCAT CGGTATCGAC
GCCGGCGGCC TGGCCGCGCG CCGGATCCTC GCGCAGCTGG CCGCGCAGTG A
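
The gene sequence is printed in blocks of ten bases separated by spaces. A minimal Python sketch for joining such blocks and computing a GC percentage follows. The helper names `parse_grouped_sequence` and `gc_percent` are hypothetical, and the record does not say how its own GC content figure was calculated, so this is only one plausible way to reproduce it.

```python
def parse_grouped_sequence(text: str) -> str:
    """Join a sequence printed in whitespace-separated blocks into one string."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in seq (0.0 for an empty string)."""
    if not seq:
        return 0.0
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First row of the gene sequence, used here as a small example input.
first_row = "ATGGCCACAG CTTTCGCCCG CAACCAGTGG TACGTCGCCG CCTACGCCAG CGAAGTCGGC"
seq = parse_grouped_sequence(first_row)
print(len(seq), round(gc_percent(seq), 1))
```

Passing the full gene sequence instead of the first row gives a value that can be compared with the GC content listed in the record.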
 
Protein sequence
MATAFARNQW YVAAYASEVG RTFLARTILG EPIVFYRTGQ DGRAVALADR CVHRRYPLSE 
SRLDGDTIVC GYHGFTYDTS GTCVFVPGQQ RIPRTARVAS YPVAELDSFV WVWIGDPELA
DDKLIPRAPH MADPEFVTVS GMEPIDCDYG LLVDNLLDLS HETYLHGGYI GTPEVADTPI
TTDADEQAGI VRVARHMVDA ACPPFYAKST GIQGRITRLQ DIEYFAPCLY LLHSRITPAG
EQNPLFRTEI TYAITPSAPG QVYDFWAVSR NFATDDPAVT EFLRDFNHQV VMQDVVALNI
LQKALDSESA GYQELSIGID AGGLAARRIL AQLAAQ
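
The protein sequence can be checked against the gene sequence by translating the latter. The sketch below builds a codon table by hand rather than relying on a library. It assumes the codon assignments of the standard genetic code, which translation table 11 shares (table 11 differs only in the start codons it permits). The `translate` helper is hypothetical and stops at the first stop codon.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
# Assumption: table 11 uses the same codon assignments as the standard code.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string (whitespace ignored) up to the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X marks an unrecognized codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence from this record.
gene_prefix = "ATGGCCACAG CTTTCGCCCG CAACCAGTGG"
print(translate(gene_prefix))  # prints "MATAFARNQW"
```

The printed peptide matches the first ten residues of the protein sequence in this record. Translating the full gene sequence the same way allows a complete comparison.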