Gene Information |
Locus tag | Caci_4895 |
Symbol | |
ID | 8336249 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | - |
Start bp | 5577701 |
End bp | 5578711 |
Gene Length | 1011 bp |
Protein Length | 336 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 644957994 |
Product | Vanillate monooxygenase |
Protein accession | YP_003115596 |
Protein GI | 256394032 |
COG category | [P] Inorganic ion transport and metabolism; [R] General function prediction only |
COG ID | [COG4638] Phenylpropionate dioxygenase and related ring-hydroxylating dioxygenases, large terminal subunit |
TIGRFAM ID | |
| |
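The coordinate and length fields in the Gene Information table are related by simple arithmetic. The following is a minimal Python sketch of that relationship, assuming the start and end positions are 1-based and inclusive and that the protein length excludes the stop codon. Both conventions are assumptions that the listed values happen to be consistent with, and the function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length from 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino acids encoded by an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length(5577701, 5578711)
print(length, protein_length(length))
```

With the values from this record, the sketch prints `1011 336`, matching the Gene Length and Protein Length fields.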
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.802026 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 43 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCACAG CTTTCGCCCG CAACCAGTGG TACGTCGCCG CCTACGCCAG CGAAGTCGGC CGCACCTTCC TGGCGCGGAC CATCCTCGGC GAACCGATCG TCTTCTATCG CACCGGCCAG GACGGCCGCG CCGTGGCACT CGCAGACCGC TGCGTCCACC GCCGCTACCC GCTGTCCGAG AGCCGTCTGG ACGGAGACAC CATCGTCTGC GGGTACCACG GCTTCACCTA CGACACGTCC GGGACCTGTG TCTTCGTCCC GGGGCAACAG CGCATCCCGC GCACGGCGCG TGTCGCGTCC TATCCCGTCG CGGAGCTGGA CTCGTTCGTG TGGGTGTGGA TCGGCGATCC CGAACTCGCC GACGACAAGC TCATACCGCG GGCTCCGCAC ATGGCCGACC CGGAGTTCGT CACGGTCTCG GGGATGGAGC CCATCGACTG CGATTACGGC CTGCTCGTCG ACAACCTCCT CGACCTCTCC CACGAGACCT ACCTGCACGG CGGCTACATC GGGACCCCCG AGGTCGCCGA CACGCCGATC ACCACCGACG CCGACGAGCA GGCCGGGATC GTCCGCGTCG CCCGGCACAT GGTGGACGCG GCGTGCCCGC CCTTCTACGC GAAGTCGACC GGCATCCAGG GCCGTATCAC GCGCTTGCAG GACATCGAGT ACTTCGCCCC TTGCCTGTAC CTCCTGCACA GCCGCATCAC GCCGGCCGGC GAGCAGAACC CGTTGTTCCG TACCGAGATC ACGTACGCGA TCACGCCTTC CGCGCCCGGA CAGGTCTACG ACTTCTGGGC TGTCTCGCGG AACTTCGCCA CCGACGACCC GGCGGTCACC GAGTTCCTGC GCGACTTCAA CCACCAGGTG GTGATGCAGG ATGTCGTCGC GCTGAACATC CTGCAGAAGG CCCTGGACTC TGAGTCGGCG GGCTACCAGG AGCTCAGCAT CGGTATCGAC GCCGGCGGCC TGGCCGCGCG CCGGATCCTC GCGCAGCTGG CCGCGCAGTG A
|
Protein sequence | MATAFARNQW YVAAYASEVG RTFLARTILG EPIVFYRTGQ DGRAVALADR CVHRRYPLSE SRLDGDTIVC GYHGFTYDTS GTCVFVPGQQ RIPRTARVAS YPVAELDSFV WVWIGDPELA DDKLIPRAPH MADPEFVTVS GMEPIDCDYG LLVDNLLDLS HETYLHGGYI GTPEVADTPI TTDADEQAGI VRVARHMVDA ACPPFYAKST GIQGRITRLQ DIEYFAPCLY LLHSRITPAG EQNPLFRTEI TYAITPSAPG QVYDFWAVSR NFATDDPAVT EFLRDFNHQV VMQDVVALNI LQKALDSESA GYQELSIGID AGGLAARRIL AQLAAQ
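The sequences above are displayed in space-separated groups of 10 characters, so they need to be joined before any analysis. The following Python sketch shows one way to do that and to run a few basic checks. The helper names are hypothetical, the start-codon check is a simplification (translation table 11 also permits alternative start codons), and the example input is a toy string, not the gene from this record.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons used by translation table 11


def strip_groups(seq: str) -> str:
    """Remove the whitespace that splits a sequence into 10-character groups."""
    return "".join(seq.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA string (0.0 for an empty string)."""
    if not dna:
        return 0.0
    return sum(base in "GC" for base in dna) / len(dna)


def looks_like_cds(dna: str) -> bool:
    """Rough check: length divisible by 3, ATG start, stop codon at the end."""
    return len(dna) % 3 == 0 and dna.startswith("ATG") and dna[-3:] in STOP_CODONS


# Toy input for illustration only.
toy = strip_groups("ATGGCCACAG CTTGA")
print(len(toy), looks_like_cds(toy), gc_fraction(toy))
```

Applying `strip_groups` to the Gene sequence field and passing the result to `gc_fraction` would give a value to compare against the GC content field, which is reported only as a rounded percentage.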