Gene Information |
Locus tag | Rmet_4881 |
Symbol | catA |
ID | 4041743 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cupriavidus metallidurans CH34 |
Kingdom | Bacteria |
Replicon accession | NC_007974 |
Strand | + |
Start bp | 1543053 |
End bp | 1543976 |
Gene Length | 924 bp |
Protein Length | 307 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 637980302 |
Product | Catechol 1,2-dioxygenase |
Protein accession | YP_587012 |
Protein GI | 94313803 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG3485] Protocatechuate 3,4-dioxygenase beta subunit |
TIGRFAM ID | [TIGR02439] catechol 1,2-dioxygenase, proteobacterial |
|
|
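The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch of that check, assuming the start and end positions are 1-based and inclusive (an assumption consistent with the 924 bp figure, not something the record states) and that the final codon of the CDS is a stop codon. The function and variable names are illustrative only.

```python
# Values copied from the record above; names are hypothetical.
start_bp = 1543053
end_bp = 1543976


def gene_length(start, end):
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(n_bp):
    """Residues encoded by an unspliced CDS whose last codon is a stop."""
    if n_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return n_bp // 3 - 1  # subtract the stop codon


length_bp = gene_length(start_bp, end_bp)
print(length_bp)                  # 924
print(protein_length(length_bp))  # 307
```

Both results agree with the Gene Length and Protein Length rows.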
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 0.405251 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.494696 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGCACG CTGACATCGA AGCGCTGGTA AAGCAGTTCC TCGTACACAC CGCCACGCAA GGCACCCCGG ACGCACGCGC GCAACAGGTC GTGGTACGGC TGACGACCGA TCTGTTCAAG GCAATCGAGG ATCTGGATTT GAGCGCCACC GAAGTCTGGA AGGGCATCGA GTATTTCGCC GAGGCAGGCG CCACCCACGA GCTGGGCCTG CTGGCCGCCG GCCTGGGTCT GGAACGATTC CTGGACATCC GCGCCGACGA GGCAGAAGCC CGCGCCGGCC TGACCGGCGG CACGCCGCGC ACGATCGAAG GCCCGCTGTA TGTGGCCGGC GCGCCCGAAT CCACCGGCTT CGCACGACTC GACGACGGCA CGGAGGACGA CAAGGGCGAG GTGCTGTTCA TGCAGGGCAC GGTCTACGAC ACCGACGGCA AGCCGCTGGC CGGCGCGAAG GTCGAGGTAT GGCACGCCAA CCTGCTTGGC AACTACTCGT TCTTCGACAA GAGCCAGTCC CACTTCAACC TGCGCCGCAC GATCGTCACC GATGCCAACG GCCGCTACCA GTTCCGCAGC ATCGTGCCGA TGGGCTATGG CTGCCCGCCC GAGGGCACGA CGCAGCGCCT GCTGAACCTG CTGGCCCGTC ACGGCCGCCG CCCCGCACAT ATTCACTTCT TCGTATCCGC CGCCGGCCAT CGCAAGCTGA CCACGCAGAT CAACATCGAC GGCGACGAGT ACCTGTGGGA TGACTTCGCG TTTGCCAGCC GCGATGGACT CGTGCCGGCG GTCGAGCGCG TCGGGGACGC CGCGCAGCTC GACAAGCACG GCGTGAGCAA GCCGTTCGCC TCGATCGACT TCGACTTCCG CCTGCTGCGC GAGGCCACCG ACGCCCCGGC GGCCGAAGTG GACCGCCTGC GCGCTGCCGC CTGA
|
Protein sequence | MTHADIEALV KQFLVHTATQ GTPDARAQQV VVRLTTDLFK AIEDLDLSAT EVWKGIEYFA EAGATHELGL LAAGLGLERF LDIRADEAEA RAGLTGGTPR TIEGPLYVAG APESTGFARL DDGTEDDKGE VLFMQGTVYD TDGKPLAGAK VEVWHANLLG NYSFFDKSQS HFNLRRTIVT DANGRYQFRS IVPMGYGCPP EGTTQRLLNL LARHGRRPAH IHFFVSAAGH RKLTTQINID GDEYLWDDFA FASRDGLVPA VERVGDAAQL DKHGVSKPFA SIDFDFRLLR EATDAPAAEV DRLRAAA
|
| |
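The gene and protein sequences above can be cross-checked by translating the DNA. The following is a minimal sketch using only the Python standard library. It builds the codon table from the usual compact TCAG-ordered amino acid string; translation table 11 assigns the same amino acids to codons as the standard code and differs in which codons may act as start codons, which this sketch does not model. The helper names `clean`, `translate`, and `gc_fraction` are hypothetical, and the example uses only the first 30 bases of the gene sequence.

```python
from itertools import product

# Codon table in TCAG order; '*' marks stop codons.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spacing between 10-character blocks and uppercase."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate frame 1 of a DNA string, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna):
    """Fraction of G and C bases in a DNA string."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three 10-base blocks of the gene sequence in the record.
first_30 = "ATGACGCACG CTGACATCGA AGCGCTGGTA"
print(translate(first_30))  # MTHADIEALV
```

The output matches the first block of the protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would allow the Protein sequence and GC content rows to be checked in the same way.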