Gene Information |
Locus tag | Caci_3900 |
Symbol | |
ID | 8335253 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | + |
Start bp | 4425191 |
End bp | 4426498 |
Gene Length | 1308 bp |
Protein Length | 435 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 644957026 |
Product | Dyp-type peroxidase family |
Protein accession | YP_003114629 |
Protein GI | 256393065 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG2837] Predicted iron-dependent peroxidase |
TIGRFAM ID | [TIGR01412] Tat-translocated enzyme; [TIGR01413] Dyp-type peroxidase family |
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 0.0223253 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGACAAG CAGTGGACGA CGAGGCCACG ACCGAGACGG CGGCGGAACC CGGGACGGCC AGCCGGCGGC AGGTGATCGG CCGGGCGATC GGCGCGGCCG GAGTGGTCGC GGTCGGTGGC GTCGGCTACG GCGTCGCCCG GGCCACCGAG CCCGGCGGCA GCCCTTCCCC GCCCGCGTCC TCGGCTTCGG ACATTGTGCC GTTCTACGGC GCCAACCAGG CCGGGATCGC CACGCCGGCC CAGGACCGGC TCGCCTTCGC AGCTTTCGAT GTCACCAGCG GTTCGGCGCA GGCCTTCCAG GTGATGCTGG GAACCTGGGC CGCGGCCGCG GCGCAGATGA CCAAGGGCCT GCCGGTCGGC GCGGTGGACA ACAACCCACA GTCCCCGCCG ATCGACACCG GCGAGGCCGA CGGACTGAGC GCCGCCGGAT TGACGATCAC CGTCGGGTTC GGGCCCTCGC TGTTCGACCA CCGGTTCGGT CTGGCCGGCA AGCGCCCGGC GGCCCTGGCC GACCTGCCGA CCCTGCCCGG CGACGGCTCC CTGCAGCCCG CGCGCAGCGG CGGCGACCTC TGTGTGCAGG CCTGCGCGGA CGACCCGACC GTGGCGTTCC ACGTGATCCG CAACTTCGCC CGGCTGGCTC GCGGCACCGC GGTCATCCGC TGGTCCCAGC TCGGATTCGG CCGCACCTCC TCGACCTCGG ACAGCCAGCA GACCGAGCGC AACCTGATGG GCTTCAAGGA CGGCACCCGC AACATCAAAG CCGAAGCCGC CGACGATCTG CGCGACCACG TCTGGGTCGG CTCCGAGACC GACCAGGCCT GGATGACCGG CGGCAGCTAT CTGGTGGCCC GCCGTATCCG GATGCTGATC GAGTCCTGGG ACACCGACTA CCTGTCCGAC CAGGAGAACG TCTTCGGCCG GTTCAAGACC TCCGGCGCCC CGCTCACCGG CAAGTCCGAG TTCGACACAC CGGACCTGGC CGCCAAGCAC ACCGACGGCA CCCCGGTCAT CCCGCTCAAC GCCCACATCC GGCTGGCCGG CCCGGAGACC AACAACAACC AGAAGATCCT GCGCCGCGGC TACTCCTACA CCGACGGCAT CGACTCCGCC ACCGGCCTGC TCGACGCCGG CCTGTTCTTC CTCGCCTACC AGAAGGACCC GCGCCGGCAG TTCGTCCCGA TCCAGACCCG GCTCGGCCAC CAAGACAACC TGAACGAGTA CATCCGGCAC ACCGGCAGCG CGCTGTTCGC GGTGCCGCCG GGAGTCTCAG CCGCCGGAGA CTGGTGGGGG AAGAGCCTAT TCGCGTGA
|
Protein sequence | MGQAVDDEAT TETAAEPGTA SRRQVIGRAI GAAGVVAVGG VGYGVARATE PGGSPSPPAS SASDIVPFYG ANQAGIATPA QDRLAFAAFD VTSGSAQAFQ VMLGTWAAAA AQMTKGLPVG AVDNNPQSPP IDTGEADGLS AAGLTITVGF GPSLFDHRFG LAGKRPAALA DLPTLPGDGS LQPARSGGDL CVQACADDPT VAFHVIRNFA RLARGTAVIR WSQLGFGRTS STSDSQQTER NLMGFKDGTR NIKAEAADDL RDHVWVGSET DQAWMTGGSY LVARRIRMLI ESWDTDYLSD QENVFGRFKT SGAPLTGKSE FDTPDLAAKH TDGTPVIPLN AHIRLAGPET NNNQKILRRG YSYTDGIDSA TGLLDAGLFF LAYQKDPRRQ FVPIQTRLGH QDNLNEYIRH TGSALFAVPP GVSAAGDWWG KSLFA
|
| |
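The length fields in this record can be cross-checked against the coordinates and the sequence. The following is a minimal sketch, not part of the source database: the helper names (`clean`, `gene_length`, `gc_percent`, `translate`) are hypothetical. It assumes inclusive 1-based coordinates, and it relies on translation table 11 sharing its codon-to-amino-acid assignments with the standard table (the two differ in permitted start codons only).

```python
from itertools import product

# Codon table in NCBI "TCAG" order. Assumption: table 11 assigns the same
# amino acids to codons as the standard table; only start codons differ.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces that group the sequence into blocks of ten."""
    return "".join(seq.split()).upper()


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming both coordinates are inclusive."""
    return end_bp - start_bp + 1


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a non-empty sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


length = gene_length(4425191, 4426498)
print(length)           # 1308, matching "Gene Length"
print(length // 3 - 1)  # 435, matching "Protein Length" (stop codon excluded)

# First three blocks of the gene sequence above.
print(translate("ATGGGACAAG CAGTGGACGA CGAGGCCACG"))  # MGQAVDDEAT
```

Passing the full gene sequence to `translate` and `gc_percent` would allow the same comparison against the listed protein sequence and the GC content field.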