Gene Information |
Locus tag | Caci_3064 |
Symbol | |
ID | 8334416 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | - |
Start bp | 3369834 |
End bp | 3371273 |
Gene Length | 1440 bp |
Protein Length | 479 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 644956211 |
Product | Dyp-type peroxidase family |
Protein accession | YP_003113814 |
Protein GI | 256392250 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG2837] Predicted iron-dependent peroxidase |
TIGRFAM ID | [TIGR01412] Tat-translocated enzyme; [TIGR01413] Dyp-type peroxidase family |
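The coordinate and length fields in this record are mutually consistent, which a short sketch can confirm. The function names are illustrative and not part of any database tooling. The sketch assumes the start and end positions are 1-based and inclusive, and that the CDS ends with one stop codon that is not counted in the protein length.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one untranslated stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the record: Start bp, End bp.
length = gene_length(3369834, 3371273)
print(length, protein_length(length))  # prints "1440 479"
```

The result matches the reported Gene Length (1440 bp) and Protein Length (479 aa). The strand does not affect this arithmetic, since both coordinates refer to the same span on the replicon.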
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.000193803 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 0.0168206 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGGACG ACACCACCGA ATCGCCCCAC GCCGCCCCAG GGCCGGTCAG CCCCGACACC CCGCCGCCAG CTGCCGCCGA GCTCCCGCGC CCGCCCCGCT GGCTCACCGG TCGTCGTCCG GCGGGCACCT CGCCGCGCAA CCCTTCCCGC CGCACCTTCC TGACCACCGG CGCCACCACC ATCGGCGGCG TGGCAGTAGG CGCCACCACC TCCGCCCTCC TCCTCAACGA CAACAGCACG GCCGCCACCC CCGCCGCCGA CGTCATCACC GCCCTCGGCA CCACGACCAT CGCCACCCCC TTCCCCCACC AAGCAGGCAT CTCCATCCCC GCCCGCCAAC AGAGCCACGG CACCGTAGCC GCCTTCGACC TCGCCCCCAA CACCACCCCC GCCCAGCTCA AAGCCCTGAT GCAAGCCTGG ACCGCCGCCA TCGCCGACCT CACCGCCGGC CGCGCTCCCG CCCCCGGCTC CAGCTCCACC GCCTCCCCCA CCCCCGCCCC CGACACCACC ACCCTCGGCA GCGGCCCATG CTCCCTGACC ATCACCGTCG GCATCGGCCC ATCCCTGTTC GGCAAGGCAG GCCTGGACCC CGCCGCCCGC CCCCCGCAGC TCGCCCCCCT CCCCGCCTTC GGCACCGAGC GCCTCGACCC CGCGCGCAGC GACGGCGACC TCGGCGTCGT CCTCGCCGCC GACGACGCGC TCGTCGTCTT CCACGCGCTG CGCGTCCTCA CCCGCGCCGC CGCCGGGACC GCCAAGCCGC GCTGGGTCAT GTCCGGCTTC AGCCGCGCGC CCGGTTCCTC GCCCGACCCC GCCGCCACCG GCCGCAATCT CATGGGCCAG CTCGACGGCA CCAACAACCC CGCCCCCGCG CAGCCGGACT TCGCGGGCAA GGTGTTCGTC CCCGCCGACG CCCCGACCGC CTGGATGCGC GGCGGCTCCT ACCTCGTCTT CCGCCGTATC AGGATGCTGC TGGACTCCTG GGACGCCCAG ACCACCGCCG AGCAGGAACG CGTCATCGGC CGCCACAAGG ACACCGGCGC CCCGCTGTCC GGCGGCACCG AGCACACCCC GGTCAACCTC TCCGGCCAGA ACCCCGACGG CTCCCTCGCC ATCCGCGGCG ACGCCCACAT CCGCCTGGCC GCCGCCGCCG GCAACAGCGG CGCCGCCATG CTCCGCCGCG GCCTGAGCTA CGACGACGGC CTCACCGCCG ACGGCCAACC CAACGCGGGC CTGCTCTTCC TAGCCTGGCA AGCCGACCCG AACCACGGCT TCGTCCCGGT CCAGAAGCAC CTGACCCACT CGATGGACGC CTTGAACCGC TTCACCACCC ACGAGACCAG CGCCCTGTTC GCGATGGTCC CGGCGCCGGT ACCCGGCGGC TACCTCAGCC AGGCGCTCCT AGATCACGCA CTCCTCGACC CGACCAATCA AGGACACTGA
|
Protein sequence | MTDDTTESPH AAPGPVSPDT PPPAAAELPR PPRWLTGRRP AGTSPRNPSR RTFLTTGATT IGGVAVGATT SALLLNDNST AATPAADVIT ALGTTTIATP FPHQAGISIP ARQQSHGTVA AFDLAPNTTP AQLKALMQAW TAAIADLTAG RAPAPGSSST ASPTPAPDTT TLGSGPCSLT ITVGIGPSLF GKAGLDPAAR PPQLAPLPAF GTERLDPARS DGDLGVVLAA DDALVVFHAL RVLTRAAAGT AKPRWVMSGF SRAPGSSPDP AATGRNLMGQ LDGTNNPAPA QPDFAGKVFV PADAPTAWMR GGSYLVFRRI RMLLDSWDAQ TTAEQERVIG RHKDTGAPLS GGTEHTPVNL SGQNPDGSLA IRGDAHIRLA AAAGNSGAAM LRRGLSYDDG LTADGQPNAG LLFLAWQADP NHGFVPVQKH LTHSMDALNR FTTHETSALF AMVPAPVPGG YLSQALLDHA LLDPTNQGH
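The sequences above are printed in space-separated groups of ten characters, so they need cleaning before any analysis. The following is a minimal sketch of that step, with a GC-fraction helper and a basic start/stop codon check. All names are hypothetical. The stop codons are those of translation table 11, and the start codon set is limited to the three most common bacterial starts, which is a simplifying assumption. The example runs on a short excerpt only, so it does not reproduce the 74% GC content reported for the whole gene.

```python
# Commonly used bacterial start codons (assumption: a subset of table 11 starts).
START_CODONS = {"ATG", "GTG", "TTG"}
# Stop codons in translation table 11.
STOP_CODONS = {"TAA", "TAG", "TGA"}


def clean(seq: str) -> str:
    """Remove the whitespace between 10-character groups and upper-case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return (s.count("G") + s.count("C")) / len(s)


def looks_like_cds(seq: str) -> bool:
    """True if length is a multiple of 3 and the ends are start/stop codons."""
    s = clean(seq)
    return len(s) % 3 == 0 and s[:3] in START_CODONS and s[-3:] in STOP_CODONS


# First two groups of the gene sequence, as an excerpt.
excerpt = "ATGACGGACG ACACCACCGA"
print(len(clean(excerpt)))   # prints 20
print(gc_fraction(excerpt))  # prints 0.6
```

Passing the full gene sequence to `gc_fraction` and `looks_like_cds` would give the whole-gene GC fraction and confirm that the listed sequence is in coding orientation, starting with ATG and ending with TGA.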