Gene Caci_3064 details


Gene Information

Locus tag: Caci_3064
Symbol:
ID: 8334416
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Catenulispora acidiphila DSM 44928
Kingdom: Bacteria
Replicon accession: NC_013131
Strand:
Start bp: 3369834
End bp: 3371273
Gene Length: 1440 bp
Protein Length: 479 aa
Translation table: 11
GC content: 74%
IMG OID: 644956211
Product: Dyp-type peroxidase family
Protein accession: YP_003113814
Protein GI: 256392250
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG2837] Predicted iron-dependent peroxidase
TIGRFAM ID: [TIGR01412] Tat-translocated enzyme
            [TIGR01413] Dyp-type peroxidase family
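
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes one terminal stop codon; the function names are hypothetical and are not part of any database API.

```python
def feature_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons in the CDS minus one terminal stop codon (assumed)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Values taken from the Gene Information fields of this record.
gene_length = feature_length(3369834, 3371273)
print(gene_length)                           # 1440
print(expected_protein_length(gene_length))  # 479
```

Both results match the listed Gene Length (1440 bp) and Protein Length (479 aa).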


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000193803
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value: 0.0168206
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGGACG ACACCACCGA ATCGCCCCAC GCCGCCCCAG GGCCGGTCAG CCCCGACACC 
CCGCCGCCAG CTGCCGCCGA GCTCCCGCGC CCGCCCCGCT GGCTCACCGG TCGTCGTCCG
GCGGGCACCT CGCCGCGCAA CCCTTCCCGC CGCACCTTCC TGACCACCGG CGCCACCACC
ATCGGCGGCG TGGCAGTAGG CGCCACCACC TCCGCCCTCC TCCTCAACGA CAACAGCACG
GCCGCCACCC CCGCCGCCGA CGTCATCACC GCCCTCGGCA CCACGACCAT CGCCACCCCC
TTCCCCCACC AAGCAGGCAT CTCCATCCCC GCCCGCCAAC AGAGCCACGG CACCGTAGCC
GCCTTCGACC TCGCCCCCAA CACCACCCCC GCCCAGCTCA AAGCCCTGAT GCAAGCCTGG
ACCGCCGCCA TCGCCGACCT CACCGCCGGC CGCGCTCCCG CCCCCGGCTC CAGCTCCACC
GCCTCCCCCA CCCCCGCCCC CGACACCACC ACCCTCGGCA GCGGCCCATG CTCCCTGACC
ATCACCGTCG GCATCGGCCC ATCCCTGTTC GGCAAGGCAG GCCTGGACCC CGCCGCCCGC
CCCCCGCAGC TCGCCCCCCT CCCCGCCTTC GGCACCGAGC GCCTCGACCC CGCGCGCAGC
GACGGCGACC TCGGCGTCGT CCTCGCCGCC GACGACGCGC TCGTCGTCTT CCACGCGCTG
CGCGTCCTCA CCCGCGCCGC CGCCGGGACC GCCAAGCCGC GCTGGGTCAT GTCCGGCTTC
AGCCGCGCGC CCGGTTCCTC GCCCGACCCC GCCGCCACCG GCCGCAATCT CATGGGCCAG
CTCGACGGCA CCAACAACCC CGCCCCCGCG CAGCCGGACT TCGCGGGCAA GGTGTTCGTC
CCCGCCGACG CCCCGACCGC CTGGATGCGC GGCGGCTCCT ACCTCGTCTT CCGCCGTATC
AGGATGCTGC TGGACTCCTG GGACGCCCAG ACCACCGCCG AGCAGGAACG CGTCATCGGC
CGCCACAAGG ACACCGGCGC CCCGCTGTCC GGCGGCACCG AGCACACCCC GGTCAACCTC
TCCGGCCAGA ACCCCGACGG CTCCCTCGCC ATCCGCGGCG ACGCCCACAT CCGCCTGGCC
GCCGCCGCCG GCAACAGCGG CGCCGCCATG CTCCGCCGCG GCCTGAGCTA CGACGACGGC
CTCACCGCCG ACGGCCAACC CAACGCGGGC CTGCTCTTCC TAGCCTGGCA AGCCGACCCG
AACCACGGCT TCGTCCCGGT CCAGAAGCAC CTGACCCACT CGATGGACGC CTTGAACCGC
TTCACCACCC ACGAGACCAG CGCCCTGTTC GCGATGGTCC CGGCGCCGGT ACCCGGCGGC
TACCTCAGCC AGGCGCTCCT AGATCACGCA CTCCTCGACC CGACCAATCA AGGACACTGA
 
Protein sequence
MTDDTTESPH AAPGPVSPDT PPPAAAELPR PPRWLTGRRP AGTSPRNPSR RTFLTTGATT 
IGGVAVGATT SALLLNDNST AATPAADVIT ALGTTTIATP FPHQAGISIP ARQQSHGTVA
AFDLAPNTTP AQLKALMQAW TAAIADLTAG RAPAPGSSST ASPTPAPDTT TLGSGPCSLT
ITVGIGPSLF GKAGLDPAAR PPQLAPLPAF GTERLDPARS DGDLGVVLAA DDALVVFHAL
RVLTRAAAGT AKPRWVMSGF SRAPGSSPDP AATGRNLMGQ LDGTNNPAPA QPDFAGKVFV
PADAPTAWMR GGSYLVFRRI RMLLDSWDAQ TTAEQERVIG RHKDTGAPLS GGTEHTPVNL
SGQNPDGSLA IRGDAHIRLA AAAGNSGAAM LRRGLSYDDG LTADGQPNAG LLFLAWQADP
NHGFVPVQKH LTHSMDALNR FTTHETSALF AMVPAPVPGG YLSQALLDHA LLDPTNQGH
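
The sequences above are printed in space-separated blocks of ten residues, so they need to be joined before any downstream use. The sketch below shows one way to do that and to compute a GC fraction for a nucleotide string. It is illustrative only: the helper names are hypothetical, and how the record's own GC content figure was calculated is not stated here.

```python
def join_blocks(blocked: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence from this record, used as a short example.
first_line = "ATGACGGACG ACACCACCGA ATCGCCCCAC GCCGCCCCAG GGCCGGTCAG CCCCGACACC"
seq = join_blocks(first_line)
print(len(seq))  # 60
print(f"{gc_fraction(seq):.2f}")
```

Passing the full gene sequence through the same two helpers should give a length that can be compared with the Gene Length field and a GC fraction that can be compared with the GC content field.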