Gene Caci_4595 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_4595 
Symbol 
ID8335949 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp5226919 
End bp5227989 
Gene Length1071 bp 
Protein Length356 aa 
Translation table11 
GC content70% 
IMG OID644957696 
ProductDyp-type peroxidase family 
Protein accessionYP_003115298 
Protein GI256393734 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG2837] Predicted iron-dependent peroxidase 
TIGRFAM ID[TIGR01413] Dyp-type peroxidase family 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.264848 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.006941 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCCCAAGC TTTCCCCCGC CGACGCCGCC CAGCCGGTGG TCAGCCCTCT GACCTCGGCG 
GCGATGTTCC TGGTGCTGAC CATCGAGGAC GGCGGCGAGG ACGCGGTCCG CGAGGTGGTC
CCCGACCTGA GCGCGTACGC GCGCGCCGTG GGGTTCGGGT ATCCGGAGGG CGGGCTGGCG
TGCGTCACCG GGTTCGGCTC GGCGGCGTGG GACCGGTTGT TCGGCGGTCC GCGGCCCGCC
GAGCTGCATC CGTTCGTCGC GCTGGACGGT CCCCGGCACT CGGCGCCGGC CACGCCGGGA
GACATCCTGC TGCACCTGCG CGCCCAGCAG CTGGACCTGT GCTACGACTT CGCCGAGCAC
GTCCTGAGGC GCCTGGGCGG CGCGGTGAAG GTCGTGGACG AGGTCCACGG CTTCCGCTAC
CACGACAACC GCGACCTGCT CGGCTTCGTG GACGGCACGG AGAACCCCGA GGGCGTCGAG
GCGGTCGAGG CGGCCCTGGT GGACGCCGAG CGGGATCCGG ACTTCGTCGG CGGCAGCTAT
GTGATCGTGC AGAAGTACAC CCATGACATG GCCGCCTGGC GAGCGTTGTC GGATCTGGAG
CAGGGGCTGA TCATCGGCCG CACCAAGATC GACAACATCG AGCTGCCGGA CGCGGTGAAG
CCGAAGACCT CGCATGTGGC GTTGAACACG GTCGTCGGCC CGGACGGCGA GGAACAGGAC
ATCCTGCGCC ACAACATGGC CTTCGGCTCC TTCCGCGAAG GCGAGGCGGG CACCTATTTC
ATCGGGTACT GCGCCACCCC CGAGGTCACC GAGCAGATGT TGCGCAACAT GTTCCTCGGT
GACGAGGAGG GCAACCAGGA CCGGATCCTG GACTTCTCCA CGGCGGTCAC CGGAGGGCTC
TTCTTCGTGC CGGCCAAGAG TTTCCTGGAC GACCCGCCGC CGGCGCCCGG CGAGGACTCC
CCCGCACCCG ACGCGTCCCC CTCGGATTCA CCCGCCGCGG ACCCCGCGCC GCCGGCACGA
CCCCGACCCA CCGACGGATC CCTGGGCCTC GGAAGCCTGA AACGGAGCTG A
 
Protein sequence
MPKLSPADAA QPVVSPLTSA AMFLVLTIED GGEDAVREVV PDLSAYARAV GFGYPEGGLA 
CVTGFGSAAW DRLFGGPRPA ELHPFVALDG PRHSAPATPG DILLHLRAQQ LDLCYDFAEH
VLRRLGGAVK VVDEVHGFRY HDNRDLLGFV DGTENPEGVE AVEAALVDAE RDPDFVGGSY
VIVQKYTHDM AAWRALSDLE QGLIIGRTKI DNIELPDAVK PKTSHVALNT VVGPDGEEQD
ILRHNMAFGS FREGEAGTYF IGYCATPEVT EQMLRNMFLG DEEGNQDRIL DFSTAVTGGL
FFVPAKSFLD DPPPAPGEDS PAPDASPSDS PAADPAPPAR PRPTDGSLGL GSLKRS