Gene Caci_3900 details


Gene Information

Locus tag: Caci_3900
Symbol:
ID: 8335253
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Catenulispora acidiphila DSM 44928
Kingdom: Bacteria
Replicon accession: NC_013131
Strand:
Start bp: 4425191
End bp: 4426498
Gene Length: 1308 bp
Protein Length: 435 aa
Translation table: 11
GC content: 71%
IMG OID: 644957026
Product: Dyp-type peroxidase family
Protein accession: YP_003114629
Protein GI: 256393065
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG2837] Predicted iron-dependent peroxidase
TIGRFAM ID: [TIGR01412] Tat-translocated enzyme
            [TIGR01413] Dyp-type peroxidase family
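
The start, end, and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the CDS ends with a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(4425191, 4426498)
print(length)                  # 1308
print(protein_length(length))  # 435
```

Both results match the Gene Length and Protein Length fields in the record.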


Plasmid Coverage information

Num covering plasmid clones: 28
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value: 0.0223253
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGACAAG CAGTGGACGA CGAGGCCACG ACCGAGACGG CGGCGGAACC CGGGACGGCC 
AGCCGGCGGC AGGTGATCGG CCGGGCGATC GGCGCGGCCG GAGTGGTCGC GGTCGGTGGC
GTCGGCTACG GCGTCGCCCG GGCCACCGAG CCCGGCGGCA GCCCTTCCCC GCCCGCGTCC
TCGGCTTCGG ACATTGTGCC GTTCTACGGC GCCAACCAGG CCGGGATCGC CACGCCGGCC
CAGGACCGGC TCGCCTTCGC AGCTTTCGAT GTCACCAGCG GTTCGGCGCA GGCCTTCCAG
GTGATGCTGG GAACCTGGGC CGCGGCCGCG GCGCAGATGA CCAAGGGCCT GCCGGTCGGC
GCGGTGGACA ACAACCCACA GTCCCCGCCG ATCGACACCG GCGAGGCCGA CGGACTGAGC
GCCGCCGGAT TGACGATCAC CGTCGGGTTC GGGCCCTCGC TGTTCGACCA CCGGTTCGGT
CTGGCCGGCA AGCGCCCGGC GGCCCTGGCC GACCTGCCGA CCCTGCCCGG CGACGGCTCC
CTGCAGCCCG CGCGCAGCGG CGGCGACCTC TGTGTGCAGG CCTGCGCGGA CGACCCGACC
GTGGCGTTCC ACGTGATCCG CAACTTCGCC CGGCTGGCTC GCGGCACCGC GGTCATCCGC
TGGTCCCAGC TCGGATTCGG CCGCACCTCC TCGACCTCGG ACAGCCAGCA GACCGAGCGC
AACCTGATGG GCTTCAAGGA CGGCACCCGC AACATCAAAG CCGAAGCCGC CGACGATCTG
CGCGACCACG TCTGGGTCGG CTCCGAGACC GACCAGGCCT GGATGACCGG CGGCAGCTAT
CTGGTGGCCC GCCGTATCCG GATGCTGATC GAGTCCTGGG ACACCGACTA CCTGTCCGAC
CAGGAGAACG TCTTCGGCCG GTTCAAGACC TCCGGCGCCC CGCTCACCGG CAAGTCCGAG
TTCGACACAC CGGACCTGGC CGCCAAGCAC ACCGACGGCA CCCCGGTCAT CCCGCTCAAC
GCCCACATCC GGCTGGCCGG CCCGGAGACC AACAACAACC AGAAGATCCT GCGCCGCGGC
TACTCCTACA CCGACGGCAT CGACTCCGCC ACCGGCCTGC TCGACGCCGG CCTGTTCTTC
CTCGCCTACC AGAAGGACCC GCGCCGGCAG TTCGTCCCGA TCCAGACCCG GCTCGGCCAC
CAAGACAACC TGAACGAGTA CATCCGGCAC ACCGGCAGCG CGCTGTTCGC GGTGCCGCCG
GGAGTCTCAG CCGCCGGAGA CTGGTGGGGG AAGAGCCTAT TCGCGTGA
 
Protein sequence
MGQAVDDEAT TETAAEPGTA SRRQVIGRAI GAAGVVAVGG VGYGVARATE PGGSPSPPAS 
SASDIVPFYG ANQAGIATPA QDRLAFAAFD VTSGSAQAFQ VMLGTWAAAA AQMTKGLPVG
AVDNNPQSPP IDTGEADGLS AAGLTITVGF GPSLFDHRFG LAGKRPAALA DLPTLPGDGS
LQPARSGGDL CVQACADDPT VAFHVIRNFA RLARGTAVIR WSQLGFGRTS STSDSQQTER
NLMGFKDGTR NIKAEAADDL RDHVWVGSET DQAWMTGGSY LVARRIRMLI ESWDTDYLSD
QENVFGRFKT SGAPLTGKSE FDTPDLAAKH TDGTPVIPLN AHIRLAGPET NNNQKILRRG
YSYTDGIDSA TGLLDAGLFF LAYQKDPRRQ FVPIQTRLGH QDNLNEYIRH TGSALFAVPP
GVSAAGDWWG KSLFA
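
The gene and protein sequences above are related by translation table 11, and the GC content field can be recomputed from the gene sequence. The following is a minimal sketch in plain Python, assuming the sequence is pasted as text with the spaces and line breaks shown above. The helper names are hypothetical. The codon table uses the standard amino acid assignments, which table 11 shares; table 11 differs in its permitted start codons, which does not matter here because this gene starts with ATG.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard assignments, shared by table 11).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence in blocks of 10."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGGGACAAG CAGTGGACGA CGAGGCCACG"))  # MGQAVDDEAT
```

Passing the full gene sequence to `translate` and `gc_content` allows a comparison against the protein sequence and the GC content field of the record.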