Gene Caci_1002 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_1002 
Symbol 
ID8332336 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp1142927 
End bp1143976 
Gene Length1050 bp 
Protein Length349 aa 
Translation table11 
GC content73% 
IMG OID644954151 
Productputative DNA-binding/iron metalloprotein/AP endonuclease 
Protein accessionYP_003111771 
Protein GI256390207 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0533] Metal-dependent proteases with possible chaperone activity 
TIGRFAM ID[TIGR00329] metallohydrolase, glycoprotease/Kae1 family 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value0.188196 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACGGCAC TGAAGGAAGA ACCTCTGGTC CTCGGCTTCG AGACCTCCTG CGACGAGACC 
GGCATCGGCA TCGTGCGGGG CAACACCCTG CTCGCCGACG CGGTGGCCTC CAGTGTGGAC
GAGCACGCGC GCTTCGGCGG CGTGGTGCCC GAGGTCGCCA GCCGGGCGCA CCTGGAGGCC
ATGGTGCCGA CGGTGCAGCG GGCGCTGAGC GACGCCGGCG TGGCGCTGGC CGACATCGAC
GCGATCGCCG TCACCGCCGG TCCCGGCCTG GCCGGCGCCC TGCTGGTCGG CGTCGCCGCG
GCCAAGTCCT ACGCCCTGGC GCTGGACAAG CCGATCTACG GCGTCAACCA CCTCGCCGCG
CACATCTGCG TCGACCAGCT CGAACACGGT CCGCTGCCCG AGGGCTGCAT CGCCATGCTG
GTCTCCGGCG GGCACTCCTC GCTGCTGTTC GTGCCCGACG TCACCGGCGA CGTCCAGCCG
CTCGGCCAGA CCATCGACGA CGCCGCCGGC GAGGCCTTCG ACAAGGTGGC GCGCGTCCTG
GGCCTCGGCT TCCCAGGAGG CCCGCTGATC GACAAGGCGG CGCGCGAGGG CGACCCGGAG
GCGATCGCCT TCCCCCGCGG CCTCACCGGT CCGCGCGACG CGCCCCTGGA CTTCTCCTTC
TCCGGCCTCA AGACCGCGGT CGCGCGCTGG GTCGAGAAGC GCGAGCGCGC CGGCGAGAAG
ATCTCGATCC CGGACGTCGC GGCCTCCTTC CAGGAAGCGG TCGTGGATGT CCTGACCCGC
AAGGCGATCC TGGCGTGCAA GGAGCAAGGA GCGGAGTACC TGCTGATCGG CGGAGGCGTC
GCAGCGAACT CCCGCCTGCG CGTCCTGGCC GAGGAACGCG CCGCCAAGGC GGGCATCCAG
GTCCGCGTCC CGCGCCCCAA GCTGTGCACG GACAACGGCG CCATGGTCGC CGCCCTCGGT
TCGGAACTGG TGCGGCGCGG CCGCGTGCCC TCGCAGCTCG GCTTCCCGGC GGATTCCTCC
CAGCCGGTCG TGGACGTGGT GGTCCGATGA
 
Protein sequence
MTALKEEPLV LGFETSCDET GIGIVRGNTL LADAVASSVD EHARFGGVVP EVASRAHLEA 
MVPTVQRALS DAGVALADID AIAVTAGPGL AGALLVGVAA AKSYALALDK PIYGVNHLAA
HICVDQLEHG PLPEGCIAML VSGGHSSLLF VPDVTGDVQP LGQTIDDAAG EAFDKVARVL
GLGFPGGPLI DKAAREGDPE AIAFPRGLTG PRDAPLDFSF SGLKTAVARW VEKRERAGEK
ISIPDVAASF QEAVVDVLTR KAILACKEQG AEYLLIGGGV AANSRLRVLA EERAAKAGIQ
VRVPRPKLCT DNGAMVAALG SELVRRGRVP SQLGFPADSS QPVVDVVVR