Gene Caci_1967 details


Gene Information

Locus tag: Caci_1967
Symbol:
ID: 8333310
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Catenulispora acidiphila DSM 44928
Kingdom: Bacteria
Replicon accession: NC_013131
Strand:
Start bp: 2223803
End bp: 2225053
Gene Length: 1251 bp
Protein Length: 416 aa
Translation table: 11
GC content: 71%
IMG OID: 644955116
Product: peptidase S8/S53 subtilisin kexin sedolisin
Protein accession: YP_003112728
Protein GI: 256391164
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG4934] Predicted protease
TIGRFAM ID:
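
The length fields in this record can be cross-checked against the coordinates. The sketch below is illustrative only and assumes that Start bp and End bp are 1-based and inclusive, and that the unspliced CDS consists of whole codons ending in one stop codon. The variable names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2223803
end_bp = 2225053
gene_length_bp = 1251
protein_length_aa = 416

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the CDS is made of whole codons and the last codon is a stop,
# so the protein has one residue fewer than the number of codons.
codons, remainder = divmod(span, 3)
expected_protein_length = codons - 1

print(span, remainder, expected_protein_length)  # prints: 1251 0 416
```

Under those assumptions the span matches the listed Gene Length and the codon count minus the stop matches the listed Protein Length.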


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value: 0.235133
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGTTCCAGC TCACTCGGAA GAAGCGGGGG CGCGCAACGA CGCGTGCGGC CGCTGCCCTG 
TTGGGCGTCG CCGGCTTGAT AACCGCGGGC GCTGTCGCCG CCCAGGCGTC TCCGGCGTCC
TCCTCCGCGG CCTCCGGGCC GGCTTCGGTG CACAGCTGCA GCCAGGCTGT CGCGCCGGGG
TATGCCACGT GCTTCGCGTT GAAGCGCACT GACAAGAAGG CCCATGCCAT GGCGGCCAAC
GGGCTGCCCT CCGGCTTCGG TCCGGCCGAT CTGGACAGCG CCTACTCGCT GCCGGCCAGC
GGCGGATCGG GCCAGACGGT GGCCATCGTG GACGCTCAGG ACGACCCGAA CGCGGAGTCG
GACCTGGCGA CCTACCGCTC GACGTACGGC CTGCCGGCGT GCACCACCGA CAACGGCTGC
TTCAAGAAGA TCGACCAGAA CGGCGGGAGC AACTACCCGA CCGCCGACCA GGGCTGGGCC
GGCGAGATCT CGCTGGACGT GGACATGGTC TCGGCCGTGT GCCCGGACTG CCACATCCTG
CTGGTCGAGG CGACCTCGGC GAACATGAAC GACCTGGGCA CCGCGGTGAA CCAGGCGGTG
TCGCAGGGTG CGAAGTTCGT CTCCAACAGC TACGGCGGCT CCGAGGACGG TTCGGAGGGC
CAGTCGGACT CCACCTACTT CGACCACCCC GGCGTGGCCA TCACCGCCTC CTCCGGTGAC
GGCGCCTACT CCGCCGGCAC CGAGTACCCG GCCTCCTCCC AGTACGTCAC CGCGGTCGGC
GGCACCTCGC TGACCAAGGA CTCTTCCAGC CGCGGCTGGT CGGAATCGGT GTGGGGGACC
AGCGACACCG AGGGCGCCGG CTCGGGCTGC TCGCAGGACG TCGCCAAGCC CTCGTGGCAG
ACCGACACCG ACTGCGCCAA CCGGATGGTC GCGGACGTCT CCGCGGTCGC CGACCCGGCC
ACCGGCGTCG CGGTCTACCA GACCTACGGC GGCAACGGCT GGGCCGTGTA CGGCGGCACC
TCGGCGTCCT CGCCGATCAT CGCCTCGGTC TACGCGCTGG CCGGAACCCC GGGTTCGAAC
GACGTCCCGG CGTCCTACCC CTACGCGCAC ACCGGCAACC TGAACGACGT CACCAGCGGC
AGCAACGGCA GCTGCTCGCC GGACTACTAC TGCACCGCCG GCCAGGGTTA CGACGGCCCG
ACGGGCCTGG GCACGCCGAA CGGGCTGACC GCCTTCACCG CGGGTAGCTG A
 
Protein sequence
MFQLTRKKRG RATTRAAAAL LGVAGLITAG AVAAQASPAS SSAASGPASV HSCSQAVAPG 
YATCFALKRT DKKAHAMAAN GLPSGFGPAD LDSAYSLPAS GGSGQTVAIV DAQDDPNAES
DLATYRSTYG LPACTTDNGC FKKIDQNGGS NYPTADQGWA GEISLDVDMV SAVCPDCHIL
LVEATSANMN DLGTAVNQAV SQGAKFVSNS YGGSEDGSEG QSDSTYFDHP GVAITASSGD
GAYSAGTEYP ASSQYVTAVG GTSLTKDSSS RGWSESVWGT SDTEGAGSGC SQDVAKPSWQ
TDTDCANRMV ADVSAVADPA TGVAVYQTYG GNGWAVYGGT SASSPIIASV YALAGTPGSN
DVPASYPYAH TGNLNDVTSG SNGSCSPDYY CTAGQGYDGP TGLGTPNGLT AFTAGS
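
The gene sequence begins with GTG while the protein begins with M, which is consistent with translation table 11 allowing alternative start codons. The following is a minimal sketch of how the two sequences relate, not the database's own pipeline. The helper names (`clean`, `translate`, `gc_fraction`) are hypothetical, the start-codon set is a simplified assumption limited to ATG, GTG and TTG, and only the first and last blocks of the gene sequence are used as input.

```python
from itertools import product

# Standard codon assignments in TCAG order. Table 11 uses the same
# amino-acid assignments and differs mainly in which codons may start a gene.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: a simplified set of start codons; an initiating codon is read as M.
ALT_STARTS = {"ATG", "GTG", "TTG"}


def clean(seq):
    """Remove the spaces and line breaks used to format sequence blocks."""
    return "".join(seq.split()).upper()


def translate(dna, first_is_start=True):
    """Translate whole codons, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        codon = dna[i:i + 3]
        if i == 0 and first_is_start and codon in ALT_STARTS:
            protein.append("M")
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna):
    """Fraction of G and C bases in a sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 and last 21 bases of the gene sequence in this record.
head = "GTGTTCCAGC TCACTCGGAA GAAGCGGGGG"
tail = "GCCTTCACCG CGGGTAGCTG A"

print(translate(head))                        # MFQLTRKKRG
print(translate(tail, first_is_start=False))  # AFTAGS
```

The two printed fragments agree with the start and end of the protein sequence above. Passing the full gene sequence to `gc_fraction` would give a value to compare with the listed GC content.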