Gene Caci_2539 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_2539 
Symbol 
ID8333888 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp2870107 
End bp2871411 
Gene Length1305 bp 
Protein Length434 aa 
Translation table11 
GC content72% 
IMG OID644955692 
Productpeptidase S8/S53 subtilisin kexin sedolisin 
Protein accessionYP_003113298 
Protein GI256391734 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG4934] Predicted protease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones33 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACGTGA AGAACTCCAC TGCGCGAGGC CCGGTCGGGC GGCGCGGGCT GCGCCTGGCG 
ACCGTCGTCG CGGCGGCCTC GACCCTGGTG GCCGCGCTGA CGGTCACCGC GGCTTCGGCC
AGCGCGGGGG TCGCGAGCGC CGACCACGGC AAGGGCCCCA AGGCGGGCAC CGTCAAGAAC
GCCTGCGCCA CGGCGAAGCC CGGCACCGCC CGCTGCTTCG CCGAGATCCG CACCGATGTC
GCCGGCGGGA CCGGCGTCCG CGGACCGGCG GCGGCCAAGC TCGGCGCCGC GGCGAAGACG
ACCGCGCTGC CGGCCGGCTT CGGTCCGGCG GACCTGCACT CCGCGTACAA CCTGCCGACC
ACCGGCGGGG CGAATCAGAC GGTTGCGATC GTCGACGCCG GAGACGACCC GACGGCCGAG
GCGGACCTCG CGGTCTACCG CTCCACCTAC GGCCTGCCCG CGTGCACCAC CGCCAACGGC
TGCTTCACCA AGGTGAACCA GCGCGGCGCG GCCAGCCCGC TGCCGCCGGA CCAGGGCTGG
GGCGTGGAGA TCGCGCTCGA CGTGGACATG GTCTCGGCGG CGTGCCCGCA GTGCAAGATC
CTGCTCGTCG AGGGTGACTC AGCCTCCTTT GACGATCTGG GAAACTCGGT GAACGAAGCC
GTGGCGCTCG GCGCGACCGA GGTGTCCAAC AGCTACGGCG GCTCGGAGGG CAACGGCATA
GACGCCTATG CCGCGGACTA CTCGCACCCG GGCGTGGCGA TCGTCGCCTC CAGCGGTGAC
GGCGGCTACG ACATCCCGAA CGTCCCGGCC GAGTACACCA GCGTGGTCGC CGTCGGCGGT
ACCTCGCTGA CCAAGGCCGC CAACACCCGC GGCTGGACCG AGACCGCCTG GCAGGGCGCC
AGCAGCGGCT GCTCGGCGTG GGTGGACAAG CCGGCCTGGC AGACCGACGC CAACTGCCCC
GGCCGCATGG TCGCGGACGT CTCGGCGGAC GCCGACCCGA ACACTGGCCC GGCGATCTAT
GTCACCGACA CCCCCGACCT CGAGGGCCTG CCCTCCGGCT GGGGCATCGT CGGCGGCACC
AGCGCCTCCT CGCCGTTCGT CGCCGGCGTG ATCGCCCTGG CCGGCAACCC GCAGAAGTTC
CCGAACGCCT CGGCGTTCTA CAGCAACCAC AGCAGCCTGA ACGACGTGGT CGGCGGGAAC
AACATCTTCG GGATCGACTG CGGCGGCGAC TACCAGTGCA ACGCGGTCGC CGGCTACGAC
GGCCCGACGG GCTGGGGGTC GCCGAACGGC TTGTCCGCAT TCTGA
 
Protein sequence
MNVKNSTARG PVGRRGLRLA TVVAAASTLV AALTVTAASA SAGVASADHG KGPKAGTVKN 
ACATAKPGTA RCFAEIRTDV AGGTGVRGPA AAKLGAAAKT TALPAGFGPA DLHSAYNLPT
TGGANQTVAI VDAGDDPTAE ADLAVYRSTY GLPACTTANG CFTKVNQRGA ASPLPPDQGW
GVEIALDVDM VSAACPQCKI LLVEGDSASF DDLGNSVNEA VALGATEVSN SYGGSEGNGI
DAYAADYSHP GVAIVASSGD GGYDIPNVPA EYTSVVAVGG TSLTKAANTR GWTETAWQGA
SSGCSAWVDK PAWQTDANCP GRMVADVSAD ADPNTGPAIY VTDTPDLEGL PSGWGIVGGT
SASSPFVAGV IALAGNPQKF PNASAFYSNH SSLNDVVGGN NIFGIDCGGD YQCNAVAGYD
GPTGWGSPNG LSAF