Gene Caci_0016 details


Gene Information

Locus tag: Caci_0016
Symbol:
ID: 8331340
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Catenulispora acidiphila DSM 44928
Kingdom: Bacteria
Replicon accession: NC_013131
Strand:
Start bp: 17749
End bp: 18954
Gene Length: 1206 bp
Protein Length: 401 aa
Translation table: 11
GC content: 70%
IMG OID: 644953182
Product: peptidase S1 and S6 chymotrypsin/Hap
Protein accession: YP_003110812
Protein GI: 256389248
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain
TIGRFAM ID:
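
The length fields in this table can be cross-checked against the coordinates. The sketch below assumes the start and end positions are 1-based and inclusive, and that the last codon of the CDS is a stop codon that is not counted in the protein length. Both are assumptions, although they agree with the listed values. The function names are illustrative only and are not part of any database API.

```python
# Values copied from the Gene Information table.
START_BP = 17749
END_BP = 18954


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1  # drop the stop codon


length = gene_length(START_BP, END_BP)
print(length, protein_length(length))  # 1206 401
```

The result does not depend on the strand, which is blank in this record.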


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 32
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCGGAT TCGAGCACGA CCCGTACCAG CCGGAGCAGC CCGCCACCGG CGACCCGCAG 
CCGTGGGGCC ACCCGTCTTC CGGGCCGGTG CTCGGTCCGG CCCACGCCTC CGCGTCGGCT
TACCCGCCGG CCTACCCGTC GGCCTCTTCG CCTTCGCCGG CCGCCTCCGA GCCGATGCCC
CCGTACACGC CGCCGATCAC CTCGATAACG CCGGGCTATG GCAACCCGGG CGAGGCCGGC
GGTCTCGGCG GGCCTGGCAG TTTCGGCGGG CCTGGCGGTT TCGGCGGGCC TGGCGGTCCG
GGCGGTCCCG GATACACCAC GCATCCGGCG TTCTCCCCCG AGCCGCCGCG ACGTCCGCGG
CGCAAGCGCC GGATGGGCAT GGCCCTGATC ATCGCCGGCA CCATCGCGGC CTCGGCCGCC
GCCGGAGGCA TCGCGGGGAC CATAGCCAGC CACAACAACT CCTCCAACAG CGCCTCCTCG
AGCGTGCCGC TGAACAACAC GAGCGTGAAC ACCCCGGTCA GCAACCAGAC CGGTACACCG
ACCACCACGG TCGGCCAGGT CGCCAAGGCC GCACTGCCCA CGGTGGTCCA GGTCTCGGTG
GAGTCCTATC AGGGCAAGTC GGTCGGCTCC GGCGTCATCC TGACCGCCGA CGGCCTGATC
CTCACGAACA ACCATGTGAT CACCGACGCG GCCAACGGCA ACGGCCAGAT CACCATCACC
TTCAACAACG GCAAGACCGC CCAGGCGAGC ATCGTCGGCT ACGACAGCGG CAGCGACCTG
GCGGTGATCA AGGCGCAGAG CGTCAGCGGC CTGCCCACCG CCAGCCTCGG CGACAGCAGC
AAGATCCAGA TCGGCGACAC GGTGGTCGCC ATCGGCTCCC CCGACGGCCT GCAGAGCACG
GTGACCAGCG GCATCGTCAG CGCCCTGAAC CGCCAGGTGA CGGTCAGCAG CGAGTCCTCG
AGCCGGTTCT CCAGCGGCAG CCAGGTCACC TACAGCGCGA TCCAGACCGA CGCCAGCCTC
AACCCCGGCA ACAGCGGCGG CCCGCTGCTG AACGCCCAGG GCCAGGTCAT AGGCATCAAC
TCGGCCATCT ACTCGCCGAC CAGCTCCGCC AACGCCCAGG GCGGCAGCGT CGGACTCGGC
TTCTCGATCC CGATCGACCA GGTCAAGACC ATGCTCGCCA AGCTCGAAGG CGGTCAGATG
AGCTAG
 
Protein sequence
MTGFEHDPYQ PEQPATGDPQ PWGHPSSGPV LGPAHASASA YPPAYPSASS PSPAASEPMP 
PYTPPITSIT PGYGNPGEAG GLGGPGSFGG PGGFGGPGGP GGPGYTTHPA FSPEPPRRPR
RKRRMGMALI IAGTIAASAA AGGIAGTIAS HNNSSNSASS SVPLNNTSVN TPVSNQTGTP
TTTVGQVAKA ALPTVVQVSV ESYQGKSVGS GVILTADGLI LTNNHVITDA ANGNGQITIT
FNNGKTAQAS IVGYDSGSDL AVIKAQSVSG LPTASLGDSS KIQIGDTVVA IGSPDGLQST
VTSGIVSALN RQVTVSSESS SRFSSGSQVT YSAIQTDASL NPGNSGGPLL NAQGQVIGIN
SAIYSPTSSA NAQGGSVGLG FSIPIDQVKT MLAKLEGGQM S
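
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch of how the blocks could be cleaned and the GC fraction computed. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first and last lines of the gene sequence are used here, so the printed GC value is not the whole-gene figure.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (spaces, newlines) and upper-case the sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First and last lines of the gene sequence, copied from this record.
first_line = "ATGACCGGAT TCGAGCACGA CCCGTACCAG CCGGAGCAGC CCGCCACCGG CGACCCGCAG "
last_line = "AGCTAG"

fragment = clean_sequence(first_line + "\n" + last_line)
print(len(fragment))              # 66
print(fragment.startswith("ATG"))  # True: start codon
print(fragment.endswith("TAG"))    # True: stop codon
print(round(gc_fraction(fragment), 2))
```

Passing the full 20-line gene sequence through `clean_sequence` should give a string of 1206 bases, which can then be compared with the Gene Length and GC content fields.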