Gene Caci_5298 details


Gene Information

Locus tag: Caci_5298
Symbol:
ID: 8336652
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Catenulispora acidiphila DSM 44928
Kingdom: Bacteria
Replicon accession: NC_013131
Strand:
Start bp: 6106407
End bp: 6107591
Gene Length: 1185 bp
Protein Length: 394 aa
Translation table: 11
GC content: 73%
IMG OID: 644958396
Product: peptidase S8/S53 subtilisin kexin sedolisin
Protein accession: YP_003115998
Protein GI: 256394434
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG1404] Subtilisin-like serine proteases
TIGRFAM ID:
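
The start, end, and length fields in the record above are related by simple arithmetic, which the following minimal sketch checks. It assumes the coordinates are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are illustrative and are not part of the database.

```python
# Values copied from the Gene Information fields above.
START_BP = 6106407
END_BP = 6107591


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(n_bases: int) -> int:
    """Residue count for a CDS, assuming its last codon is a stop codon."""
    if n_bases % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return n_bases // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1185 394
```

Under these assumptions the computed values agree with the listed Gene Length (1185 bp) and Protein Length (394 aa).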


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.111047
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 32
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGGACC ACGCAGCGCG GATGGCTCGG ATCCTCGAGA CGCACCAGGA TGTCGTCCAC 
GTCGGCGGCT CCACCGAGGC GCTGATCAAG CGCGACGAGC TGCTGGTGCT CGGTGCGCAC
GCGGACGCGG TGCACGAGCA GGCGCGGCAG TGGGTGGACT CCCGGGAGGA CTTCGCCGAG
CTGGGGGTCA GCCGCCTGCG GCTGCGGGCC GGCGCCGGGG TGGACGCCGC GGATCTGACG
CACAGCCTGC GGGGCGGCGC GGGGGCGCAC CGGCGGACGA GCGTCACGCC CAACCACGTC
ATGAGCGGGG CGCCGAACTG GACCGGCGGT CCCTTCGGCG CGCCGACGCC CGCCGCAGAC
CTGCCCGCCC CGGTCGACGC CGAAGGCGGC GGCCGGCGGG CCACGATCGG GATCCTGGAC
ACCGGGATCG ACCCGCACCC GTGGTTCGCC GAGGCGGACT GGTACCAGGC CTGCACCGAG
ACAGAGCACG AAGACCTGGA CCCGGCGTCC GAGGACGACC TGGAGTCCGA CTCCGGCCAC
GGCACGTTCA TCGCCGGCGT GATCCTGCAG CACGCTCCGG GAACCTATCT GCGGGTGCAG
CGGGTCCTGG GCACCGACGG CGTCACCGAC GAGCTGGAAC TGCTGCACGG TCTGAGGCGG
CTGCACGCCC GGGCGGCCGC CGAGAGCAAC CGTCTGGACG TCCTGAACCT GTCACTGGGC
TGCTTCACCT TCGACGACCG GCCCTCCCCG GTCCTGGCCG ACGCCTTCGC GCGGGTCGCC
CGGCACTCGG TGATCGTCGC CGCCGCCGGG AACCACTCCT CGGACCGTCC CTACTGGCCC
GCCGCCCTCA AGGACGTCGT CGCCGTCGCG GCTCTGGCCC AGGCGGACAC CGACGGCCCG
GAGCGCGCGT CCTTCTCCAA CTACGGCTGG TGGGTGGACG CCTCGGCGCC GGGCGAGAAG
GTCTCCAGCA GCTTCCTGAC CCACGGCCGG GAGAACGGGG AAGACTTCCA CGGCTTCGCG
ACCTGGAGCG GCACCAGCTT CGCCGCCCCG TACGTGGCCG GTAAGATCGC CGCTTTGATG
TCCGCCAAGG ACATGACGGC GCGCGACGCC CTCAGCGAGC TGCTCGACCC GGCCAACACC
CGCATCCCCG ACCTGGGGGT GGTGGTGGCC TCGGACGGCC GCTGA
 
Protein sequence
MTDHAARMAR ILETHQDVVH VGGSTEALIK RDELLVLGAH ADAVHEQARQ WVDSREDFAE 
LGVSRLRLRA GAGVDAADLT HSLRGGAGAH RRTSVTPNHV MSGAPNWTGG PFGAPTPAAD
LPAPVDAEGG GRRATIGILD TGIDPHPWFA EADWYQACTE TEHEDLDPAS EDDLESDSGH
GTFIAGVILQ HAPGTYLRVQ RVLGTDGVTD ELELLHGLRR LHARAAAESN RLDVLNLSLG
CFTFDDRPSP VLADAFARVA RHSVIVAAAG NHSSDRPYWP AALKDVVAVA ALAQADTDGP
ERASFSNYGW WVDASAPGEK VSSSFLTHGR ENGEDFHGFA TWSGTSFAAP YVAGKIAALM
SAKDMTARDA LSELLDPANT RIPDLGVVVA SDGR
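
The protein sequence corresponds to the gene sequence read codon by codon under translation table 11. As a hedged illustration, the sketch below translates only the first row of the gene sequence and computes its GC fraction. The codon table is built from the standard amino-acid assignment, which translation table 11 shares for internal codons. Alternative start codons are not handled here; this gene begins with ATG, so that does not affect the example. The helper names (`clean`, `translate`, `gc_content`) are illustrative.

```python
from itertools import product

# Standard codon assignment in NCBI base order T, C, A, G.
# Translation table 11 uses the same assignment for internal codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in 10s."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 0, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_content(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return sum(base in "GC" for base in dna) / len(dna)


# First row of the gene sequence shown above (60 bases).
FIRST_ROW = "ATGACGGACC ACGCAGCGCG GATGGCTCGG ATCCTCGAGA CGCACCAGGA TGTCGTCCAC"

print(translate(FIRST_ROW))  # MTDHAARMARILETHQDVVH
print(f"{gc_content(FIRST_ROW):.2f}")
```

The translated fragment matches the first 20 residues of the protein sequence. Passing the full gene sequence to the same functions would allow the listed protein length and GC content to be checked in the same way.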