Gene Caci_5281 details


Gene Information

Locus tag: Caci_5281
Symbol:
ID: 8336635
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Catenulispora acidiphila DSM 44928
Kingdom: Bacteria
Replicon accession: NC_013131
Strand:
Start bp: 6084197
End bp: 6087127
Gene Length: 2931 bp
Protein Length: 976 aa
Translation table: 11
GC content: 72%
IMG OID: 644958379
Product: Peptidase S53 propeptide
Protein accession: YP_003115981
Protein GI: 256394417
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG4934] Predicted protease
TIGRFAM ID:
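
The start, end, gene length, and protein length fields above are related by simple arithmetic. The sketch below shows that relationship. It assumes the coordinates are 1-based and inclusive, and that the CDS length includes the stop codon; both assumptions are consistent with the values listed in this record but are not stated by the source. The function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Protein length, assuming the CDS ends with one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length_bp // 3 - 1


# Values taken from the record above.
length = gene_length_bp(6084197, 6087127)
print(length, "bp")
print(protein_length_aa(length), "aa")
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields.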


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 29
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCCACCA CCGCAAAGAA GCGTTCGAAG ATACTGGCCG CCGGGTCGAT ACCGATGGCC 
ACGGCCGTCA TCGCCGCCGT CGGGATGTCC TCGGCCCAGG CCCAGAGCAA CGACACGAAC
AACAACGGCC ACACCGACTC CCTGGGGCGG GTCGCCGTCG CCGGGACGCA GCCGGCCTGG
GCCACCCCGG CCACCGAGAA GGCCGCCGTT CCGGCCACCC AGACCATCAG CACCCGGGTC
TACCTGGCCG GGCAGAACCA GGCGCAGATG TCGGCGCTGG CCCAGTCGGT CACCGACCCG
GCCAGCCCGA ACTACCACAA GTACCTGACC CCGGCGCAGG TCCAGTCCGA GTTCGGCTCG
ACCGCCTCGC AGGTCAAGAC GGTCACCGAC TGGGCGAAGG GCGCCGGGCT GCAGGTCAAG
TCGGTCGGCG AGAGCTGGGT GGACCTGGAG GGTCCGGCCT CGGCGGTCCA GACGGCCTTC
GGCACGACCC TGGCCAACTT CACCGCGCCG GACGGCCAGT CGCACTACGC GCCGGCCACC
GCGGCCATGG TCCCGGGCAA CGTCGCCAGC GCGATCGTCG GCGTCTCCGG GCTCTACGAC
GCGCCGAAGA TCGAGACGCC CGGCGCCGTG ACCCAGACCC CGTCGACGAC CCCGGCCGCC
GGACACCCGA CCTCCACGAC GTATGACACC TCCTGGTGCT CCACCTCCTG GGGCAGCAAG
AGCTACTACA ACATCCCGGC GCCGATCTGC GGCTACGACT CCGCGCAGCT GCGCGGCGCC
TACGGGGTGG GCAGCTCCGG CCTGACCGGC AAGGGCGCGA CCGTGGCGAT CATCGACGCC
TACGACAGCC CGACGATGGC CGCCGACGCC GCGAAGTACA ACAGCCAGTT CGGCCAGCAG
CAGTTCCGCG CCGGGCAGTA CCGCGAGGTC ACGAACACCG CCGCGTACAA CAACCAGGCC
GCCTGCGACC CCGACAAGTG GGGGCTGGAG CAGAGCCTGG ACGTCGAGGC CGTGCACAAC
ATGGCGCCGG ACGCCAACGT GGTCTACGTC GGCGCCGCCT CGTGCGAGGA CAACGACCTG
ATGGGGGCCT ACGAGTACGT GGTGAACAAC CACGCCGCCG ACATCGTGTC GGTCTCGCTC
GGCGGTCTGA TGCACACCAC CGGCTGGAAC CAGGACGCGG CGACCACCGC CGCCTTCGAC
CGCATCTTCA TGAAGGGCGC CCTGGAGGGC ATCGGCTTCA ACTTCTCCAC CGGCGACTGC
GGCGACGACA ACCCGGCGAA CGCCAACACC GGCTTCAACT GCTCCCCGGA CTCCGCCGGC
AAGCAGACCG AGTACCCGGC CAGCTCGGCG TGGGTCACCG CGGTCGGCGG CACGGCGCTG
AAGGTCGGCA CGGGCAACAG CTACCAGGGC GAGGAGCCCT GGGGCGACTA CCAGACCCCG
AAGGCCAACG TGCCCGGGAA CTTCGCCAAC GGCAGCTTCA CCGGCGGTGC CGGCGGCGGG
ACGTCCACGG ACATCACGCA GCCGTTCTAC CAGCGCAGCG CGGTCCCGGC CGCGCTGTCG
CAGACCGCCC CGAACGGCGC GCACCTGAAG ACCCCGATGC GCACCGTCCC GGACGTGGCG
ATGGACGCCG ACCCGTACAC CGGCATGGCG ACCTTCCGCA CCGCCGGCGG AAAGCCGGAC
TGGAACCCGA TGGGCGGCAC CTCGCTGGCC GCCCCGCTGT TCGCCGGCGA GGCCGCGCTG
CAGATGCAGG CGCACGGCGG CGTGGCTCCG GGCTTCGAGA ACCCGACCAT CTACGCCAAC
GCGAACAAGT TCCGCGACGT GACCGCCAAC GGCGGGATGT ACACCCTGAT CCCCGAGGGC
TGGAGCGGCA GCACGCTGAC CCAGGCCCAG GAAGAGATCA TGGGCGACGA CTCCTCGCTC
AAGGCCGGTC CCGGCTATGA CGAGGCCACG GGCCTGGGCT CGCCGACCCT GGGCTACCTG
CAGGCGCCGT ACGACGCCAA CCGCGTGGGC CGCCTCGCCG GCGGCGACCG CTACCAGACC
GGCATCCAGA TCTCCCAGCA GCAGTTCCCG GCGGCCGGCA CGGCCAACAA CGTGGTGCTG
GCGATCGGCA CCAACTTCCC GGACGCGCTG GCCGGCGCCC CGCTGGCCAA GAAGGTCGGC
GGCCCGCTGC TGCTGACCCC GGGCAACACG GTCGACGCCC AGGTGGTCAA CGAGATCCAC
CGGGTCCTCA AGCCCGGCGG CAAGGTCTAC GTCCTGGGCG GCACCGCGGC GATCACCCCG
GCGGTGGTCA ACGGCCTGAA GCTGCCGGCG GCGCAGATCA CCCGCATCGG CGGCGTGGAC
CGCTACGACA CCGCGATGCA GATCGCCAAG GCCATGGGCA ACCCGAGCCA CGTGGTCCTG
GCCACCGGCG CGGGCTTCGC CGACGCCCTG GCGGCCGGCC CGTACGCCTC CACGGTGTTC
GCCGACAACG GCAACCCGGC GGCGATCCTG CTCACCAACG ACAAGGTCAT GAACCCGGCC
GTGGCGGCGT ACGCGCACGG CGCCAAGGCC GTCTCGGCCG TCGGCGCCCA GGCCGTCACC
GCGGCGAAGA ACGCGCACAT CGCCAACCTG ACCAGCTTCG CCGGCTTCGA CCGCTACGAC
ACCGCCGCGC AGGTCGCCAA CACCTTCCAC GGCGAGCACA TCGCCGGCGT GGCGACCGGC
CTGAACTTCG CCGACGCCCT CACCGGCGCC GCGCAGCTCG GCGAGGCCGG CGGTCCCCTG
GTCCTGACCA ACGTCACCAA CCTCCCCGCC TTCTCCGCCA ACGCCCTCCA CGGCATCGGC
GCCTCCCTCG GCGGCGCGGG CCTGGTCGAG ATCTTCGGCG GCCCGGTGGC CATCAACCAG
GCCACCGAGT ACGCCATCGC GGCCGCGGCC AACGCGATCG CGGAGGGGTG A
 
Protein sequence
MSTTAKKRSK ILAAGSIPMA TAVIAAVGMS SAQAQSNDTN NNGHTDSLGR VAVAGTQPAW 
ATPATEKAAV PATQTISTRV YLAGQNQAQM SALAQSVTDP ASPNYHKYLT PAQVQSEFGS
TASQVKTVTD WAKGAGLQVK SVGESWVDLE GPASAVQTAF GTTLANFTAP DGQSHYAPAT
AAMVPGNVAS AIVGVSGLYD APKIETPGAV TQTPSTTPAA GHPTSTTYDT SWCSTSWGSK
SYYNIPAPIC GYDSAQLRGA YGVGSSGLTG KGATVAIIDA YDSPTMAADA AKYNSQFGQQ
QFRAGQYREV TNTAAYNNQA ACDPDKWGLE QSLDVEAVHN MAPDANVVYV GAASCEDNDL
MGAYEYVVNN HAADIVSVSL GGLMHTTGWN QDAATTAAFD RIFMKGALEG IGFNFSTGDC
GDDNPANANT GFNCSPDSAG KQTEYPASSA WVTAVGGTAL KVGTGNSYQG EEPWGDYQTP
KANVPGNFAN GSFTGGAGGG TSTDITQPFY QRSAVPAALS QTAPNGAHLK TPMRTVPDVA
MDADPYTGMA TFRTAGGKPD WNPMGGTSLA APLFAGEAAL QMQAHGGVAP GFENPTIYAN
ANKFRDVTAN GGMYTLIPEG WSGSTLTQAQ EEIMGDDSSL KAGPGYDEAT GLGSPTLGYL
QAPYDANRVG RLAGGDRYQT GIQISQQQFP AAGTANNVVL AIGTNFPDAL AGAPLAKKVG
GPLLLTPGNT VDAQVVNEIH RVLKPGGKVY VLGGTAAITP AVVNGLKLPA AQITRIGGVD
RYDTAMQIAK AMGNPSHVVL ATGAGFADAL AAGPYASTVF ADNGNPAAIL LTNDKVMNPA
VAAYAHGAKA VSAVGAQAVT AAKNAHIANL TSFAGFDRYD TAAQVANTFH GEHIAGVATG
LNFADALTGA AQLGEAGGPL VLTNVTNLPA FSANALHGIG ASLGGAGLVE IFGGPVAINQ
ATEYAIAAAA NAIAEG
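
The sequences above are printed in blocks of 10 characters, so they need to be joined before any analysis. The following is a minimal sketch of how that might be done, together with a simple GC percentage calculation. The helper names `clean_sequence` and `gc_percent` are hypothetical, and the sketch assumes the sequence contains only unambiguous bases; it is not the method the database uses to derive its GC content field.

```python
def clean_sequence(text: str) -> str:
    """Join a block-formatted sequence by removing spaces and line breaks."""
    return "".join(text.split()).upper()


def gc_percent(text: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First two blocks of the gene sequence shown above.
fragment = "ATGTCCACCA CCGCAAAGAA"
joined = clean_sequence(fragment)
print(joined, len(joined))
print(round(gc_percent(fragment), 1))
```

Passing the full gene sequence to `gc_percent` would give a value to compare against the GC content field in the Gene Information section.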