Gene Caci_5805 details

Gene Information

Locus tag: Caci_5805
Symbol:
ID: 8337166
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Catenulispora acidiphila DSM 44928
Kingdom: Bacteria
Replicon accession: NC_013131
Strand:
Start bp: 6707573
End bp: 6710569
Gene Length: 2997 bp
Protein Length: 998 aa
Translation table: 11
GC content: 74%
IMG OID: 644958909
Product: Peptidase S53 propeptide
Protein accession: YP_003116504
Protein GI: 256394940
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG4934] Predicted protease
TIGRFAM ID:
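
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length; the helper names are hypothetical and not part of the source database.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 6707573
END_BP = 6710569
GENE_LENGTH_BP = 2997
PROTEIN_LENGTH_AA = 998


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Codon count minus one, assuming a single terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


computed_length = span_length(START_BP, END_BP)
computed_protein = expected_protein_length(computed_length)
print(computed_length, computed_protein)  # prints "2997 998"
```

Both computed values agree with the listed Gene Length and Protein Length, which is consistent with the stated assumptions.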


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 30
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCCAACG CCGCTCAGCG GCGGCGTGCG CAGGCCGTGA TGGCGGGGGT CTTCCCCGTC 
GCCATCGCGC TTGTCGCCGC GTCCGCGTCC GCCTCGCCCG CGCAGGCCGA CACGCAGCAC
CACAGCACCA CGAACAAGAT CCTGACCGAC GCGCACCCGG CATGGGCGAC GGCCGACAAG
GACCGGGGGG TCCTGCCGGC CGCGCAGCAG ATCAGCACCC GGGTGTACCT GACCGGTCAG
GACCAGGCGG GGCTGGCCGC GCTGGCGCGC GCCGCGTCCG ACCCGAGCAG CCCGGACTAC
CAGCACTACC TGACCCCGGC CCAGGTCCAG GCGCGCTTCG GTGCGACGCC GGCCCAGCTC
GCCGCGGTGC AGAAGTGGCT GACCGGCGCC GGGCTGAAGG TCTCCGCGGT GCAGAGCGAC
TGGATCGACG CGGTCGGTGA CTCCGCCGCC GTGCAGCGCG CCTTCGGCAC GCAGATCAAG
GACTACCAGG GCACCGACGG CTCGGTGAAG TACGCCGCGT CCTCCTCCGC GGTCATCCCG
GCCGCCGTCG CCGACTACGT CGCCGGCGTC TCGGGCCTGT CGCAGGCCGC CGTGCGCGTG
CACGCCGATT CGGCGAAGGT CAACGCGGCC AACGCCGCGA ACCAGAGCTG CTCGCCGTCC
TGGGGCGCGA ACACCTCCAC CGCGTGGCCG GCCGGCGTCA ACCCGGGCCC GACGCCCCTG
CTGCCGTGCT CCTACACCCC GAAGCAGCTG CGCGACGCCT ACGGCGTGAC CAAGTCCGGC
ATGACCGGCA AGGGCGCGAC CATCGCCGTC GTGGACTGGT TCGCCTCGGC CACCATGGAG
GGCGACGCCA ACCAGTTCGC GGTCGCCCAC GGGGACAAGC CGTTCGCGCC AGGTCAGTAC
TCTGAGATCA AGGACGCGTC TCAGTGGACC AACATCGACG CCTGCGGCGG CCAGGACAAC
GTGGCCGGCG AGGAGTCGCT GGACGTGGAG ATGGCGCACG GCCTCGCGCC GGACGCCAAC
GTCCTGTATG TCGGCGCGAA CTCCTGCACT GACGCGGACC TGATGTCCGC CGAGGAGAAC
ATCGTCGATC ATCACCTCGC CGACGTGGTC TCCAACTCCT GGGGCGAGAT CATGCACACC
ACCGACGGCG AGGACCTGGA CCCGTCGGAG ATAGCCGCCT ACGACCGCAT CTTCCAGAAG
GGCGCGGCTG AGGGCATCGG CTTCGACTTC TCCTCCGGCG ACTGCGGCGA CGACGACCCG
GCGAACTACG CCGGCGGCGG CGCCAACTGC GCCGGCGACT CGGCGCGCAA GCAGACCGAG
TGGCCGACCA GCGACGCCTG GGTCACCTCG GTCGGCGGCA CCACGATGGC CACCAACGCG
CAGGGCGGCT ACGCCTGGGA AGCCGCGATG GGCGACCATG TCGGCGTCGC CGCGCAGGGC
GCCCCCAACT GGCAGCCGCC GGCCGGACGG GCGACCGTCC CGTTCTCGTT CTACTTCGGC
GGCGGCGGCG GTACCTCCGA GGACATCGCG CAGCCCTTCT ACCAGGCCGG TATCGTGCCG
AGCGCGCTGG CCAACGGCGG GCACGACAGC ACCCGCGCGA TGCGCACCGT CCCGGACGTG
GCGATGAACG GCGCGCTGGC GACCTCGGTG CTGGTCGGCA TGACCAGCGG CGCCACCTAC
AGTGAGGGCG GCTACGGCGG CACCTCGGTG GCCGCGCCGG AGTTCTCCGC GCTGCAGGCC
GACGCCAAGC AGGCCGCGGG CCACGCGCTG GGCTTCGCGA ACCCGTCGCT GTACGCGCTG
AACGGCGGCT CGGCGTTCCA CGACGTCACC GCGCACCCGG CGGGCCAGCC GCAGGTGATC
GAGGGTATCC ACGTCTCGAC CGCCGACCCG ACCCGCGGCA CGATGTACCA CGCCGGCCAG
GACACCTCGC TGGTCGCCGC CGCGGGTTAC GACGACGCCA CCGGCCTGGG CTCCCCGGCG
GACGACTACC TGGCGAAGGT CGCCACGGTC ACGCCGCTGC AGCCGCCGGC GCCGCCGACC
AACCCCGGCT CGCCGAACGC GCCGGTCGTC AAGCGGATCG CCGGCGGCGA CCGCTACGGC
ACCGCGATCA GCGTCTCGCA GTCCTCCTTC CCGAAGGCGG GCTCGGCCTC GGCCGTGGTC
CTGGCCACCG GCGAGACGTT CCCCGACGCG CTGTCCGGCG CGCCGCTGGC CACCAAGCTC
GGCGGCCCGC TGCTGCTGAC CCCGTCGAAG ACCGTCGACC CGGCCGTGGT CGCCGAGATC
CACCGGGTCC TGGCGCCCGG CGGCAAGGTC TACGTCCTGG GCGGCGTGAA CGCGGTCTCC
GACAAGGTGG TCGCCGGTCT CGGCCTGCCC GGCGCGCAGG TCTCCCGGGT CTCCGGCTCG
GACCGCTTCG CCACCTCGCT GGCGATCGCC GAGCAGCTGG GCAACCCGAC CGGCAACGTG
ATCCTGGCGA CCGGCGACGA CTTCGCCGAC GCCCTGACCG CCGCGCCGTT CTCGGCGGTC
TACGGCGGTC CCACCGGCGG CCCGGCGGCG ATCCTGCTGA CCGACAACCG CAAGCTGCCG
CCGGCGGTGG CGAGCTACGT GGCCGGCGCG CACGCGGTGG CGGCGGTCGG CGTCCAGGCC
ACCGTGGCCG ACGCCGGTCT GAAGAACCGC GACGCCAGCG CCCAGTTCGG CGGCACGGAC
CGCTTCGCGA CCGGTGCGAT GGTGGCCGGG CGGTTCAGCG CGCCGAAGAC CGTCGGCGTG
GCCACCGGCA CCCAGTTCGC CGACGCCCTG ACCGGCGCGG CGATGCTGGC CGCGGCGCAC
AGCCCGCTGC TGCTCACCCA GCCGACGGCG CTGCCGGCCA GCACCGCGGC GGTGCTGCAC
GGCTTCAGCC AGGCGCTGGC GGGCGGCTCG ATCGAGCTGT TCGGCGGCCG GGTGGCGGTC
TCCGACGGCG TCGAGCAGCA GGTCGCCAAG GCGGTCGGCG GTCGCGTCGA GTCGTAA
 
Protein sequence
MPNAAQRRRA QAVMAGVFPV AIALVAASAS ASPAQADTQH HSTTNKILTD AHPAWATADK 
DRGVLPAAQQ ISTRVYLTGQ DQAGLAALAR AASDPSSPDY QHYLTPAQVQ ARFGATPAQL
AAVQKWLTGA GLKVSAVQSD WIDAVGDSAA VQRAFGTQIK DYQGTDGSVK YAASSSAVIP
AAVADYVAGV SGLSQAAVRV HADSAKVNAA NAANQSCSPS WGANTSTAWP AGVNPGPTPL
LPCSYTPKQL RDAYGVTKSG MTGKGATIAV VDWFASATME GDANQFAVAH GDKPFAPGQY
SEIKDASQWT NIDACGGQDN VAGEESLDVE MAHGLAPDAN VLYVGANSCT DADLMSAEEN
IVDHHLADVV SNSWGEIMHT TDGEDLDPSE IAAYDRIFQK GAAEGIGFDF SSGDCGDDDP
ANYAGGGANC AGDSARKQTE WPTSDAWVTS VGGTTMATNA QGGYAWEAAM GDHVGVAAQG
APNWQPPAGR ATVPFSFYFG GGGGTSEDIA QPFYQAGIVP SALANGGHDS TRAMRTVPDV
AMNGALATSV LVGMTSGATY SEGGYGGTSV AAPEFSALQA DAKQAAGHAL GFANPSLYAL
NGGSAFHDVT AHPAGQPQVI EGIHVSTADP TRGTMYHAGQ DTSLVAAAGY DDATGLGSPA
DDYLAKVATV TPLQPPAPPT NPGSPNAPVV KRIAGGDRYG TAISVSQSSF PKAGSASAVV
LATGETFPDA LSGAPLATKL GGPLLLTPSK TVDPAVVAEI HRVLAPGGKV YVLGGVNAVS
DKVVAGLGLP GAQVSRVSGS DRFATSLAIA EQLGNPTGNV ILATGDDFAD ALTAAPFSAV
YGGPTGGPAA ILLTDNRKLP PAVASYVAGA HAVAAVGVQA TVADAGLKNR DASAQFGGTD
RFATGAMVAG RFSAPKTVGV ATGTQFADAL TGAAMLAAAH SPLLLTQPTA LPASTAAVLH
GFSQALAGGS IELFGGRVAV SDGVEQQVAK AVGGRVES
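
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It assumes that translation table 11 uses the same codon-to-amino-acid assignments as the standard genetic code (the two differ in which codons may act as start codons, which does not matter here because this gene begins with ATG). The function name and the choice of test fragments are illustrative; only the first three and the last four codon groups of the gene sequence are translated.

```python
# Standard codon table built from the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in this record) is ignored.
    Codons containing characters other than A, C, G, T raise KeyError.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases and last 12 bases of the gene sequence in this record.
first_bases = "ATGCCCAACG CCGCTCAGCG GCGGCGTGCG"
last_bases = "GTCGAGTCGTAA"

print(translate(first_bases))  # prints "MPNAAQRRRA"
print(translate(last_bases))   # prints "VES"
```

The two outputs match the first ten and the last three residues of the protein sequence listed above.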