Gene Caci_6409 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_6409 
Symbol 
ID8337772 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp7390062 
End bp7393145 
Gene Length3084 bp 
Protein Length1027 aa 
Translation table11 
GC content73% 
IMG OID644959510 
ProductPeptidase S53 propeptide 
Protein accessionYP_003117104 
Protein GI256395540 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG4934] Predicted protease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value0.276013 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCACGTTC GCCGACCCCG GTGGCGTCGC GCGGCCGGGA TCGCCGCCGC GATCGCCGTG 
TCCGCGACGG CGGTCCCGGC CGCCCAGGCA GCAGCCGGAT CGGCGCCGCC GACGGCACCG
GTCAGGATCG GCGCCGCGCC CCGTCTGCCG GCCCGCACCG TCCCCGCTGC GACGCCCGCC
GACTCCCAGC CGCTGCACCT GCGGATCGGG CTGGCGCCGC GCGATCCGGC CGGACTGGCC
GCTTTCGTGG CCGCGGTGAG CGACCCGAAG TCGCGCCAGT ATAGGCACTA CCTCGCGCCC
GGCGAGTTCG GCAGCCGCTT CGGCGCGACA CCGGCCGCCC TCGCCGCGGT CGGCGCCGAG
CTGCGCTCGA TCGGGCTCTC CCCCGGCGCG ACCGACGGCA GCGGCGAATC CATCGCCGTG
GACACCACCG TCGGCGCCGC GAAATCCATC CTGCACACCG GATTCTCCGG CTACACCACC
GCGGACGGCC GCCACGCCTT CGCGAACACC GCCGCCCCGG CGCTGCCGCC GGCCCTCGCT
TCGGCGATCA CCGGCATCAC CGGCCTGGAC GATCTGACGG CGCCGCGTCC GCAGTTCGTC
AGCTCGAACC TCGGGAGCCA AGCGGCGCTC ACCCCGCAGG CGCCCTCACC GGATTTCGCG
GCACCCGGGC TGTGTCCTCA GGAAACCAGC ATCCTGGCAG GCCAGGGCAA GACCGACGGC
CAGCAGTACT ACGACCCCGC AGACCTCAAC GCCATCTACG GCAACCGCGA CCAGTACATC
GTCGGCGACT ACGGCGAGGG CGTCACCGTC GCGCTGCCGG AGTTCGAGGG CTTCACCCCG
GCGGCGATCG CGCAGTACCA GAACTGCGTC ACCCCGCAGA ACACCAACGC CGGCACGCCC
GGCGGGCTGA CGACGATCGC CGTGGACGGC GGGCCCACCA CGGCGGCGCC GACCGCGGCG
AACCAGGTCG GCATGGAGAC CGCGCTGGAC GTCGAGACGA TCAACGGCAC CGCGCCCGGC
GCGGCCGTCC GGGATTATGA GGGCCCTGAT ACGGCCGTCG GCGCCCTGGA CACCTACCAG
CGGATCGTCA CCGACGACCA GGCCCAGGTC ATCGCGATCA GCTGGAGCGT CTGCGAGCTC
GACGCCGCCC CGGCGACCCT CGCCGCCGAG AACACGATCT TCCAGCAGGC CGCCGCCCAG
GGACAGTCGG TGCTGGCCGC GACCGGGGAC AACGGCTCGA CGGCGTGCCC GGCGAGCAGT
CCGCACGCCG CCACCCCGGC GGTGTCCGAC CCCGCCTCGC AGCCCTCGGT CACCGCGGTC
GGCGGCACCA CGATGAGCGG TGGCGGCGGC GCCGGGCTGA CCACCTGGAA CACTTCCGGC
GGCGCGACGG GCGGCGGCGT GTCCACCGTG TGGAAGCAGG ATGCGATCGG CGCCGCGTAT
CAGGCCGGCC ACACCGGGCC CGGGTACGCC GACGCCTGCG GCGCGCCCGC CGGGACGACC
TGCCGCCAGG TCCCCGACGT CGCCGCGCAC GCCGGCGCCG ACGGGCTGAT CGTCGAGTAC
TACGCGACCG GCACGGCCGG GAGCTGGGCC ATCGTCGGCG GCACGAGCCT GGCGACCCAG
CTGTGGGCCG GTATCACAGC GCTCGCCGAC AGCAGCGACG ACTGCGCGGC CACCGGCCCG
ATCGGCCCGC TGAACCCCGC GCTCTACCAC GCCGCGGCCG ACGGCTCCGG ATCGTTCACC
GACGTCACCA CCGGCAGCAA CGCCAGACCC GCATCCGGCT ACACCGGCGC GCTTTACACC
GCCGGTCCCG GCTATGACCT CACCACCGGA CTCGGCGCCC CGCTGACCGA CGGCCTCGTG
CCGGACCTGT GCGGCTACCC CGCCCCGGCT GCCGGCAGCA CGTATCACCC GGTGAGCCCG
GCACGGATCC TGGACACCCG CAACGGCACC GGAGCCGGCG GCAGGATCGC GCAGGTCCCG
GCCGGCGGAA CCGTCACGCT GGCCGTCACC GGCGCGCACG GCGTGCCGGC GAGCGGGGTC
ACCGGCACGG CGCTGAACCT CACCGCGGTC AGCAGCTTGG GCGGGTACCT TTCGGTCTCC
GCGCACGGCA CCGCGCGTCC GCTGTCCTCC ACCGTCAACT ACGGGGCGAA CCGGCCGGCC
TCCAACTTCG TCACCGTCGC GGTCCCCGCC GGCGGCCAGA TCGACATCAC CAACCACGGG
ACCGCCGGCG CCGACGTCAT CGCCGACCTG TCGGGGTACT ACTCCGCGAC CCTGGCCGGC
GGTTCCACCT ACACCGCGGT CAACCCGTTC CGGATCCTCG ACACGCGCGT GCCGAACGGT
GTCCCGGCCA AGGCTCCGGT GGCGGCACAC GGGACGCTGG CGTTGCAGAT CAGCGGCACG
CACGGCATAC CGGCGACCGG CGTCACCGCG GTGGCGCTGA ACATCCAGGT CGCCGACGAC
GCCTCGGGCG GCGACCTGAT CGCCTACGCC GACGGGACGA CGCAGCCGTT GGCCTCGAAC
GACAACTGGG CCGCGGGTCA GACCGTGTCG GACTTCGCCC TTGTACCGGT CGGCGCCGAC
GGCAAGATCG ACCTCTACAA CAGCAGCCCC GGCTCGGCCA ACGTCATCGC CGACTTCGCG
GGCTACTCCA CGACCGCGGC CACCGGACTG AAGTTCCATC CCATCCGGGC GGCCCGGCTG
CTCGACACCC GTGACGGCAC CGGCGTCAAC AGCACCGCCG GGCAGCCCTA CCAGATCCCG
GCGGGCGGCA GCTTCACCGT GAACCTGGAC CCGGTCGCGC TTCTGCTGGG CGGCAATCCG
CACAACGCCG TCGCGACCGC TCCGGCCGTC GCCCTGAACC TCACCGTCGC CTCACCGAGC
AGCGGCGGCT ACATCACGGT CTATCCCGAC GGCCAGAGCC TGCCCGCCGT GGTCCCGGCC
GCCGTCGACT TCACCGCCGG CCAGACCACC GCCGACGCCG CGATACTCCC GGTCGGCGCC
GACGGCGGAC TGACTGTCAC CAACCACAGC GGCGGAACGA TCCAATTGGT CATCGATCTC
ATCGGCTACT ACGGCACGAC CTGA
 
Protein sequence
MHVRRPRWRR AAGIAAAIAV SATAVPAAQA AAGSAPPTAP VRIGAAPRLP ARTVPAATPA 
DSQPLHLRIG LAPRDPAGLA AFVAAVSDPK SRQYRHYLAP GEFGSRFGAT PAALAAVGAE
LRSIGLSPGA TDGSGESIAV DTTVGAAKSI LHTGFSGYTT ADGRHAFANT AAPALPPALA
SAITGITGLD DLTAPRPQFV SSNLGSQAAL TPQAPSPDFA APGLCPQETS ILAGQGKTDG
QQYYDPADLN AIYGNRDQYI VGDYGEGVTV ALPEFEGFTP AAIAQYQNCV TPQNTNAGTP
GGLTTIAVDG GPTTAAPTAA NQVGMETALD VETINGTAPG AAVRDYEGPD TAVGALDTYQ
RIVTDDQAQV IAISWSVCEL DAAPATLAAE NTIFQQAAAQ GQSVLAATGD NGSTACPASS
PHAATPAVSD PASQPSVTAV GGTTMSGGGG AGLTTWNTSG GATGGGVSTV WKQDAIGAAY
QAGHTGPGYA DACGAPAGTT CRQVPDVAAH AGADGLIVEY YATGTAGSWA IVGGTSLATQ
LWAGITALAD SSDDCAATGP IGPLNPALYH AAADGSGSFT DVTTGSNARP ASGYTGALYT
AGPGYDLTTG LGAPLTDGLV PDLCGYPAPA AGSTYHPVSP ARILDTRNGT GAGGRIAQVP
AGGTVTLAVT GAHGVPASGV TGTALNLTAV SSLGGYLSVS AHGTARPLSS TVNYGANRPA
SNFVTVAVPA GGQIDITNHG TAGADVIADL SGYYSATLAG GSTYTAVNPF RILDTRVPNG
VPAKAPVAAH GTLALQISGT HGIPATGVTA VALNIQVADD ASGGDLIAYA DGTTQPLASN
DNWAAGQTVS DFALVPVGAD GKIDLYNSSP GSANVIADFA GYSTTAATGL KFHPIRAARL
LDTRDGTGVN STAGQPYQIP AGGSFTVNLD PVALLLGGNP HNAVATAPAV ALNLTVASPS
SGGYITVYPD GQSLPAVVPA AVDFTAGQTT ADAAILPVGA DGGLTVTNHS GGTIQLVIDL
IGYYGTT