Gene Cagg_0721 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_0721 
Symbol 
ID7266973 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp894981 
End bp900476 
Gene Length5496 bp 
Protein Length1831 aa 
Translation table11 
GC content68% 
IMG OID643565572 
Productpeptidase C14 caspase catalytic subunit p20 
Protein accessionYP_002462081 
Protein GI219847648 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.137952 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones43 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTGCCG CCACTTCCCA CCGCCACGCC CTGATTGTCG GCGTTGACCA CACCGCTGAT 
CCGCACCTTG CCTCCTTGAC CAGCGCCGAA CGCGACGCGC GCGCATTGGC AGCGGCGCTA
GAAGCGCCGG CCTGCGGCTT CACGGTGACG TTGCTGCTCG GCGCTGAAGC GACCGCTCAC
CGTATCCGGC GCGAGATTAC TGCGCTACGC CAGCAAGCGA AACAACACCC TCTTGACATG
ATTGTCGCCT TCTGCGGCCA TGGCGTACCG GTAGCGCTCG ATGGCGGAGG GCATGAGACG
TTTCTCGCCA CCCACGATTT CATCACCGAT GACGCAGCGT TAGATGCCAC CGCCTTCCTC
TCGCTGCGCT GGCTTTACCA GCAGGTCTAT AAAGCCACTG AACCACGCAG CGCGGTACTC
ATCCTCGATC ACTGCTATGC CGGCAACATT CGAGACGCTG GCAACAATCG GTTGATAGTA
GACCTGCGCG CAGCAATTGA ACAGTATCGC GCCGAACAAC GACCTACCCC CGTGCCTCAC
GAATGGCTAC GCGCGATCTT TCCCGCCACC CGTCCCGGCG AGCTGGCCGG CGAAACCGCC
GCCGGCGGGT TGTTGACACA GGCCATCCTC ACCGTATTGC GTGGCGAAAC CCGGCTCGGT
GATGGGCATA TCACCATCGG CCAGCTTGAC CACTATCTGA AACAGCAGTT TGCGGGGCAG
AGCCAGCAAC CATACACCCT GATCGAAGGC AACTATCCCT TGATACTAGC CGATTACCGC
GAGCGCATTG CTGCGGAGCG GGCTGCCGCC GAACAACAAC ACCGCCGCGC TGAAGCGCAG
GAAATGCTCA AACGCTGGCG CTCGCCTGAT AGCCACGCCC GCGCTGCCGA ACTACAGGCA
AGCTTTGTCG GGCGGGAAAA GGAACTGGAG GAGATCCAGC GGCATATCGA GCAGCTCCGC
CCTACCGGCG GCTACCTGCT CGTCACCGGG GTGGCCGGCC AAGGCAAGAG CAGCATCCTG
GCGCGCTTGA CCAGCACAAC CGATCCCCCC CTCCCAGCCT ACTTTATTCG CTTCATGCCC
GGCCCCGACG AGCAAGCCGC CTTGCTCGGC CACCTGATTG CCGAACTGCT GACCCAGCAC
GGCCTGAGCG ACGAGGCGTT GACCTATCTG GCCGATAATG CCAGCCCGAT CACGCTGCGC
AACAGCTTGA TAGCGCTGCT CGACCGGGTG GCCCACAAGC ACCCGCTGAC CCTGATGATC
GATGGCCTTG ACCAAATCCC GGTTGATCGA AACCTCGGCC AGCGCGATCT CAGCTTTTTG
CCTGAACAGT TGCCGAAAGG GGTGGTGTTC GTAATCGGCA CCCGTCCGGA TGATACGCTG
GTGCCGCTCA AACTGCTCAC CTCGTGCCGC GAATACCCCC TGCCGCCGCT CAGCCTCGCC
GACTTCACCA CCTTGCTTGA CCGGCGTGGG GTCGCGTTGA GCGAGGCCGA CCGCGCGGAG
CTGTATACGG CGCTGCACGG CAATGCCTTC GACCTCGCCT TCTTGGCACA GGAATTGCAG
CGCCAACCCA ACATTGCCAA TGCGCAGGCG CTGATCCGGC AGGTGATCGC CAACCCACGC
AATATTTTCG TTCCGACCAT TGAGCGATTG AAGCGCGAAC TGCTCTGGGA TAGGGCGATT
AAACCGATCC TCGGTGTACT AACGGCGGCG CAGGAACCGC TCAGCTCGCC GGCCCTCACC
GGCATTTTAG AACTGAATCA CGACCTGGTG GAAGACGCGA TAACGCTACT GCGCGGCCTA
CTCGGCGAAC GTGACCGGGC GGGGCAAGCG CGCTATTTCT TGCTTCACCT GAAACTGCTC
GACTATTTGC GCAGCAAATT GTTCCCCGAT AACGAGCTAG CCACCTTCCA CAACCGCTTA
GCGCATTGGT GTGAACGCGA TCTCGACCGT CTCTGGCAAT CGGCCAACGA CCAACCCGAA
GCAGAGCGGC GCGCCTACGC CCAAACCCAC CTCGTCGACC ACTTCGTCCA CGCCAAAGCC
TACGACCGGC TGTGGCAGTT GCTCGATGCC GATGAGTATG GCGCGGCCAA GCGGCAGGCC
GACCCCAGCC TGCGCCGCTA TGCCCTTGAC CTTGACCTCG CCCGGCGCGC GGTGGTCGCG
GCTGCCGGCG ACGATATTCC TGCGCTGGCG CGCAGTTTGC CGCGCCTCTG GCAATACAGC
CTGCTCCGCT GCTCGCTCAC CAGCCAAATC GACCGATGGC CGGTCGCACT GTTTACCGCG
CTGGTGGCGC TCGGCCGCAG CGCCGAAGCC CGCGACCGGG CCGAACTGCT GAGCAATCCG
AAACACCGCG CCAGCGTGTT GCTGGCGATT GGCCACGCGC TGCTCAATCG TGGCGAGCCG
GAAGAGGCGT TGGCGGTGTG GCGAGGGGTG CGCAAGGCGG TTGATGCGAT ATCTGATGCT
GAAGAGCGGT TCACACAGTT GCATGAGTTG GCCGTGGCCT TCACCCAGGC GGGATGTGCA
GAAGAAGCGG ATGCGGTCTT TGCACAAGCC CGCCACGCCG CCGACGCGAT TGCAGACAAC
GATGACCGCA CAAGGGCGCT GACCGCACTG GCCACCGCCC TCGCCCGCGC CGGGCAGTTC
ACCGCCGCCC GCCACGCCGC CGACGCGATT GCTGACAACG ATGACCGCGC CAAGGCGCTG
ACCGAACTGG CCACCGCCCT CGCCCGCGCC GGGCAGTTCA CCGCCGCCCG CCACGCCGCC
GACGCGATTG ATAGCGAATA TCTCCGCGCA AGGGCGCTGA CCGAACTTGC CACCGCCCTC
GCCCACGCCG GGCAGTTCAC CGCCGCCCGC CACGCCGCCG ACGCGATTGA TAGCGAATAT
CGCCGCGCCG AGGCGCTGAC CGAACTGGCC ACCGCCCTCG CCCGCGCCGG GCAGTTCACC
GCCGCCCGCC ACGCCGCCGA CGCGATTGAT AGCGAATATC TCCGCGCATG GGCGCTGACC
GAACTGGCCA CCGCCCTCGC CCGCGCCGGG CACGCACAGG CAGCCGAAGA TACCTTTACC
GCCGCCCGCC ACGCCGCCGA CGCGATTGCT GACAACGCTG CCCGCGCCGA GGCGCTGACC
GCACTGGCCA CCGCCCTCGC CCACGCCGGG CACGCACAGG CAGCCGAAGA TACCTTTACC
GCCGCCCGCC ACGCCGCCGA CGCGATTGAT AGCGAATATC GCCGCGCCGA GGCGCTGACC
GAACTGGCCA CCGCCCTCGC CCACGCCGGG CACGCACAGG CAGCCGAAGA TACCTTTACC
GCCGCCCGCC ACGCCGCCGA CGCGATTGCT GACAACCGTA CCCGCGCATG GGCGCTGACC
AATCTTGCCA CCGCCCTCGC CCGCGCCGGG CAGTTCACCG CCGCCCGCCA CGCCGCCGAC
GCGATTGCTG ACAACGCTGA CCGCGCAAGG GCGCTGACCA AACTTGCCAC CGCCCTCGCC
CACGCCGGGC ATGTCACCGC CGCCCGCCAC GCCGCCGACG CGATTGCTGA CAACGATGAC
CGCGCCGAGG CGCTGACCAA ACTTGCCACC GCCCTCGCCC ACGCCGGGCA TGTCACCGCC
GCCCGCCACG CCGCCGACGC GATTGATAGC GAATATCACC GCGCCAGGGC GCTGACCGAA
CTGGCCACCG CCCTCGCCCG CGCCGGGCAT GTCACCGCCG CCCGCCACGC CGCCGACGCG
ATTGCTGACA ACGATGACCG CGCCAAGGCG CTGACCGCCC TTGCCACCGC CCTCGCCCGC
GCCGGGCAGT TCACCGCCGC CCGCCACGCC GCCGACGCGA TTGCTGACAA CGATGACCGC
GCCGAGGCGC TGACCAAACT TGCCACCGCC CTCGCCCACG CCGGGCATGT CACCGCCGCC
CGCCACGCCG CCGACGCGAT TGCTGACAAC GATGACCGCG CCGAGGCGCT GACCAAACTT
GCCACCGCCC TCGCCCACGC CGGGCAGTTC ACCGCCGCCC GCCACGCCGC CGACGCGATT
GATAGCGAAT ATCACCGCGC CAGGGCGCTG ACCGCCCTTG CCACCGCCCT CGCCCACGCC
GGGCAGTTCA CCGCCGCCCG CCACGCCGCC GACGCGATTG ATAGCGAATA TCACCGCGCC
AGGGCGCTGA CCGCCCTTGC CACCGCCCTC GCCCACGCCG GGCACGCACA GGCAGCCGAA
GATACCTTTA CCGCCGCCCG CCACGCCGCC GACGCGATTG CTGACAACGC TGACCGCGCA
AGGGCGCTGA CCAATCTTGC CACCGCCCTC GCCCGCGCCG GGCACGCACA GGCAGCCGAA
GATACCTTTA CCGCCGCCCG CCACGCCGCC GACGCGATTG CTGACAACGC TGACCGCGCC
GAGGCGCTGA CCGCCCTTGC CACCGCCCTC GCCCACGCCG GGCATGTCAC CGCCGCCCGC
CACGCCGCCG ACGCGATTGC TGACAACCGT ACCCGCGCAT GGGCGCTGAC CAATCTTGCC
ACCGCCCTCG CCCGCGCCGG GCAGTTCACC GCCGCCCGCC ACGCCGCCGA CGCGATTGCT
GACAACGCTG ACCGCGCAAG GGCGCTGACC AAACTTGCCA CCGCCCTCGC CCACGCCGGG
CACGCACAGG CAGCCGAAGA TACCTTTACC GCCGCCCGCC ACGCCGCCGA CGCGATTGCT
GACAACGCTG ACCGCGCCGA GGCGCTGACC GAACTGGCCA CCGCCCTCGC CCACGCCGGG
CAGTTCACCG CCGCCCGCCA CGCCGCCGAC GCGATTGCTG ACAACCGTGC CCGCGCATGG
GCGCTGACCA ATCTTGCCAC CGCCCTCGCC CGCGCCGGGC ACGCACAGGC AGCCGAAGAT
ACCTTTACCG CCGCCCGCCA CGCCGCCGAC GCGATTGCTG ACAACCGTGC CCGCGCAAGG
GCGCTGACCA ATCTTGCCAC CGCCCTCGCC CACGCCGGGC ACGCACAGGC AGCCGAAGAT
ACCTTTACCG CCGCCCGCCA CGCCGCCGAC GCGATTGCTG ACAACGCTGC CCGCGCCGAG
GCGCTGACCA ATCTTGCCAC CGCCCTCGCC CACGCCGGGC ACGCACAGGC AGCCGAAGAT
ACCTTTACCG CCGCCCGCCA CGCCGCCGAC GCGATTGCTG ACAACGCTGA CCGCGCAAGG
GCGCTGACCA ATCTTGCCAC CGCCCTCGCC CACGCCGGGA ACGCACAGGC AGCCGAAGAT
ACCTTTACCG CCGCCCGCCA CGCCGCCGAC GCGATTGCTG ACAACGATGA CCGCGCCAGG
GCGCTGACCA ATCTTGCCAC CGCCCTCGCC AACACTGACC GCATCGCCGA GGTCGTGTTA
CTCGTTGCCG AGGTATGGCA AAATGCGCAG ACCCGCGCTG AATTGCTACT CTTGTTTGCA
ATGGCGACAA TGTTGCTCCG GGCGTACCCG GAGATCGGGG CCGGTTTTGT GCAGGCGTTT
GCGTGGGTCG AAGAAGCGAT GCGGCGGGGT ATGTAG
 
Protein sequence
MSAATSHRHA LIVGVDHTAD PHLASLTSAE RDARALAAAL EAPACGFTVT LLLGAEATAH 
RIRREITALR QQAKQHPLDM IVAFCGHGVP VALDGGGHET FLATHDFITD DAALDATAFL
SLRWLYQQVY KATEPRSAVL ILDHCYAGNI RDAGNNRLIV DLRAAIEQYR AEQRPTPVPH
EWLRAIFPAT RPGELAGETA AGGLLTQAIL TVLRGETRLG DGHITIGQLD HYLKQQFAGQ
SQQPYTLIEG NYPLILADYR ERIAAERAAA EQQHRRAEAQ EMLKRWRSPD SHARAAELQA
SFVGREKELE EIQRHIEQLR PTGGYLLVTG VAGQGKSSIL ARLTSTTDPP LPAYFIRFMP
GPDEQAALLG HLIAELLTQH GLSDEALTYL ADNASPITLR NSLIALLDRV AHKHPLTLMI
DGLDQIPVDR NLGQRDLSFL PEQLPKGVVF VIGTRPDDTL VPLKLLTSCR EYPLPPLSLA
DFTTLLDRRG VALSEADRAE LYTALHGNAF DLAFLAQELQ RQPNIANAQA LIRQVIANPR
NIFVPTIERL KRELLWDRAI KPILGVLTAA QEPLSSPALT GILELNHDLV EDAITLLRGL
LGERDRAGQA RYFLLHLKLL DYLRSKLFPD NELATFHNRL AHWCERDLDR LWQSANDQPE
AERRAYAQTH LVDHFVHAKA YDRLWQLLDA DEYGAAKRQA DPSLRRYALD LDLARRAVVA
AAGDDIPALA RSLPRLWQYS LLRCSLTSQI DRWPVALFTA LVALGRSAEA RDRAELLSNP
KHRASVLLAI GHALLNRGEP EEALAVWRGV RKAVDAISDA EERFTQLHEL AVAFTQAGCA
EEADAVFAQA RHAADAIADN DDRTRALTAL ATALARAGQF TAARHAADAI ADNDDRAKAL
TELATALARA GQFTAARHAA DAIDSEYLRA RALTELATAL AHAGQFTAAR HAADAIDSEY
RRAEALTELA TALARAGQFT AARHAADAID SEYLRAWALT ELATALARAG HAQAAEDTFT
AARHAADAIA DNAARAEALT ALATALAHAG HAQAAEDTFT AARHAADAID SEYRRAEALT
ELATALAHAG HAQAAEDTFT AARHAADAIA DNRTRAWALT NLATALARAG QFTAARHAAD
AIADNADRAR ALTKLATALA HAGHVTAARH AADAIADNDD RAEALTKLAT ALAHAGHVTA
ARHAADAIDS EYHRARALTE LATALARAGH VTAARHAADA IADNDDRAKA LTALATALAR
AGQFTAARHA ADAIADNDDR AEALTKLATA LAHAGHVTAA RHAADAIADN DDRAEALTKL
ATALAHAGQF TAARHAADAI DSEYHRARAL TALATALAHA GQFTAARHAA DAIDSEYHRA
RALTALATAL AHAGHAQAAE DTFTAARHAA DAIADNADRA RALTNLATAL ARAGHAQAAE
DTFTAARHAA DAIADNADRA EALTALATAL AHAGHVTAAR HAADAIADNR TRAWALTNLA
TALARAGQFT AARHAADAIA DNADRARALT KLATALAHAG HAQAAEDTFT AARHAADAIA
DNADRAEALT ELATALAHAG QFTAARHAAD AIADNRARAW ALTNLATALA RAGHAQAAED
TFTAARHAAD AIADNRARAR ALTNLATALA HAGHAQAAED TFTAARHAAD AIADNAARAE
ALTNLATALA HAGHAQAAED TFTAARHAAD AIADNADRAR ALTNLATALA HAGNAQAAED
TFTAARHAAD AIADNDDRAR ALTNLATALA NTDRIAEVVL LVAEVWQNAQ TRAELLLLFA
MATMLLRAYP EIGAGFVQAF AWVEEAMRRG M