Gene Information |
Locus tag | Cagg_0721 |
Symbol | |
ID | 7266973 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chloroflexus aggregans DSM 9485 |
Kingdom | Bacteria |
Replicon accession | NC_011831 |
Strand | + |
Start bp | 894981 |
End bp | 900476 |
Gene Length | 5496 bp |
Protein Length | 1831 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 643565572 |
Product | peptidase C14 caspase catalytic subunit p20 |
Protein accession | YP_002462081 |
Protein GI | 219847648 |
COG category | |
COG ID | |
TIGRFAM ID | |
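The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes 1-based, inclusive coordinates and that the gene length includes a single terminal stop codon; the function name `cds_lengths` is hypothetical and not part of the record.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and one stop codon
    that is counted in the gene length but not in the protein.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1  # drop the stop codon
    return gene_len, protein_len


# Values taken from the Start bp and End bp rows of this record.
print(cds_lengths(894981, 900476))  # (5496, 1831)
```

Under those assumptions the result agrees with the Gene Length (5496 bp) and Protein Length (1831 aa) rows.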
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.137952 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 43 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCTGCCG CCACTTCCCA CCGCCACGCC CTGATTGTCG GCGTTGACCA CACCGCTGAT CCGCACCTTG CCTCCTTGAC CAGCGCCGAA CGCGACGCGC GCGCATTGGC AGCGGCGCTA GAAGCGCCGG CCTGCGGCTT CACGGTGACG TTGCTGCTCG GCGCTGAAGC GACCGCTCAC CGTATCCGGC GCGAGATTAC TGCGCTACGC CAGCAAGCGA AACAACACCC TCTTGACATG ATTGTCGCCT TCTGCGGCCA TGGCGTACCG GTAGCGCTCG ATGGCGGAGG GCATGAGACG TTTCTCGCCA CCCACGATTT CATCACCGAT GACGCAGCGT TAGATGCCAC CGCCTTCCTC TCGCTGCGCT GGCTTTACCA GCAGGTCTAT AAAGCCACTG AACCACGCAG CGCGGTACTC ATCCTCGATC ACTGCTATGC CGGCAACATT CGAGACGCTG GCAACAATCG GTTGATAGTA GACCTGCGCG CAGCAATTGA ACAGTATCGC GCCGAACAAC GACCTACCCC CGTGCCTCAC GAATGGCTAC GCGCGATCTT TCCCGCCACC CGTCCCGGCG AGCTGGCCGG CGAAACCGCC GCCGGCGGGT TGTTGACACA GGCCATCCTC ACCGTATTGC GTGGCGAAAC CCGGCTCGGT GATGGGCATA TCACCATCGG CCAGCTTGAC CACTATCTGA AACAGCAGTT TGCGGGGCAG AGCCAGCAAC CATACACCCT GATCGAAGGC AACTATCCCT TGATACTAGC CGATTACCGC GAGCGCATTG CTGCGGAGCG GGCTGCCGCC GAACAACAAC ACCGCCGCGC TGAAGCGCAG GAAATGCTCA AACGCTGGCG CTCGCCTGAT AGCCACGCCC GCGCTGCCGA ACTACAGGCA AGCTTTGTCG GGCGGGAAAA GGAACTGGAG GAGATCCAGC GGCATATCGA GCAGCTCCGC CCTACCGGCG GCTACCTGCT CGTCACCGGG GTGGCCGGCC AAGGCAAGAG CAGCATCCTG GCGCGCTTGA CCAGCACAAC CGATCCCCCC CTCCCAGCCT ACTTTATTCG CTTCATGCCC GGCCCCGACG AGCAAGCCGC CTTGCTCGGC CACCTGATTG CCGAACTGCT GACCCAGCAC GGCCTGAGCG ACGAGGCGTT GACCTATCTG GCCGATAATG CCAGCCCGAT CACGCTGCGC AACAGCTTGA TAGCGCTGCT CGACCGGGTG GCCCACAAGC ACCCGCTGAC CCTGATGATC GATGGCCTTG ACCAAATCCC GGTTGATCGA AACCTCGGCC AGCGCGATCT CAGCTTTTTG CCTGAACAGT TGCCGAAAGG GGTGGTGTTC GTAATCGGCA CCCGTCCGGA TGATACGCTG GTGCCGCTCA AACTGCTCAC CTCGTGCCGC GAATACCCCC TGCCGCCGCT CAGCCTCGCC GACTTCACCA CCTTGCTTGA CCGGCGTGGG GTCGCGTTGA GCGAGGCCGA CCGCGCGGAG CTGTATACGG CGCTGCACGG CAATGCCTTC GACCTCGCCT TCTTGGCACA GGAATTGCAG CGCCAACCCA ACATTGCCAA TGCGCAGGCG CTGATCCGGC AGGTGATCGC CAACCCACGC AATATTTTCG TTCCGACCAT TGAGCGATTG AAGCGCGAAC TGCTCTGGGA TAGGGCGATT AAACCGATCC TCGGTGTACT AACGGCGGCG CAGGAACCGC TCAGCTCGCC GGCCCTCACC GGCATTTTAG AACTGAATCA CGACCTGGTG GAAGACGCGA TAACGCTACT GCGCGGCCTA 
CTCGGCGAAC GTGACCGGGC GGGGCAAGCG CGCTATTTCT TGCTTCACCT GAAACTGCTC GACTATTTGC GCAGCAAATT GTTCCCCGAT AACGAGCTAG CCACCTTCCA CAACCGCTTA GCGCATTGGT GTGAACGCGA TCTCGACCGT CTCTGGCAAT CGGCCAACGA CCAACCCGAA GCAGAGCGGC GCGCCTACGC CCAAACCCAC CTCGTCGACC ACTTCGTCCA CGCCAAAGCC TACGACCGGC TGTGGCAGTT GCTCGATGCC GATGAGTATG GCGCGGCCAA GCGGCAGGCC GACCCCAGCC TGCGCCGCTA TGCCCTTGAC CTTGACCTCG CCCGGCGCGC GGTGGTCGCG GCTGCCGGCG ACGATATTCC TGCGCTGGCG CGCAGTTTGC CGCGCCTCTG GCAATACAGC CTGCTCCGCT GCTCGCTCAC CAGCCAAATC GACCGATGGC CGGTCGCACT GTTTACCGCG CTGGTGGCGC TCGGCCGCAG CGCCGAAGCC CGCGACCGGG CCGAACTGCT GAGCAATCCG AAACACCGCG CCAGCGTGTT GCTGGCGATT GGCCACGCGC TGCTCAATCG TGGCGAGCCG GAAGAGGCGT TGGCGGTGTG GCGAGGGGTG CGCAAGGCGG TTGATGCGAT ATCTGATGCT GAAGAGCGGT TCACACAGTT GCATGAGTTG GCCGTGGCCT TCACCCAGGC GGGATGTGCA GAAGAAGCGG ATGCGGTCTT TGCACAAGCC CGCCACGCCG CCGACGCGAT TGCAGACAAC GATGACCGCA CAAGGGCGCT GACCGCACTG GCCACCGCCC TCGCCCGCGC CGGGCAGTTC ACCGCCGCCC GCCACGCCGC CGACGCGATT GCTGACAACG ATGACCGCGC CAAGGCGCTG ACCGAACTGG CCACCGCCCT CGCCCGCGCC GGGCAGTTCA CCGCCGCCCG CCACGCCGCC GACGCGATTG ATAGCGAATA TCTCCGCGCA AGGGCGCTGA CCGAACTTGC CACCGCCCTC GCCCACGCCG GGCAGTTCAC CGCCGCCCGC CACGCCGCCG ACGCGATTGA TAGCGAATAT CGCCGCGCCG AGGCGCTGAC CGAACTGGCC ACCGCCCTCG CCCGCGCCGG GCAGTTCACC GCCGCCCGCC ACGCCGCCGA CGCGATTGAT AGCGAATATC TCCGCGCATG GGCGCTGACC GAACTGGCCA CCGCCCTCGC CCGCGCCGGG CACGCACAGG CAGCCGAAGA TACCTTTACC GCCGCCCGCC ACGCCGCCGA CGCGATTGCT GACAACGCTG CCCGCGCCGA GGCGCTGACC GCACTGGCCA CCGCCCTCGC CCACGCCGGG CACGCACAGG CAGCCGAAGA TACCTTTACC GCCGCCCGCC ACGCCGCCGA CGCGATTGAT AGCGAATATC GCCGCGCCGA GGCGCTGACC GAACTGGCCA CCGCCCTCGC CCACGCCGGG CACGCACAGG CAGCCGAAGA TACCTTTACC GCCGCCCGCC ACGCCGCCGA CGCGATTGCT GACAACCGTA CCCGCGCATG GGCGCTGACC AATCTTGCCA CCGCCCTCGC CCGCGCCGGG CAGTTCACCG CCGCCCGCCA CGCCGCCGAC GCGATTGCTG ACAACGCTGA CCGCGCAAGG GCGCTGACCA AACTTGCCAC CGCCCTCGCC CACGCCGGGC ATGTCACCGC CGCCCGCCAC GCCGCCGACG CGATTGCTGA CAACGATGAC CGCGCCGAGG CGCTGACCAA ACTTGCCACC GCCCTCGCCC ACGCCGGGCA TGTCACCGCC GCCCGCCACG 
CCGCCGACGC GATTGATAGC GAATATCACC GCGCCAGGGC GCTGACCGAA CTGGCCACCG CCCTCGCCCG CGCCGGGCAT GTCACCGCCG CCCGCCACGC CGCCGACGCG ATTGCTGACA ACGATGACCG CGCCAAGGCG CTGACCGCCC TTGCCACCGC CCTCGCCCGC GCCGGGCAGT TCACCGCCGC CCGCCACGCC GCCGACGCGA TTGCTGACAA CGATGACCGC GCCGAGGCGC TGACCAAACT TGCCACCGCC CTCGCCCACG CCGGGCATGT CACCGCCGCC CGCCACGCCG CCGACGCGAT TGCTGACAAC GATGACCGCG CCGAGGCGCT GACCAAACTT GCCACCGCCC TCGCCCACGC CGGGCAGTTC ACCGCCGCCC GCCACGCCGC CGACGCGATT GATAGCGAAT ATCACCGCGC CAGGGCGCTG ACCGCCCTTG CCACCGCCCT CGCCCACGCC GGGCAGTTCA CCGCCGCCCG CCACGCCGCC GACGCGATTG ATAGCGAATA TCACCGCGCC AGGGCGCTGA CCGCCCTTGC CACCGCCCTC GCCCACGCCG GGCACGCACA GGCAGCCGAA GATACCTTTA CCGCCGCCCG CCACGCCGCC GACGCGATTG CTGACAACGC TGACCGCGCA AGGGCGCTGA CCAATCTTGC CACCGCCCTC GCCCGCGCCG GGCACGCACA GGCAGCCGAA GATACCTTTA CCGCCGCCCG CCACGCCGCC GACGCGATTG CTGACAACGC TGACCGCGCC GAGGCGCTGA CCGCCCTTGC CACCGCCCTC GCCCACGCCG GGCATGTCAC CGCCGCCCGC CACGCCGCCG ACGCGATTGC TGACAACCGT ACCCGCGCAT GGGCGCTGAC CAATCTTGCC ACCGCCCTCG CCCGCGCCGG GCAGTTCACC GCCGCCCGCC ACGCCGCCGA CGCGATTGCT GACAACGCTG ACCGCGCAAG GGCGCTGACC AAACTTGCCA CCGCCCTCGC CCACGCCGGG CACGCACAGG CAGCCGAAGA TACCTTTACC GCCGCCCGCC ACGCCGCCGA CGCGATTGCT GACAACGCTG ACCGCGCCGA GGCGCTGACC GAACTGGCCA CCGCCCTCGC CCACGCCGGG CAGTTCACCG CCGCCCGCCA CGCCGCCGAC GCGATTGCTG ACAACCGTGC CCGCGCATGG GCGCTGACCA ATCTTGCCAC CGCCCTCGCC CGCGCCGGGC ACGCACAGGC AGCCGAAGAT ACCTTTACCG CCGCCCGCCA CGCCGCCGAC GCGATTGCTG ACAACCGTGC CCGCGCAAGG GCGCTGACCA ATCTTGCCAC CGCCCTCGCC CACGCCGGGC ACGCACAGGC AGCCGAAGAT ACCTTTACCG CCGCCCGCCA CGCCGCCGAC GCGATTGCTG ACAACGCTGC CCGCGCCGAG GCGCTGACCA ATCTTGCCAC CGCCCTCGCC CACGCCGGGC ACGCACAGGC AGCCGAAGAT ACCTTTACCG CCGCCCGCCA CGCCGCCGAC GCGATTGCTG ACAACGCTGA CCGCGCAAGG GCGCTGACCA ATCTTGCCAC CGCCCTCGCC CACGCCGGGA ACGCACAGGC AGCCGAAGAT ACCTTTACCG CCGCCCGCCA CGCCGCCGAC GCGATTGCTG ACAACGATGA CCGCGCCAGG GCGCTGACCA ATCTTGCCAC CGCCCTCGCC AACACTGACC GCATCGCCGA GGTCGTGTTA CTCGTTGCCG AGGTATGGCA AAATGCGCAG ACCCGCGCTG AATTGCTACT CTTGTTTGCA ATGGCGACAA TGTTGCTCCG 
GGCGTACCCG GAGATCGGGG CCGGTTTTGT GCAGGCGTTT GCGTGGGTCG AAGAAGCGAT GCGGCGGGGT ATGTAG
|
Protein sequence | MSAATSHRHA LIVGVDHTAD PHLASLTSAE RDARALAAAL EAPACGFTVT LLLGAEATAH RIRREITALR QQAKQHPLDM IVAFCGHGVP VALDGGGHET FLATHDFITD DAALDATAFL SLRWLYQQVY KATEPRSAVL ILDHCYAGNI RDAGNNRLIV DLRAAIEQYR AEQRPTPVPH EWLRAIFPAT RPGELAGETA AGGLLTQAIL TVLRGETRLG DGHITIGQLD HYLKQQFAGQ SQQPYTLIEG NYPLILADYR ERIAAERAAA EQQHRRAEAQ EMLKRWRSPD SHARAAELQA SFVGREKELE EIQRHIEQLR PTGGYLLVTG VAGQGKSSIL ARLTSTTDPP LPAYFIRFMP GPDEQAALLG HLIAELLTQH GLSDEALTYL ADNASPITLR NSLIALLDRV AHKHPLTLMI DGLDQIPVDR NLGQRDLSFL PEQLPKGVVF VIGTRPDDTL VPLKLLTSCR EYPLPPLSLA DFTTLLDRRG VALSEADRAE LYTALHGNAF DLAFLAQELQ RQPNIANAQA LIRQVIANPR NIFVPTIERL KRELLWDRAI KPILGVLTAA QEPLSSPALT GILELNHDLV EDAITLLRGL LGERDRAGQA RYFLLHLKLL DYLRSKLFPD NELATFHNRL AHWCERDLDR LWQSANDQPE AERRAYAQTH LVDHFVHAKA YDRLWQLLDA DEYGAAKRQA DPSLRRYALD LDLARRAVVA AAGDDIPALA RSLPRLWQYS LLRCSLTSQI DRWPVALFTA LVALGRSAEA RDRAELLSNP KHRASVLLAI GHALLNRGEP EEALAVWRGV RKAVDAISDA EERFTQLHEL AVAFTQAGCA EEADAVFAQA RHAADAIADN DDRTRALTAL ATALARAGQF TAARHAADAI ADNDDRAKAL TELATALARA GQFTAARHAA DAIDSEYLRA RALTELATAL AHAGQFTAAR HAADAIDSEY RRAEALTELA TALARAGQFT AARHAADAID SEYLRAWALT ELATALARAG HAQAAEDTFT AARHAADAIA DNAARAEALT ALATALAHAG HAQAAEDTFT AARHAADAID SEYRRAEALT ELATALAHAG HAQAAEDTFT AARHAADAIA DNRTRAWALT NLATALARAG QFTAARHAAD AIADNADRAR ALTKLATALA HAGHVTAARH AADAIADNDD RAEALTKLAT ALAHAGHVTA ARHAADAIDS EYHRARALTE LATALARAGH VTAARHAADA IADNDDRAKA LTALATALAR AGQFTAARHA ADAIADNDDR AEALTKLATA LAHAGHVTAA RHAADAIADN DDRAEALTKL ATALAHAGQF TAARHAADAI DSEYHRARAL TALATALAHA GQFTAARHAA DAIDSEYHRA RALTALATAL AHAGHAQAAE DTFTAARHAA DAIADNADRA RALTNLATAL ARAGHAQAAE DTFTAARHAA DAIADNADRA EALTALATAL AHAGHVTAAR HAADAIADNR TRAWALTNLA TALARAGQFT AARHAADAIA DNADRARALT KLATALAHAG HAQAAEDTFT AARHAADAIA DNADRAEALT ELATALAHAG QFTAARHAAD AIADNRARAW ALTNLATALA RAGHAQAAED TFTAARHAAD AIADNRARAR ALTNLATALA HAGHAQAAED TFTAARHAAD AIADNAARAE ALTNLATALA HAGHAQAAED TFTAARHAAD AIADNADRAR ALTNLATALA HAGNAQAAED TFTAARHAAD AIADNDDRAR ALTNLATALA NTDRIAEVVL LVAEVWQNAQ TRAELLLLFA 
MATMLLRAYP EIGAGFVQAF AWVEEAMRRG M
|
| |
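The sequences above are displayed in space-separated groups of ten characters, so they need to be joined before any length or composition check. The following is a minimal sketch of that cleanup together with a GC-fraction calculation; the helper names `clean_sequence` and `gc_fraction` are hypothetical, and the example uses only the first two groups of the gene sequence rather than the full record.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (spaces, line breaks) and upper-case the sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First two 10-base groups of the gene sequence shown in this record.
first_groups = "ATGTCTGCCG CCACTTCCCA"
seq = clean_sequence(first_groups)
print(len(seq), gc_fraction(seq))  # 20 0.6
```

Applied to the complete gene sequence, the same two steps would give values to compare against the Gene Length and GC content rows.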