Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cagg_0761 |
Symbol | |
ID | 7268080 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chloroflexus aggregans DSM 9485 |
Kingdom | Bacteria |
Replicon accession | NC_011831 |
Strand | - |
Start bp | 942211 |
End bp | 945015 |
Gene Length | 2805 bp |
Protein Length | 934 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 643565612 |
Product | peptidase C1A papain |
Protein accession | YP_002462121 |
Protein GI | 219847688 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG4870] Cysteine protease |
TIGRFAM ID | [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.0657055 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.223124 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGCGCCT TGACCCCGCT TACCGCAGTG CCTGCGATCG ATGAGACGAC CCGAAAGGTG TTGGCCGACT ACTGGATTAC CAGTGTCGAA GAGTTGGTAG CGACGGCGCG TGCGAGTAAT GCCGGTCTTG GGAGTGGGCT GGCTGCACTT GCTCAGGTGT TGGGGCGAAG TGAAAACGAT GTGCGGGCAA TGGTGATGGC AGCGCAGGAA GTGGCGCCTG ATGCCAGCTC GTTTAGCGTT GATGTCGCGA TGGAGCCGGT CGGTACCGGT GCGATCTTCA CCGATCTGCC GGAGGTCGAT GCAACTTCGT TTAGTCCACC GGTCGGCTTG CCCGCCGAAG TACCACCGAT TGCGACTCTG CCTCCGCCAA TCAGTCAAGG ACCACGCAAT ACGTGTGTCG CGTTTACGGT GGCTGCCATG GTGCAGGCAC TTAGCAACGA CCCCACCGAT CTCTCCGAAC AGTTTATTTA CTGGATTAGT AAAGCGCGTG ATGGCATCCC CGGTGATGTC GGTACGAACC CGTTGGTCGC ATTACGCGCC GTTGCCGAGT TAGGGGTCTG CCGTGAGGAG ACGTGGCCCT ATCGCCCCGA ACCGGTAGAC CATACCAACC CCGGTCACGA GCGTCCAAGT GAAAGAGCAT TTCAGGAAGC TAAGCAGCGT CGGATTAGCG GGGTTGAACA GTTGCCTCCC CGTGATGTGA ACCAGATCAA GGCTGCACTG GCCGCCGGCC GACCGGTGTT GATCGGTTTG ATGATCGGTG AGCATTGGAC GAGTAGTGGG CAGGTCCGTC GAATTGGGCG GGTGCGTAAG GCGTTGCCCG GTGAGCAACG GTTGGGTGGT CATGCGATGT GTGTATTGGG CTACCGTGAT GATCCGACGG CTCCCGGCGG TGGCTATTTT ATTGTGCGGA ATTCGTGGGG AAGCGAATGG GCGAACGAAA ATCCTGATGG TCCCGGTTAT TGCTATGTCC CATATCAACT CATCTATGAA GAGGGTTTGG CAGCGTTGAT CGCTACCGGT GTGATCATTG AAGCAGCAAC TGCGTCGACA GCGACGTTGA GTGCGCCGAC TACCTCGGAG TTGGCGGCGA TTCTTGCCGA AGCTCAGGTG ATCCGTGCCC GGCTTGATAC GTTGATCAAT CGGTTGCAGG CGCTGGTGGG TGGGCAACCG CAACCGGTCA TGAGCGCCGT TGAGCCGGCC CCGGCGTTAC CACCATCACC TGAACCGGCA GCAGTTGTGG CCGGCTATAG CGGCCCATTG ATCCTTATTG CCGATGAGCA GAGCCGTGAT GAGTTGTACC CGAATGGGAT CGATGGCCGC CGAGGCGAAC CGTTGTTGCG GATCGATGCG AAAGCCGCCA GCGAATTGGC CCAACGTAGC GATGATCCGA AAGAATTGCA GACACTCCAC AAGACCCGTA ACGAGGCTGA AGAAAGACAT TTTGGAGTGG TTGCCGATGT GGATCAAGAA GACCTCGCGC AAGCGCGTTG GGCGGTTATG GTGAACGCGG TTGACGATGC TCGTATTATT CAGGCGTTGT GGCCGCTCAT CGAGTATCGT GCCTACCAGC AGGGTATTGA CCTCCCGCTG GTGGACTTCC GTCCTGGTGA AACGTGTGCT GAATGGGCTA GTCGCTACGC CGATCCCAAG CAACCGTGGG AACAGCGGGC ACCGGTCTTG GTGTATCGTC CGGGTGAGCG AGTCAATAGC TGGCTCGCCC GTCATGGCAC GATGCCCGGC CCGGTCAAGC CGAGCCAAGG CGTTCCCTTC TATATCCTGA TTGCAGCGCG CCCCGGCCCG CTTACCGCGA ACGATCAGGC GTTTATTAGC TTCAATGTCC AATACGAACT CGATATCTTC TGGGGAGTGG GTCGTCTCTG TTTCACCGAC GAACGCGGTC ACCATCGCTA TGCCGATTAC ACTACCTACG CGCAGCGCCT GGTTGACTAC GAACGAAGGT CGGTCAACGA TGTTCGGATA CGCCGCGAGA TTGTATACTT TGGTACCCGT CACGATCTCG ATAAATCGAC CGAACGTAGT GCGCTTGAGC TGGTGAAGCC GCTGGCCGAG TGGCACGACC GTGGTCTACC ACAGCGATTG GGCTACGGGA AGAGGTTGTT GTTGGCAAAC GACGCAACCC GCAGTAACCT TGAGCAAGCT CTGCGTGATG GAAACCGGCC ACCGGCGATC TGGTTTAGCG CGACGCATGG TCTAGGCCTG CCGGTCACCG ACCGCGAGTT GATCCTTTAT CAAGGAGCAC TGGTGACGCA AGACTGGACC GGTTTTGGGG GGATTAAGCG CGAGCATTGG TTTGCTGCCG AAGATTTGCC AAGTAACCTC TCGCTCGAAG GTATGGTTGC CTTGCTGTTT GCCTGCTATG GCGCCGGTTG CCCACAGCGA GATGAGTTTA TCGTTGACCC GGAAAAAGGC CGTCCGGTCA TCGCCCCGTT TACCTTCGTC GCCCAACTAC CGCAGCAGCT CTTGCTACGC GGTGCGCTTG GGGTGGTTGG TCATGTGGAA CGGGCATGGA CATACGGTTT CAGCATGGAC GGCGCGCGTG GTCAGACCCA AGCGTTTGAA GATGTGATCG GTCGGTTGGT GGCCGGGAAG CGGCTGGGGA GTGCGACCGA TCAATTCAAC ATTATTCAAG CGGCACGCTC GATGACCTTA GCCGAGGAAC TTGAGAATAT CAAATTCGGC AAGCAACCGG AACCGCGTGA GTTGTCAACG CTGTGGATGG CCCGTAACGA TGCCCGTAAC TACATGTTGT TGGGTGATCC GGCTGCTCGC TTGCCCGTAC CGTAA
|
Protein sequence | MRALTPLTAV PAIDETTRKV LADYWITSVE ELVATARASN AGLGSGLAAL AQVLGRSEND VRAMVMAAQE VAPDASSFSV DVAMEPVGTG AIFTDLPEVD ATSFSPPVGL PAEVPPIATL PPPISQGPRN TCVAFTVAAM VQALSNDPTD LSEQFIYWIS KARDGIPGDV GTNPLVALRA VAELGVCREE TWPYRPEPVD HTNPGHERPS ERAFQEAKQR RISGVEQLPP RDVNQIKAAL AAGRPVLIGL MIGEHWTSSG QVRRIGRVRK ALPGEQRLGG HAMCVLGYRD DPTAPGGGYF IVRNSWGSEW ANENPDGPGY CYVPYQLIYE EGLAALIATG VIIEAATAST ATLSAPTTSE LAAILAEAQV IRARLDTLIN RLQALVGGQP QPVMSAVEPA PALPPSPEPA AVVAGYSGPL ILIADEQSRD ELYPNGIDGR RGEPLLRIDA KAASELAQRS DDPKELQTLH KTRNEAEERH FGVVADVDQE DLAQARWAVM VNAVDDARII QALWPLIEYR AYQQGIDLPL VDFRPGETCA EWASRYADPK QPWEQRAPVL VYRPGERVNS WLARHGTMPG PVKPSQGVPF YILIAARPGP LTANDQAFIS FNVQYELDIF WGVGRLCFTD ERGHHRYADY TTYAQRLVDY ERRSVNDVRI RREIVYFGTR HDLDKSTERS ALELVKPLAE WHDRGLPQRL GYGKRLLLAN DATRSNLEQA LRDGNRPPAI WFSATHGLGL PVTDRELILY QGALVTQDWT GFGGIKREHW FAAEDLPSNL SLEGMVALLF ACYGAGCPQR DEFIVDPEKG RPVIAPFTFV AQLPQQLLLR GALGVVGHVE RAWTYGFSMD GARGQTQAFE DVIGRLVAGK RLGSATDQFN IIQAARSMTL AEELENIKFG KQPEPRELST LWMARNDARN YMLLGDPAAR LPVP
|
| |