Gene Cagg_0761 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_0761 
Symbol 
ID7268080 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp942211 
End bp945015 
Gene Length2805 bp 
Protein Length934 aa 
Translation table11 
GC content58% 
IMG OID643565612 
Productpeptidase C1A papain 
Protein accessionYP_002462121 
Protein GI219847688 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG4870] Cysteine protease 
TIGRFAM ID[TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0657055 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.223124 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCGCCT TGACCCCGCT TACCGCAGTG CCTGCGATCG ATGAGACGAC CCGAAAGGTG 
TTGGCCGACT ACTGGATTAC CAGTGTCGAA GAGTTGGTAG CGACGGCGCG TGCGAGTAAT
GCCGGTCTTG GGAGTGGGCT GGCTGCACTT GCTCAGGTGT TGGGGCGAAG TGAAAACGAT
GTGCGGGCAA TGGTGATGGC AGCGCAGGAA GTGGCGCCTG ATGCCAGCTC GTTTAGCGTT
GATGTCGCGA TGGAGCCGGT CGGTACCGGT GCGATCTTCA CCGATCTGCC GGAGGTCGAT
GCAACTTCGT TTAGTCCACC GGTCGGCTTG CCCGCCGAAG TACCACCGAT TGCGACTCTG
CCTCCGCCAA TCAGTCAAGG ACCACGCAAT ACGTGTGTCG CGTTTACGGT GGCTGCCATG
GTGCAGGCAC TTAGCAACGA CCCCACCGAT CTCTCCGAAC AGTTTATTTA CTGGATTAGT
AAAGCGCGTG ATGGCATCCC CGGTGATGTC GGTACGAACC CGTTGGTCGC ATTACGCGCC
GTTGCCGAGT TAGGGGTCTG CCGTGAGGAG ACGTGGCCCT ATCGCCCCGA ACCGGTAGAC
CATACCAACC CCGGTCACGA GCGTCCAAGT GAAAGAGCAT TTCAGGAAGC TAAGCAGCGT
CGGATTAGCG GGGTTGAACA GTTGCCTCCC CGTGATGTGA ACCAGATCAA GGCTGCACTG
GCCGCCGGCC GACCGGTGTT GATCGGTTTG ATGATCGGTG AGCATTGGAC GAGTAGTGGG
CAGGTCCGTC GAATTGGGCG GGTGCGTAAG GCGTTGCCCG GTGAGCAACG GTTGGGTGGT
CATGCGATGT GTGTATTGGG CTACCGTGAT GATCCGACGG CTCCCGGCGG TGGCTATTTT
ATTGTGCGGA ATTCGTGGGG AAGCGAATGG GCGAACGAAA ATCCTGATGG TCCCGGTTAT
TGCTATGTCC CATATCAACT CATCTATGAA GAGGGTTTGG CAGCGTTGAT CGCTACCGGT
GTGATCATTG AAGCAGCAAC TGCGTCGACA GCGACGTTGA GTGCGCCGAC TACCTCGGAG
TTGGCGGCGA TTCTTGCCGA AGCTCAGGTG ATCCGTGCCC GGCTTGATAC GTTGATCAAT
CGGTTGCAGG CGCTGGTGGG TGGGCAACCG CAACCGGTCA TGAGCGCCGT TGAGCCGGCC
CCGGCGTTAC CACCATCACC TGAACCGGCA GCAGTTGTGG CCGGCTATAG CGGCCCATTG
ATCCTTATTG CCGATGAGCA GAGCCGTGAT GAGTTGTACC CGAATGGGAT CGATGGCCGC
CGAGGCGAAC CGTTGTTGCG GATCGATGCG AAAGCCGCCA GCGAATTGGC CCAACGTAGC
GATGATCCGA AAGAATTGCA GACACTCCAC AAGACCCGTA ACGAGGCTGA AGAAAGACAT
TTTGGAGTGG TTGCCGATGT GGATCAAGAA GACCTCGCGC AAGCGCGTTG GGCGGTTATG
GTGAACGCGG TTGACGATGC TCGTATTATT CAGGCGTTGT GGCCGCTCAT CGAGTATCGT
GCCTACCAGC AGGGTATTGA CCTCCCGCTG GTGGACTTCC GTCCTGGTGA AACGTGTGCT
GAATGGGCTA GTCGCTACGC CGATCCCAAG CAACCGTGGG AACAGCGGGC ACCGGTCTTG
GTGTATCGTC CGGGTGAGCG AGTCAATAGC TGGCTCGCCC GTCATGGCAC GATGCCCGGC
CCGGTCAAGC CGAGCCAAGG CGTTCCCTTC TATATCCTGA TTGCAGCGCG CCCCGGCCCG
CTTACCGCGA ACGATCAGGC GTTTATTAGC TTCAATGTCC AATACGAACT CGATATCTTC
TGGGGAGTGG GTCGTCTCTG TTTCACCGAC GAACGCGGTC ACCATCGCTA TGCCGATTAC
ACTACCTACG CGCAGCGCCT GGTTGACTAC GAACGAAGGT CGGTCAACGA TGTTCGGATA
CGCCGCGAGA TTGTATACTT TGGTACCCGT CACGATCTCG ATAAATCGAC CGAACGTAGT
GCGCTTGAGC TGGTGAAGCC GCTGGCCGAG TGGCACGACC GTGGTCTACC ACAGCGATTG
GGCTACGGGA AGAGGTTGTT GTTGGCAAAC GACGCAACCC GCAGTAACCT TGAGCAAGCT
CTGCGTGATG GAAACCGGCC ACCGGCGATC TGGTTTAGCG CGACGCATGG TCTAGGCCTG
CCGGTCACCG ACCGCGAGTT GATCCTTTAT CAAGGAGCAC TGGTGACGCA AGACTGGACC
GGTTTTGGGG GGATTAAGCG CGAGCATTGG TTTGCTGCCG AAGATTTGCC AAGTAACCTC
TCGCTCGAAG GTATGGTTGC CTTGCTGTTT GCCTGCTATG GCGCCGGTTG CCCACAGCGA
GATGAGTTTA TCGTTGACCC GGAAAAAGGC CGTCCGGTCA TCGCCCCGTT TACCTTCGTC
GCCCAACTAC CGCAGCAGCT CTTGCTACGC GGTGCGCTTG GGGTGGTTGG TCATGTGGAA
CGGGCATGGA CATACGGTTT CAGCATGGAC GGCGCGCGTG GTCAGACCCA AGCGTTTGAA
GATGTGATCG GTCGGTTGGT GGCCGGGAAG CGGCTGGGGA GTGCGACCGA TCAATTCAAC
ATTATTCAAG CGGCACGCTC GATGACCTTA GCCGAGGAAC TTGAGAATAT CAAATTCGGC
AAGCAACCGG AACCGCGTGA GTTGTCAACG CTGTGGATGG CCCGTAACGA TGCCCGTAAC
TACATGTTGT TGGGTGATCC GGCTGCTCGC TTGCCCGTAC CGTAA
 
Protein sequence
MRALTPLTAV PAIDETTRKV LADYWITSVE ELVATARASN AGLGSGLAAL AQVLGRSEND 
VRAMVMAAQE VAPDASSFSV DVAMEPVGTG AIFTDLPEVD ATSFSPPVGL PAEVPPIATL
PPPISQGPRN TCVAFTVAAM VQALSNDPTD LSEQFIYWIS KARDGIPGDV GTNPLVALRA
VAELGVCREE TWPYRPEPVD HTNPGHERPS ERAFQEAKQR RISGVEQLPP RDVNQIKAAL
AAGRPVLIGL MIGEHWTSSG QVRRIGRVRK ALPGEQRLGG HAMCVLGYRD DPTAPGGGYF
IVRNSWGSEW ANENPDGPGY CYVPYQLIYE EGLAALIATG VIIEAATAST ATLSAPTTSE
LAAILAEAQV IRARLDTLIN RLQALVGGQP QPVMSAVEPA PALPPSPEPA AVVAGYSGPL
ILIADEQSRD ELYPNGIDGR RGEPLLRIDA KAASELAQRS DDPKELQTLH KTRNEAEERH
FGVVADVDQE DLAQARWAVM VNAVDDARII QALWPLIEYR AYQQGIDLPL VDFRPGETCA
EWASRYADPK QPWEQRAPVL VYRPGERVNS WLARHGTMPG PVKPSQGVPF YILIAARPGP
LTANDQAFIS FNVQYELDIF WGVGRLCFTD ERGHHRYADY TTYAQRLVDY ERRSVNDVRI
RREIVYFGTR HDLDKSTERS ALELVKPLAE WHDRGLPQRL GYGKRLLLAN DATRSNLEQA
LRDGNRPPAI WFSATHGLGL PVTDRELILY QGALVTQDWT GFGGIKREHW FAAEDLPSNL
SLEGMVALLF ACYGAGCPQR DEFIVDPEKG RPVIAPFTFV AQLPQQLLLR GALGVVGHVE
RAWTYGFSMD GARGQTQAFE DVIGRLVAGK RLGSATDQFN IIQAARSMTL AEELENIKFG
KQPEPRELST LWMARNDARN YMLLGDPAAR LPVP