Gene Cagg_1055 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_1055 
Symbol 
ID7268507 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp1306168 
End bp1307805 
Gene Length1638 bp 
Protein Length545 aa 
Translation table11 
GC content58% 
IMG OID643565900 
Productchaperonin GroEL 
Protein accessionYP_002462405 
Protein GI219847972 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0459] Chaperonin GroEL (HSP60 family) 
TIGRFAM ID[TIGR02348] chaperonin GroL 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGAAGC AACTGATCTT CGATCAGCAA GCTCGTACTG CACTCAAACA CGGTATCGAT 
ACACTCGCGC TAGCAGTCAA AACGACGCTC GGTCCACGTG GGCGTAATGT CGCACTCGAT
AAGAAATGGG GTGCGCCAAC CGTCACCCAC GACGGTGTAA GCGTCGCGAA GGAGATTGAG
CTGAAGGATC CGTTCGCCAA CCTTGGTGTC CAACTGCTCA AGCAGGCAGC CGTCAAGACC
AACGATGTTG CCGGTGATGG TACCACCACG GCAACGGTGC TTGCCCAAGC GATTATCAAT
GAGGGCTTGA AACTGGTCGC TGCCGGCGCC AACCCCATGC TGCTCAAGCG TGGTCTCGAT
AAAGGTGGTC AAGCACTGGT CGCTCGCATC AAAGAGCAGG CGATCACCCT CAAGACCCGC
GATGAGATTC GGAACGTCGC GACAATCTCT GCCCAAGATG CCGAGGTTGG TGAGTTACTG
GCGACCGTGA TGGATAAGAT CGGTCGCGAC GGCGTCGTTA CGGTCGAGGA GGGGAAGAGC
ACCCATCTCG AGCACGAGCT GGTCGAGGGT ATGCAGTTTG ACCGTGGTTA TATCTCGCCC
TACTTCATCA CCGACTCGGC TCGCATGGAG GCGGTGCTCG ATGAACCCTA CATCTTGATC
ACCGACAAGA AGATCAGCTC GATCAAGGAT TTGTTGCCGA TCCTTGAAGC GGTGCTAAGC
AGCGGCAAGA AGGATCTCCT CGTCATCGCT GAGGATGTTG ATGGTGAGGC TCTGGCGACG
TTGGTCGTCA ACAAGCTGCG TGGTACCCTT AATGCCCTCG CCGTGAAGGC CCCCGGCTTC
GGCGACCGGC GCAAGGCGAT GCTGCAAGAT ATTGCGATCC TCACCGGTGG TACCGTCATC
TCCGAGGAGA TTGGCCGCAA GCTCGAAAGC GCTACTCTGC AAGACCTTGG CCGCGCCCGC
CGCGTGAAGG CCGACAAGGA TAACACTGTG ATCGTCGAGG GTCACGGTGA CAAGCAAGCC
ATCCAGGCTC GCATTGCCCA ACTCAAGCAG CAGATCGAGA CTACAACTTC GGATTACGAC
CGCGAGAAGT TGCAGGAGCG TGTCGCGAAG TTGTCGGGTG GTGTGGCCGT GATCAAGGTC
GGTGCGCCGA CCGAACCGGC GATGAAAGAG CGCAAAGCCC GCGTCGAAGA TGCGCTCAAC
GCGACCCGCG CTGCAGTTGA GGAGGGTATC GTTCCCGGTG GTGGTGTCGC ACTTCTCAAC
GCCATCCCAG CACTCGATAA CGTCACCACT CAGTTTGACG AAGAGCGCAT GGCGCTCAAC
GTCCTGCGCC GCGCCCTCGA AGAGCCACTC CGCCAGCTCG CAACCAATGC CGGCGAAGAT
GGTTCGGTGG TGGTTGAGAA CGTGCGCAAC GAGCAGCGGA AGCACAACAA CAACCACTAC
GGTTACGATG TCATGACCGG TACGTATGTC GATCTCATGC AAGCCGGCAT TATCGACCCG
GCCAAAGTGG TACGTACCGC GTTGGAGAAC GCAATTAGCG TCGCCGGTAT GGTGCTGACC
ACCGAGGCGT TGATCGTCGA GGCCCCTGAA CCCAAGAAGA ACAACAACAC GCCACCAATG
CCGGACGACG ATTTCTAA
 
Protein sequence
MAKQLIFDQQ ARTALKHGID TLALAVKTTL GPRGRNVALD KKWGAPTVTH DGVSVAKEIE 
LKDPFANLGV QLLKQAAVKT NDVAGDGTTT ATVLAQAIIN EGLKLVAAGA NPMLLKRGLD
KGGQALVARI KEQAITLKTR DEIRNVATIS AQDAEVGELL ATVMDKIGRD GVVTVEEGKS
THLEHELVEG MQFDRGYISP YFITDSARME AVLDEPYILI TDKKISSIKD LLPILEAVLS
SGKKDLLVIA EDVDGEALAT LVVNKLRGTL NALAVKAPGF GDRRKAMLQD IAILTGGTVI
SEEIGRKLES ATLQDLGRAR RVKADKDNTV IVEGHGDKQA IQARIAQLKQ QIETTTSDYD
REKLQERVAK LSGGVAVIKV GAPTEPAMKE RKARVEDALN ATRAAVEEGI VPGGGVALLN
AIPALDNVTT QFDEERMALN VLRRALEEPL RQLATNAGED GSVVVENVRN EQRKHNNNHY
GYDVMTGTYV DLMQAGIIDP AKVVRTALEN AISVAGMVLT TEALIVEAPE PKKNNNTPPM
PDDDF