Gene Cagg_0488 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_0488 
Symbol 
ID7266984 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp605378 
End bp607018 
Gene Length1641 bp 
Protein Length546 aa 
Translation table11 
GC content61% 
IMG OID643565352 
Productchaperonin GroEL 
Protein accessionYP_002461865 
Protein GI219847432 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0459] Chaperonin GroEL (HSP60 family) 
TIGRFAM ID[TIGR02348] chaperonin GroL 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00790388 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0259613 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCAAAGC AGCTCTCTTT CAATGAAGAA GCTCGCCGCG CGCTTAAGCG TGGTGTTGAT 
CTCGTCGCCG ATGCGGTCAA GACGACCCTT GGCCCGCGTG GCCGCAATGT GGCGATCGAC
AAGAAGTTCG GTTCGCCGAC GGTGACACAC GACGGTGTGA CGGTTGCGAA GGAGATCGAT
CTGAAAGATC CCTTCGAGAA CATGGGCGCG CAGCTTCTCA AGGAAGCAGC CACCAAGACC
AACGACGTTG CCGGTGACGG TACTACCACT GCCACCGTAC TGGCCCAGGC GATCGTTACC
GAGGGTCTGA AGGTTGTCGC AGCCGGTGCG AATGCGATGC TGATCAAGCG TGGCCTTGAT
CGGGGTGCCG AGGCGTTGGT GGCCGCAATC AAGGCGAGCG CTGTGCCGGT ACGTGACCGC
GCCGACATTG CGCACGTAGC GACCAACTCG GCGGCCGACA GTGAGATCGG TGAGCTGATC
GCCGAGGTGA TGGAGAAGGT CGGCAAGGAT GGCGTCATCA CCGTCGAAGA GTCGAAGGGC
GTGACCTTCG AGAAAGAGTA CACCGAAGGC ATGCAGTTCG ACCGTGGCTA CATCTCGGGC
TACATGGTGA CCAATGTCGA GCGGCAGGAA GCAGAGCTTG ACGATCCCTA CATCCTGATC
ACCGACAAGA AGATCAGCAG CATCCAAGAG ATCCTGCCGA TCCTTGAGAA GGTATTGCAG
GTCACGAAGA ACTTCGTCAT CATCGCCGAG GACGTTGACG GCGAGGCGTT GGCGACGTTA
GTCGTCAACA AGCTGCGCGG CACGATCAAC GCGCTGGCAG TCAAGGCTCC GGGCTTTGGT
GATCGCCGCA AGGCAATGCT GCAAGACATC GCCATTCTTA CCGGTGGTAC CGTGATCAGT
GAGGAGATCG GTCGCAAGCT CGACAGCGCG ACGATTGAAG ACCTGGGCCG CGCGCGCAAG
GTCATTGCGA CCAAGGACGA TACCACCATT ATCGAGGGCC GTGGTGACGA AGCTGCGATC
CGTGCGCGCA TTGAGCAGAT CCGTGCCCAA ATTGCGACAA CGACCAGCGA CTTTGACCGT
GAGAAGCTGC AGGAGCGGCT GGCGAAGCTG GCCGGTGGTG TCGCCGTGAT TAAGGTCGGT
GCGGCGACCG AGCCGGAGCT GAAGGAGAAG AAGCACCGCG TCGAGGACGC GCTGAGCGCG
ACCCGCGCAG CGGTCGAAGA GGGTATCGTA CCCGGTGGTG GTGTGGCGTT GATCAACGCC
ATCCCGGCGC TCGACAATGT GCAGGTCGCC CACGAGGATG AGAAGGTCGG TCTGCAGATC
CTGCGCCGTG CTCTCGAAGA GCCGCTGCGC ATCCTTGCCC GCAACGCCGG CGAGGATGGC
TCGGTGATTA TCGCCAATGT CCGCCGCTTG CAGGAGGAGA AGGGCGATAA GACCATCGGC
TACAACGTGC TGACCGGCCA GTACGGCAGC ATGATCGAGC AGGGTATCAT TGACCCGGTG
AAGGTGACGC GCAGTGCGGT GCAGAACGCA GTTTCGATTG CCGGTATGAT CCTGACCACC
GAGGCGCTGA TCACCGACAT TCCCGAGGAT AAGCCGGCTG TCGGTGCCGG CGCCGGCGCC
GGTGCTGGGA TGGACTTCTA G
 
Protein sequence
MPKQLSFNEE ARRALKRGVD LVADAVKTTL GPRGRNVAID KKFGSPTVTH DGVTVAKEID 
LKDPFENMGA QLLKEAATKT NDVAGDGTTT ATVLAQAIVT EGLKVVAAGA NAMLIKRGLD
RGAEALVAAI KASAVPVRDR ADIAHVATNS AADSEIGELI AEVMEKVGKD GVITVEESKG
VTFEKEYTEG MQFDRGYISG YMVTNVERQE AELDDPYILI TDKKISSIQE ILPILEKVLQ
VTKNFVIIAE DVDGEALATL VVNKLRGTIN ALAVKAPGFG DRRKAMLQDI AILTGGTVIS
EEIGRKLDSA TIEDLGRARK VIATKDDTTI IEGRGDEAAI RARIEQIRAQ IATTTSDFDR
EKLQERLAKL AGGVAVIKVG AATEPELKEK KHRVEDALSA TRAAVEEGIV PGGGVALINA
IPALDNVQVA HEDEKVGLQI LRRALEEPLR ILARNAGEDG SVIIANVRRL QEEKGDKTIG
YNVLTGQYGS MIEQGIIDPV KVTRSAVQNA VSIAGMILTT EALITDIPED KPAVGAGAGA
GAGMDF