Gene Cpha266_1936 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCpha266_1936 
SymbolgroEL 
ID4570050 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium phaeobacteroides DSM 266 
KingdomBacteria 
Replicon accessionNC_008639 
Strand
Start bp2243329 
End bp2244972 
Gene Length1644 bp 
Protein Length547 aa 
Translation table11 
GC content50% 
IMG OID639766518 
Productchaperonin GroEL 
Protein accessionYP_912376 
Protein GI119357732 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0459] Chaperonin GroEL (HSP60 family) 
TIGRFAM ID[TIGR02348] chaperonin GroL 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000102441 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACTGCTA AAGATATTAT TTTTGATTCT GACGCGAGAG CGAAACTGAA AGTTGGCGTT 
GACAAACTGG CCAACGCGGT TAAAGTTACT CTTGGACCTG CCGGACGCAA TGTCCTGATC
GACAAAAAAT TCGGTGCTCC AACTTCCACC AAAGATGGCG TGACTGTTGC CAAAGAGATC
GAACTTGCTG ATGCTGTTGA GAACATGGGC GCGCAGATGG TTCGTGAAGT TGCTTCGAAA
ACCAGTGATG TTGCCGGTGA CGGTACCACT ACGGCAACCG TTCTTGCACA GGCTATCTAT
CGTGAAGGTC TGAAGAACGT TGCAGCCGGT GCCCGCCCGA TTGATTTGAA AAGAGGCATC
GACCGTGCTG TCAAAGAGGT TGTGCTTGAG CTGAGAAATA TCAGCCGCAG CATCTCCGGT
AAAAAAGAGA TTGCCCAGGT CGGCACTATT TCTGCCAACA ACGATCCTGA AATCGGCGAA
CTGATTGCCG AAGCCATGGA TAAGGTCGGC AAGGACGGCG TTATTACCGT TGAAGAGGCA
AAAGGCATGG ATACCGAGCT GAAGGTTGTT GAGGGTATGC AGTTTGATCG TGGCTACCTT
TCGCCGTACT TCGTGACCAA TCCTGAAAAC ATGGAGGCAG AGCTCGAAGA TCCGCTTATC
CTTATTCATG ACAAAAAGAT CAGCAACATG AAAGAGCTGT TGCCGATTCT TGAAAAATCA
GCACAGTCCG GTCGTCCCCT CCTCATCATT TCCGAGGATA TCGAAGGCGA GGCACTTGCT
ACGCTTGTTG TCAACAGGCT CAGGGGTACC CTGAAAGTCT GCGCCGTCAA AGCTCCGGGC
TTCGGCGATC GTCGCAAAGC AATGCTTGAA GATATCGCTA TTCTTACCGG CGGTACCGTT
ATTTCTGAAG AGAAAGGCTA CAAACTTGAA AACGCGACGC TTACCTATCT TGGTCAGGCC
GGTCGTATTA CGGTTGACAA GGACAATACC ACTGTTGTTG AGGGTAAGGG CAAGCCGGAA
GAGATCAAGG CTCGCATCAA CGAAATCAAA GGCCAGATTG AAAAATCAAC CTCTGATTAT
GATACCGAAA AATTGCAGGA GCGGCTTGCA AAACTTTCCG GCGGCGTAGC CGTACTCAAT
ATCGGTGCAT CTACCGAAGT TGAGATGAAA GAGAAAAAAG CCCGCGTTGA AGATGCGCTG
CATGCAACCC GCGCTGCTGT TCAGGAAGGT ATTGTTGTTG GTGGCGGTGT TGCGCTTATT
CGTGCTATCA AAGGCCTCGA TAATGCGGTT GCCGACAATG AAGATCAGAA AACCGGCATC
GAAATTATCC GTCGCGCGCT TGAAGAGCCG CTTCGCCAGA TCGTTGCGAA CACCGGCACT
ACCGATGGTG CAGTTGTTCT TGAAAAGGTG AAGAATGGCG AAGGCGACTT TGGTTTCAAT
GCCAGAACCG AACAGTACGA AAACCTGGTT GAAGCAGGTG TTGTCGATCC TACCAAGGTG
ACCAGAAGCG CTCTTGAGAA CGCTGCATCA GTTGCCAGTA TTCTTTTGAC AACCGAAGCT
GCAATTACAG ACATCAAGGA AGAAAAATCC GACATGCCTG CAATGCCTCC GGGCGGAATG
GGTGGTATGG GCGGTATGTA CTGA
 
Protein sequence
MTAKDIIFDS DARAKLKVGV DKLANAVKVT LGPAGRNVLI DKKFGAPTST KDGVTVAKEI 
ELADAVENMG AQMVREVASK TSDVAGDGTT TATVLAQAIY REGLKNVAAG ARPIDLKRGI
DRAVKEVVLE LRNISRSISG KKEIAQVGTI SANNDPEIGE LIAEAMDKVG KDGVITVEEA
KGMDTELKVV EGMQFDRGYL SPYFVTNPEN MEAELEDPLI LIHDKKISNM KELLPILEKS
AQSGRPLLII SEDIEGEALA TLVVNRLRGT LKVCAVKAPG FGDRRKAMLE DIAILTGGTV
ISEEKGYKLE NATLTYLGQA GRITVDKDNT TVVEGKGKPE EIKARINEIK GQIEKSTSDY
DTEKLQERLA KLSGGVAVLN IGASTEVEMK EKKARVEDAL HATRAAVQEG IVVGGGVALI
RAIKGLDNAV ADNEDQKTGI EIIRRALEEP LRQIVANTGT TDGAVVLEKV KNGEGDFGFN
ARTEQYENLV EAGVVDPTKV TRSALENAAS VASILLTTEA AITDIKEEKS DMPAMPPGGM
GGMGGMY