Gene Cphamn1_0783 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCphamn1_0783 
SymbolgroEL 
ID6374450 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium phaeobacteroides BS1 
KingdomBacteria 
Replicon accessionNC_010831 
Strand
Start bp838459 
End bp840111 
Gene Length1653 bp 
Protein Length550 aa 
Translation table11 
GC content50% 
IMG OID642683291 
Productchaperonin GroEL 
Protein accessionYP_001959215 
Protein GI189499745 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0459] Chaperonin GroEL (HSP60 family) 
TIGRFAM ID[TIGR02348] chaperonin GroL 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000375112 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.275175 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTGCAA AAGATATTCT CTTTGACGCA TCGGCCAGAG CAAAGCTCAA AGTCGGCGTC 
GACAAACTCG CAGATGCTGT TAAGGTAACT CTCGGCCCGG CCGGTCGAAA TGTTCTTATC
GACAAAAAAT TTGGTGCACC CACTTCAACC AAAGACGGTG TCACTGTTGC AAAAGAGATC
GAGCTTGAGG ATTCCTTTGA AAATATGGGA GCCCAGATGG TTCGTGAAGT ATCCTCGAAA
ACAAGTGATG TCGCTGGTGA CGGCACAACC ACAGCTACGG TTCTCGCTCA GGCGATCTAC
CGTGAAGGGT TGAAAAACGT TGCTGCTGGC GCACGTCCGA TCGATCTTAA AAGAGGCATA
GACAAGGCTG TGAAAGAGGT GATCGGTGAG CTGAGAACTA TCAGCAACGA TATTTCCGGA
AAAATAGAGA TTGCCCAGGT TGGAACCATC TCTGCCAACA ACGACCCTGA AATCGGTCAG
TTGATAGCTG ATGCGATGGA AAAGGTGGGC AAGGACGGCG TCATTACCGT TGAAGAAGCC
AAGGGTATGG ATACCGAGTT GAAAGTTGTG GAAGGTATGC AGTTCGACCG CGGCTACCTC
TCTCCGTACT TTGTGACCAA TTCCGAGAAG ATGGATGCCG AGCTTGAAGA TCCCTATATC
CTCATCCATG ACAAGAAGAT CAGCAACATG AAAGATCTTC TCCCGATTCT CGAGAAAACC
GCCCAGTCAG GACGGCCTTT GATGATCATC TCGGAGGACA TCGAGGGTGA AGCACTTGCT
ACGCTCGTCG TCAATAAACT TCGCGGAACC CTTAAAGTCT GTGCCGTTAA AGCACCGGGC
TTCGGTGACC GTCGTAAAGC CATGCTTGAG GATATCGCTA TTCTTACCGG TGGAACCGTT
ATCTCTGAGG AAAAAGGCTA CAAACTCGAG AACGCCACGA TCTCCTACCT CGGTCAGGCA
GCTACTGTCA CGGTAGACAA AGACAATACA ACTATTGTTG AAGGTAAGGG ACAGGCTGAC
GATATCAAGG CACGCATCAA CGAAATCAAA AATCAGATCG ATGCGTCCAC TTCCGATTAT
GATACTGAAA AGCTTCAGGA GCGTCTCGCA AAGCTTTCAG GCGGCGTTGC TGTTATCAAC
ATCGGCGCTT CGACTGAAGT TGAGATGAAA GAGAAAAAAG CTCGTGTTGA AGATGCTCTG
CACGCCACTC GTGCAGCTGT TCAGGAAGGC ATTGTCGCCG GCGGCGGTGT TGCTCTGATT
CGCGCGGCAA AAGGACTCGA CAATGTGCAG CCGGAAAACG AAGATCAGAA AACCGGTGTG
GAAATTGTTC GTCGTGCTCT TGAAGAACCT CTGCGTCAGA TCGTTGCAAA TACCGGCACA
ACCGATGGTG CTGTTGTTGT CGAAAGGGTA AAGCAGGGTG AAGGCGACTT TGGTTTCAAT
GCCAGAACAG AGGAATATGA GAAGATGACG GAAGCAGGAG TTGTTGATCC TACCAAGGTG
ACAAGGACAG CTCTTGAAAA CGCCGCTTCG GTCGCAGGAA TTCTCCTGAC CACTGAAGCA
GCTATCACCG ACATCAAGGA AGAAGGAGGC GATATGCCTG CTATGCCTCC GGGCGGCATG
GGCGGCATGG GTGGCATGGG CGGTATGATG TAA
 
Protein sequence
MSAKDILFDA SARAKLKVGV DKLADAVKVT LGPAGRNVLI DKKFGAPTST KDGVTVAKEI 
ELEDSFENMG AQMVREVSSK TSDVAGDGTT TATVLAQAIY REGLKNVAAG ARPIDLKRGI
DKAVKEVIGE LRTISNDISG KIEIAQVGTI SANNDPEIGQ LIADAMEKVG KDGVITVEEA
KGMDTELKVV EGMQFDRGYL SPYFVTNSEK MDAELEDPYI LIHDKKISNM KDLLPILEKT
AQSGRPLMII SEDIEGEALA TLVVNKLRGT LKVCAVKAPG FGDRRKAMLE DIAILTGGTV
ISEEKGYKLE NATISYLGQA ATVTVDKDNT TIVEGKGQAD DIKARINEIK NQIDASTSDY
DTEKLQERLA KLSGGVAVIN IGASTEVEMK EKKARVEDAL HATRAAVQEG IVAGGGVALI
RAAKGLDNVQ PENEDQKTGV EIVRRALEEP LRQIVANTGT TDGAVVVERV KQGEGDFGFN
ARTEEYEKMT EAGVVDPTKV TRTALENAAS VAGILLTTEA AITDIKEEGG DMPAMPPGGM
GGMGGMGGMM