Gene Mchl_5249 details

Gene Information

Locus tag: Mchl_5249
Symbol:
ID: 7113857
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylobacterium chloromethanicum CM4
Kingdom: Bacteria
Replicon accession: NC_011757
Strand:
Start bp: 5620326
End bp: 5621966
Gene Length: 1641 bp
Protein Length: 546 aa
Translation table: 11
GC content: 67%
IMG OID: 643527942
Product: chaperonin GroEL
Protein accession: YP_002423939
Protein GI: 218533123
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0459] Chaperonin GroEL (HSP60 family)
TIGRFAM ID: [TIGR02348] chaperonin GroL
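The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino acid count for a CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(5620326, 5621966)
print(length, "bp")                   # 1641 bp
print(protein_length(length), "aa")   # 546 aa
```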


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 29
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCAGCGA AAGACGTACG TTTCTCCGCC GATGCTCGCG ACAAGATGCT GCGCGGCGTC 
GACATCCTCG CGGATGCCGT CAAGGTGACG CTCGGCCCCA AGGGCCGCAA CGTCGTGATC
GAGAAGAGCT TCGGCGCCCC GCGCATCACC AAGGACGGCG TCACCGTCGC CAAGGAGATC
GAGCTCGCCG ACCGCTTCGA GAACATGGGC GCACAGATGG TGCGCGAAGT GGCCTCGAAG
ACCAACGACA TCGCCGGTGA CGGCACCACC ACCGCGACCG TGCTGGCCCA GGCGATCGTC
CGCGAAGGCG CCAAGTACGT CGCCGCCGGC ATCAACCCGA TGGACCTGAA GCGCGGCATC
GACCTCGCCA CGGCCGCCGC GGTGAAGGAC ATCACCGCCC GCGCCAAGAA GGTCGCCTCC
TCCGAAGAGG TCGCCCAGGT CGGCACGATC TCCGCCAATG GCGACAAGGA GATCGGCGAG
ATGATCGCCC ACGCCATGCA GAAGGTGGGC AACGAGGGCG TCATCACCGT CGAGGAGGCC
AAGACCGCCG AGACCGAGCT CGACGTCGTC GAGGGCATGC AGTTCGACCG CGGCTACCTC
TCGCCGTACT TCGTGACCAA CGCCGAGAAG ATGGTCGCCG AGCTCGAGGA TCCCTACATC
CTCATCCACG AGAAGAAGCT CTCCTCGCTG CAGCCGATGC TGCCGGTGCT CGAGGCCGTG
GTGCAGACCG GCAAGCCGCT CGTCATCATC GCCGAGGACA TCGAGGGTGA GGCGCTCGCC
ACGCTCGTCG TGAACAAGCT GCGCGGCGGC CTCAAGGTCG CGGCCGTGAA GGCTCCGGGC
TTCGGTGATC GCCGCAAGGC GATGCTCGAG GACATCGCGA TCCTCACCAA GGGCCAGACC
ATCTCCGAGG ATCTCGGCAT CAAGCTCGAG AACGTCGCCC TGCCGATGCT CGGCCGCGCC
AAGCGCGTCC GCATCGAGAA GGAGACCACC ACGATCATCG ACGGTCTCGG CGAGAAGGCC
GACATCGAGG CCCGCGTCGG CCAGATCAAG GCGCAGATCG AGGAGACCAC CTCGGACTAC
GATCGTGAGA AGCTCCAGGA GCGTCTGGCC AAGCTCGCGG GCGGCGTCGC GGTGATCCGC
GTCGGCGGCG CGACCGAGGT CGAGGTCAAG GAGAAGAAGG ATCGCGTGGA CGACGCGCTC
AACGCCACCC GCGCTGCGGT GGAAGAGGGC ATCGTCCCCG GCGGCGGCGT CGCTCTGCTC
CTGGCCAAGA AGGCGGTCGC CGAGCTGAAG TCCGACATCC CGGACGTCCA GGCCGGCATC
AAGATCGTCC TCAAGGCCCT CGAAGCCCCG ATCCGTCAGA TCGCCAGCAA CGCGGGTGTC
GAGGGCTCCA TCGTCGTCGG CAAGATCACC GACAACGGCG GCGAGACCTT CGGCTTCAAC
GCGCAGACCG AAGAGTATGT CGACATGATC CAGGCCGGCA TCGTCGACCC GGCCAAGGTC
GTGCGTACCG CCCTTCAGGA CGCGGCCTCG GTCGCCGGCC TGCTGGTGAC CACGGAAGCC
ATGGTCGCCG ACGCGCCGAA GAAGGATAGC GGCGCCCCGG CGATGCCGGG CGGCGGCGGC
ATGGGCGGCA TGGACTTCTA A
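
The gene sequence is laid out in blocks of 10 bases, so the whitespace has to be stripped before any computation. The following is a minimal sketch of how the length and GC content fields could be recomputed from such a block; the helper names are hypothetical, and the short string used here is only the first two blocks of the sequence above, not the full gene.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First two 10-base blocks of the gene sequence, used as a small example.
example = clean_sequence("ATGGCAGCGA AAGACGTACG")
print(len(example))          # 20
print(gc_fraction(example))  # 0.55
```

Applying the same two functions to the complete sequence text should reproduce the Gene Length field and a value that rounds to the GC content field, provided the sequence was copied without loss.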
 
Protein sequence
MAAKDVRFSA DARDKMLRGV DILADAVKVT LGPKGRNVVI EKSFGAPRIT KDGVTVAKEI 
ELADRFENMG AQMVREVASK TNDIAGDGTT TATVLAQAIV REGAKYVAAG INPMDLKRGI
DLATAAAVKD ITARAKKVAS SEEVAQVGTI SANGDKEIGE MIAHAMQKVG NEGVITVEEA
KTAETELDVV EGMQFDRGYL SPYFVTNAEK MVAELEDPYI LIHEKKLSSL QPMLPVLEAV
VQTGKPLVII AEDIEGEALA TLVVNKLRGG LKVAAVKAPG FGDRRKAMLE DIAILTKGQT
ISEDLGIKLE NVALPMLGRA KRVRIEKETT TIIDGLGEKA DIEARVGQIK AQIEETTSDY
DREKLQERLA KLAGGVAVIR VGGATEVEVK EKKDRVDDAL NATRAAVEEG IVPGGGVALL
LAKKAVAELK SDIPDVQAGI KIVLKALEAP IRQIASNAGV EGSIVVGKIT DNGGETFGFN
AQTEEYVDMI QAGIVDPAKV VRTALQDAAS VAGLLVTTEA MVADAPKKDS GAPAMPGGGG
MGGMDF
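
The protein sequence can be re-derived from the gene sequence. The sketch below is one way to do that without external libraries. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in permitted start codons), and it assumes the sequence is given in the coding orientation starting at the first codon. The `translate` function is a hypothetical helper, not a database or library API.

```python
from itertools import product

# Codon table built in TCAG order; amino acid assignments are shared by
# the standard code and translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(blocked_dna: str) -> str:
    """Translate a block-formatted coding sequence, stopping at the first stop codon."""
    seq = "".join(blocked_dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First line start and last line of the gene sequence shown above.
print(translate("ATGGCAGCGA AAGACGTACG TTTCTCCGCC"))  # MAAKDVRFSA
print(translate("ATGGGCGGCA TGGACTTCTA A"))           # MGGMDF
```

The two outputs match the first block and the final block of the protein sequence listed above.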