Gene Mext_4782 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMext_4782 
Symbol 
ID5835441 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium extorquens PA1 
KingdomBacteria 
Replicon accessionNC_010172 
Strand
Start bp5341727 
End bp5343367 
Gene Length1641 bp 
Protein Length546 aa 
Translation table11 
GC content67% 
IMG OID641370579 
Productchaperonin GroEL 
Protein accessionYP_001642221 
Protein GI163854178 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0459] Chaperonin GroEL (HSP60 family) 
TIGRFAM ID[TIGR02348] chaperonin GroL 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0380913 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones41 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAGCGA AAGACGTACG TTTCTCCGCC GATGCTCGCG ACAAGATGCT GCGCGGCGTC 
GACATCCTCG CGGATGCCGT CAAGGTGACG CTCGGCCCCA AGGGCCGCAA CGTCGTGATC
GAGAAGAGCT TCGGCGCCCC GCGCATCACC AAGGACGGCG TCACCGTCGC CAAGGAGATC
GAGCTCGCCG ACCGCTTCGA GAACATGGGC GCACAGATGG TGCGCGAAGT GGCCTCGAAG
ACCAACGACA TCGCCGGTGA CGGCACCACC ACCGCGACCG TGCTGGCCCA GGCCATCGTC
CGCGAAGGCG CCAAGTACGT CGCCGCCGGC ATCAACCCGA TGGACCTGAA GCGCGGCATC
GACCTCGCCA CGGCCGCCGC GGTGAAGGAC ATCACCGCCC GCGCCAAGAA GGTCGCCTCC
TCCGAAGAGG TCGCCCAGGT CGGCACGATC TCCGCCAATG GCGACAAGGA GATCGGCGAG
ATGATCGCCC ACGCCATGCA GAAGGTGGGC AACGAGGGCG TCATCACCGT CGAGGAGGCC
AAGACCGCCG AGACCGAACT CGACGTCGTC GAGGGCATGC AGTTCGACCG CGGCTACCTC
TCGCCGTACT TCGTGACCAA CGCCGAGAAG ATGGTCGCCG AGCTCGAGGA TCCCTACATC
CTCATCCACG AGAAGAAGCT CTCCTCGCTG CAGCCGATGC TGCCGGTGCT CGAGGCCGTG
GTGCAGACCG GCAAGCCGCT CGTCATCATC GCCGAGGACA TCGAGGGTGA GGCGCTCGCC
ACGCTCGTCG TGAACAAGCT GCGCGGCGGC CTCAAGGTCG CGGCCGTGAA GGCTCCGGGC
TTCGGTGATC GCCGCAAGGC GATGCTCGAG GACATCGCGA TCCTCACCAA GGGCCAGACC
ATCTCCGAGG ATCTCGGCAT CAAGCTCGAG AACGTCGCCC TGCCGATGCT CGGCCGCGCC
AAGCGCGTCC GCATCGAGAA GGAGACCACC ACGATCATCG ACGGTCTCGG CGAGAAGGCC
GACATCGAGG CCCGCGTCGG TCAGATCAAG GCGCAGATCG AGGAGACCAC CTCGGACTAC
GATCGTGAGA AGCTCCAGGA GCGTCTGGCC AAGCTCGCGG GCGGCGTCGC GGTGATCCGC
GTCGGCGGCG CGACCGAGGT CGAGGTCAAG GAGAAGAAGG ATCGCGTGGA CGACGCGCTC
AACGCCACCC GCGCTGCGGT GGAAGAGGGC ATCGTCCCCG GCGGCGGCGT CGCGCTGCTC
CTGGCCAAGA AGGCGGTCGC CGAGCTGAAG TCCGACATCC CGGACGTCCA GGCCGGCATC
AAGATCGTCC TCAAGGCGCT CGAAGCCCCG ATCCGTCAGA TCGCCAGCAA CGCGGGTGTC
GAGGGCTCCA TCGTCGTCGG CAAGATCACC GACAACGGCG GCGAGACCTT CGGCTTCAAC
GCCCAGACCG AAGAGTATGT CGACATGATC CAGGCCGGCA TCGTCGACCC GGCCAAGGTC
GTGCGCACCG CCCTTCAGGA CGCGGCCTCG GTCGCCGGCC TGCTGGTGAC CACGGAAGCC
ATGGTCGCCG ACGCGCCGAA GAAGGACAGC GGCGCTCCGG CGATGCCGGG CGGCGGCGGC
ATGGGCGGCA TGGACTTCTA A
 
Protein sequence
MAAKDVRFSA DARDKMLRGV DILADAVKVT LGPKGRNVVI EKSFGAPRIT KDGVTVAKEI 
ELADRFENMG AQMVREVASK TNDIAGDGTT TATVLAQAIV REGAKYVAAG INPMDLKRGI
DLATAAAVKD ITARAKKVAS SEEVAQVGTI SANGDKEIGE MIAHAMQKVG NEGVITVEEA
KTAETELDVV EGMQFDRGYL SPYFVTNAEK MVAELEDPYI LIHEKKLSSL QPMLPVLEAV
VQTGKPLVII AEDIEGEALA TLVVNKLRGG LKVAAVKAPG FGDRRKAMLE DIAILTKGQT
ISEDLGIKLE NVALPMLGRA KRVRIEKETT TIIDGLGEKA DIEARVGQIK AQIEETTSDY
DREKLQERLA KLAGGVAVIR VGGATEVEVK EKKDRVDDAL NATRAAVEEG IVPGGGVALL
LAKKAVAELK SDIPDVQAGI KIVLKALEAP IRQIASNAGV EGSIVVGKIT DNGGETFGFN
AQTEEYVDMI QAGIVDPAKV VRTALQDAAS VAGLLVTTEA MVADAPKKDS GAPAMPGGGG
MGGMDF