Gene M446_5580 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagM446_5580 
Symbol 
ID6133323 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium sp. 4-46 
KingdomBacteria 
Replicon accessionNC_010511 
Strand
Start bp6120582 
End bp6122222 
Gene Length1641 bp 
Protein Length546 aa 
Translation table11 
GC content67% 
IMG OID641645705 
Productchaperonin GroEL 
Protein accessionYP_001772319 
Protein GI170743664 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0459] Chaperonin GroEL (HSP60 family) 
TIGRFAM ID[TIGR02348] chaperonin GroL 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones52 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTGCCA AGGACGTTCG TTTCTCCTCC GACGCGCGCG AGAAGATGCT GCGCGGTGTC 
GACATCCTCG CCAACGCCGT GAAGGTGACG CTCGGCCCCA AGGGCCGCAA CGTCGTGCTC
GAGAAGAGCT TCGGGGCTCC CCGCATCACC AAGGACGGCG TGACCGTCGC CAAGGAGATC
GAACTCGCCG ACAAGTTCGA GAACATGGGC GCCCAGATGG TGCGCGAGGT GGCCTCGAAG
ACCAGCGATC TCGCCGGTGA TGGCACCACG ACCGCCACCG TGCTGGCCCA GGCCATCGTC
AAGGAAGGCG CCAAGTACGT CGCCGCCGGC ATGAACCCGA TGGACCTCAA GCGCGGCATC
GACCTCGCCA CCGCCGCCAC CGTGAAGGAC ATTACTGGCC GCGCCAGGAA AGTTGTCTCG
TCGGAGGGGA TTGCCCAGGT CGGCACGATC TCGGCCAACG GCGACAAGGA GATCGGCGAG
ATGATCGCCC AGGCCATGCA GAAGGTCGGC AACGAGGGCG TGATCACGGC TGAAGAAGCG
AAGACCGCCG TGACCGAGCT CGACGTCGTC GAGGGCATGC AGTTCGACCG CGGCTACCTC
TCCCCGTACT TCATCACGAA CGCGGAGAAG ATGATCGCCG ATCTCGAGGA TCCTTACATC
CTGATCCACG AGAAGAAGCT GTCGTCGCTG CAGGCGATGC TGCCGGTGCT CGAGGCGGTG
GTGCAGACCG GCAAGCCGCT GCTGATCGTG GCCGAGGACA TCGAGGGCGA GGCGCTGGCC
ACGCTGGTGG TCAACAAGCT GCGCGGCGGT CTGAAGGTGG CGGCCGTGAA GGCGCCGGGC
TTCGGCGACC GGCGCAAGGC GATGCTGGAG GACATCGCGA TCCTGACCGC CGGTCAGATG
ATCGCGGAGG ATCTCGGCAT CAAGCTGGAG AACGTGACGC TGCCGATGCT CGGGCGGGCC
AAGCGGGTGC GGATCGAGAA GGAGAACACC ACGATCATCG ATGGGGCCGG GGAGAAGGCG
GACATCGAGG CGCGGGTGGC GCAGATCAAG GCGCAGATCG AGGAGACGAC CTCGGACTAC
GACCGGGAGA AGCTGCAGGA GCGCCTGGCC AAGCTCGCGG GCGGAGTTGC GATCATCCGC
GTCGGCGGTT CGACCGAGGT CGAGGTCAAG GAGAAGAAGG ACCGTGTAGA GGACGCCCTT
CACGCCACCC GCGCGGCGGT GGAGGAGGGC ATCGTCCCGG GCGGCGGCAC CGCGCTCCTG
CGGGCCCAGG CGGCCGTGGC CGCGCTCACG AGCGATAACC CGGATGTCCA GGCTGGCATC
AAGATCGTGC TCAGGGCGCT GGAGGCCCCG ATCCGCCAAA TCGCCGAGAA CGCGGGCGTC
GAGGGCTCGA TCGTGGTCGG CCAGATCTCC AACAACACGG GCTCTGAGAC GTACGGCTTC
AACGCCCAGA CCGAGGAGTA CGTGGACCTG CTCGAGGCCG GTGTCGTCGA TCCGGCCAAG
GTGGTGCGCA CGGCCATGCA GGGTGCGGCC TCGGTCGCCG GCCTGCTCGT CACCACGGAG
GCGATGGTGG CGGATGCGCC GAAGAAGGAA AGCTCGGCTC CCGCCATGCC CGGCGGCGGC
ATGGGCGGCA TGGACTTCTG A
 
Protein sequence
MAAKDVRFSS DAREKMLRGV DILANAVKVT LGPKGRNVVL EKSFGAPRIT KDGVTVAKEI 
ELADKFENMG AQMVREVASK TSDLAGDGTT TATVLAQAIV KEGAKYVAAG MNPMDLKRGI
DLATAATVKD ITGRARKVVS SEGIAQVGTI SANGDKEIGE MIAQAMQKVG NEGVITAEEA
KTAVTELDVV EGMQFDRGYL SPYFITNAEK MIADLEDPYI LIHEKKLSSL QAMLPVLEAV
VQTGKPLLIV AEDIEGEALA TLVVNKLRGG LKVAAVKAPG FGDRRKAMLE DIAILTAGQM
IAEDLGIKLE NVTLPMLGRA KRVRIEKENT TIIDGAGEKA DIEARVAQIK AQIEETTSDY
DREKLQERLA KLAGGVAIIR VGGSTEVEVK EKKDRVEDAL HATRAAVEEG IVPGGGTALL
RAQAAVAALT SDNPDVQAGI KIVLRALEAP IRQIAENAGV EGSIVVGQIS NNTGSETYGF
NAQTEEYVDL LEAGVVDPAK VVRTAMQGAA SVAGLLVTTE AMVADAPKKE SSAPAMPGGG
MGGMDF