Gene GM21_0233 details

Gene Information

Locus tag: GM21_0233
Symbol: groEL
ID: 8135540
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sp. M21
Kingdom: Bacteria
Replicon accession: NC_012918
Strand:
Start bp: 275318
End bp: 276961
Gene Length: 1644 bp
Protein Length: 547 aa
Translation table: 11
GC content: 62%
IMG OID: 644867854
Product: chaperonin GroEL
Protein accession: YP_003020076
Protein GI: 253698887
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0459] Chaperonin GroEL (HSP60 family)
TIGRFAM ID: [TIGR02348] chaperonin GroL
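
The Start bp, End bp, Gene Length and Protein Length fields can be cross-checked against each other. The following is a minimal Python sketch, assuming the coordinates are 1-based and inclusive and that the coding sequence ends in a single stop codon that is not counted in the protein length; the function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length(275318, 276961)
print(length, protein_length(length))  # 1644 547
```

Both results agree with the Gene Length (1644 bp) and Protein Length (547 aa) listed in the record.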


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 88
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCAGCAA AGCTTATCAA GTTCGATCAG GAAGCACGCA ACTGCATCCT CAAAGGTGTC 
AACACCCTGG CAGATGCCGT TAAAGTGACC CTGGGTCCCA AAGGGCGCAA CGTCGTCATC
GAGAAGTCCT ACGGCGCGCC GCTCATCACC AAGGACGGCG TCACCGTCGC CAAGGAAATC
GAGCTGGAAG ACAAGTTCGA GAACATGGGC GCGCAGTTGG TGAAGGAAGT CGCTTCCAAG
ACCTCCGACG TAGCCGGCGA CGGCACCACC ACCGCAACCG TGCTGGCACA GGCTATCTAC
CGCCAGGGCG CCAAGTTGGT CGCCGCGGGT CACAACCCGA TGGAAATCAA GCGCGGCCTG
GACCAGGCCG TCGAGGCCCT GGTCGCCGAG CTGAAGAACA TCTCCAAGCC GATCAAGGAC
CACAAGGAAA TCGCACAGGT CGGCACCATC TCCGCCAACA ACGACAAGAC CATCGGCGAC
ATCATCGCCG AGGCGATGGA GAAGGTCGGC AAGGAAGGGG TTATCACCGT CGAGGAAGCC
AAGGCGATGG AAACCACCCT TGAGACCGTC GAGGGGATGC AGTTCGACCG CGGCTACCTC
TCCCCCTACT TCGTCACCGA TCCGGAGCGC ATGGAAGCCG CGATGGACAA CGTCGCCATC
CTGATCCACG ACAAGAAGAT CGCTAACATG AAGGACCTCC TCCCGGTTCT CGAGCAGACC
GCCAAGTCCG GTCGTCCGCT GCTGATCATC GCCGAGGACA TCGAAGGCGA GGCCCTGGCA
ACTCTCGTGG TCAACAAGCT GCGCGGCGTC CTTAACGTCT GCGCCGTCAA GGCCCCGGGC
TTCGGCGACC GCCGTAAGGC CATGCTTGAA GACATCGCCA TCCTGACCGG CGGCAAGGTG
ATCTCCGAGG AAGTCGGCTT TAAACTCGAG AACACCACCC TCGACATGCT GGGCCAGGCC
AAAAAGATCA CCGTCGACAA GGACAACACC ACCATCATCG ACGGCTTCGG CGCCGAGGCC
GACATCCAGG GGCGCGTCAA GATGATCCGC GCCCAGATCG ATGAGACCTC CTCCGACTAC
GACCGCGAGA AGCTCCAGGA GCGCCTGGCG AAACTTGTGG GCGGCGTTGC CGTCATCAAG
GTCGGTGCCG CTACCGAGAT CGAGATGAAG GAGAAGAAGG CACGCGTCGA AGACGCACTG
CACGCAACCC GCGCAGCGGT CGACGAGGGT ATCGTCCCTG GAGGCGGGGT CGCTTACCTG
CGCGCCATGA AGGTGCTGGA AAACCTTCAG CTCGCACCGG AGCAGCAGTT CGGCGTAAAC
GTGATCAAGC GCGCCCTCGA GGAGCCGATC CGTCAGATCT CCCAGAACGC TGGCGTCGAC
GGCTCCATCG TCGTGGACAA GGTCAAAAAC GGCAAGGATG CCTTCGGCTA CAACGCCGCC
GACGACGTCT ATGTCGACAT GATCGAGGCC GGCATCATCG ACCCGACCAA GGTCTCCAGG
AGCGCGCTGC AGAACGCCGC TTCCGTGGCT GGTCTCATGA TGACGACCGA GGCGATGATC
GCCGACAAGC CGAAGGAAGA AGGCGCGATG CCGGCGATGC CGGGTGGCAT GGGCGGCATG
GGCGGCATGG GCGGCATGAT GTAG
 
Protein sequence
MAAKLIKFDQ EARNCILKGV NTLADAVKVT LGPKGRNVVI EKSYGAPLIT KDGVTVAKEI 
ELEDKFENMG AQLVKEVASK TSDVAGDGTT TATVLAQAIY RQGAKLVAAG HNPMEIKRGL
DQAVEALVAE LKNISKPIKD HKEIAQVGTI SANNDKTIGD IIAEAMEKVG KEGVITVEEA
KAMETTLETV EGMQFDRGYL SPYFVTDPER MEAAMDNVAI LIHDKKIANM KDLLPVLEQT
AKSGRPLLII AEDIEGEALA TLVVNKLRGV LNVCAVKAPG FGDRRKAMLE DIAILTGGKV
ISEEVGFKLE NTTLDMLGQA KKITVDKDNT TIIDGFGAEA DIQGRVKMIR AQIDETSSDY
DREKLQERLA KLVGGVAVIK VGAATEIEMK EKKARVEDAL HATRAAVDEG IVPGGGVAYL
RAMKVLENLQ LAPEQQFGVN VIKRALEEPI RQISQNAGVD GSIVVDKVKN GKDAFGYNAA
DDVYVDMIEA GIIDPTKVSR SALQNAASVA GLMMTTEAMI ADKPKEEGAM PAMPGGMGGM
GGMGGMM
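
The relationship between the gene sequence and the protein sequence can be reproduced with a short script. The sketch below is illustrative only: it assumes the sequence is pasted in the 10-base blocks shown above, builds the codon assignments of translation table 11 (which match the standard genetic code for elongation codons), and stops at the first stop codon. The helper names `clean`, `translate` and `gc_percent` are hypothetical.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering.
# Translation table 11 uses the same codon-to-residue assignments
# as the standard code; it differs only in permitted start codons.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases of the gene sequence above.
print(translate("ATGGCAGCAA AGCTTATCAA GTTCGATCAG"))  # MAAKLIKFDQ
```

Passing the full gene sequence to `translate` should yield the protein sequence listed above, and `gc_percent` on the same input can be compared with the GC content reported in the Gene Information section.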