Gene Anae109_1472 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAnae109_1472 
SymbolgroEL 
ID5375113 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaeromyxobacter sp. Fw109-5 
KingdomBacteria 
Replicon accessionNC_009675 
Strand
Start bp1669870 
End bp1671513 
Gene Length1644 bp 
Protein Length547 aa 
Translation table11 
GC content68% 
IMG OID640842983 
Productchaperonin GroEL 
Protein accessionYP_001378663 
Protein GI153004338 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0459] Chaperonin GroEL (HSP60 family) 
TIGRFAM ID[TIGR02348] chaperonin GroL 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.0166914 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones42 
Fosmid unclonability p-value0.319334 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGGCCA AGGAGATCGC ATTCCACCAG GGGGCCCGCG AAGCCATCCT CCGCGGCGTC 
CAAACGCTCG CCGAGGCTGT CGCCGTGACC CTCGGCCCCA AGGGCCGCAA CGTCGTCATC
GAGAAGAGCT TCGGCTCGCC GACCATCACC AAGGACGGCG TCACGGTCGC GAAGGAGATC
GAGGTCGAGA ACAAGTTCGA GAACATGGGC GCGCAGATGG TCCGCGAGGT CGCGTCCCAG
ACCTCGGACA AGGCGGGCGA CGGCACCACC ACCGCCACCG TGCTCGCGCG CGCCCTCTTC
GAGGAGGGCC TGAAGCTCGT GGCGGCGGGC CACAACCCGA TGGACCTCAA GCGCGGCATC
GACCGGGCCG TCGAGGTGAT CGTCGCCGAG CTGAAGAAGC TCTCGAAGCC CACGCAGGGG
AAGAAGGACA TCGCCCAGGT CGGCACCATC TCCGCGAACG GCGACGAGAC GATCGGCAAC
ATCATCGCCG AGGCGATGGA GAAGGTGGGC AAGGAGGGCG TCATCACGGT CGAGGAGGCG
AAGGGCCTCG AGACGACGCT CGACGTGGTC GAGGGCATGC AGTTCGACCG CGGCTACTCC
TCCCCCTACT TCGTCACGAA CCCGGATCGC ATGGAGGCCG TGCTCGAGGA TCCGTTCATC
CTCATCACCG AGAAGAAGAT CTCGGCGATG GCCGACCTCA TCCCGGTGCT CGAGCAGGTC
GCCCGCTCCG GCAAGCCGCT CCTCATCGTC GCCGAGGACG TGGAGGGCGA GGCGCTCGCG
ACGCTCGTCG TGAACAAGCT GCGCGGCACG CTCCACGTGT GCGCGGTGAA GGCGCCCGGC
TTCGGCGACC GCCGCAAGGA GATGCTGAAG GACATCGCGA CGCTCACCGG CGGCAACGTG
GTCGCCGAGG AGCTCGGCAT CAAGCTCGAG CAGCTCACCG TGAAGGATCT CGGGCGCGCG
AAGCGCATCA CGATCGACAA GGAGAACACC ACGATCGTGG ACGGCGAGGG GAAGCGCGAG
GACATCGAGG CGCGCATCAA GCAGATCCGC GCGCAGATCG AGGAGACCAC GAGCGACTAC
GATCGCGAGA AGCTGCAGGA GCGGCTCGCG AAGCTCGTGG GCGGCGTCGC CGTGATCAAC
GTCGGCGCGG CCACCGAGAC CGAGATGAAG GAGAAGAAGG CCCGCGTCGA GGACGCGCTC
CACGCGACCC GCGCGGCCGT CGAGGAGGGC ATCGTCCCCG GCGGCGGCGT CGCCTACCTC
CGCGCGCTGC AGGCGCTGAA GAAGCTCGAG GTGCCCGAGG GCGATCAGCG CTTCGGCGTG
GCGATCGTGC AGAAGGCGCT CGAGTACCCG GCGCGCCGCA TCGCCGAGAA CGCCGGCTGG
GACGGCGCGG TGGTCGTCTC GAGGATCAAC GACGGCAAGG CGGCCCACGG CTTCAACGCC
GCGAGCGAGG TGTTCGAGGA TCTCGAGAAG GCGGGAGTCA TCGATCCGAC CAAGGTGTCC
CGCACCGCGC TCCAGAACGC CGCGTCCGTC GCGAGCCTCC TCCTCACCAC CGAGGCGATG
GTGGCCGAGA AGCCGAAGAA GAAGGGCGCG CCCGCCGGCG GCGGCATGGG CGGCATGGGC
GGCATGGACG AGATGGATTA CTGA
 
Protein sequence
MPAKEIAFHQ GAREAILRGV QTLAEAVAVT LGPKGRNVVI EKSFGSPTIT KDGVTVAKEI 
EVENKFENMG AQMVREVASQ TSDKAGDGTT TATVLARALF EEGLKLVAAG HNPMDLKRGI
DRAVEVIVAE LKKLSKPTQG KKDIAQVGTI SANGDETIGN IIAEAMEKVG KEGVITVEEA
KGLETTLDVV EGMQFDRGYS SPYFVTNPDR MEAVLEDPFI LITEKKISAM ADLIPVLEQV
ARSGKPLLIV AEDVEGEALA TLVVNKLRGT LHVCAVKAPG FGDRRKEMLK DIATLTGGNV
VAEELGIKLE QLTVKDLGRA KRITIDKENT TIVDGEGKRE DIEARIKQIR AQIEETTSDY
DREKLQERLA KLVGGVAVIN VGAATETEMK EKKARVEDAL HATRAAVEEG IVPGGGVAYL
RALQALKKLE VPEGDQRFGV AIVQKALEYP ARRIAENAGW DGAVVVSRIN DGKAAHGFNA
ASEVFEDLEK AGVIDPTKVS RTALQNAASV ASLLLTTEAM VAEKPKKKGA PAGGGMGGMG
GMDEMDY