Gene Afer_1846 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAfer_1846 
Symbol 
ID8323940 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAcidimicrobium ferrooxidans DSM 10331 
KingdomBacteria 
Replicon accessionNC_013124 
Strand
Start bp1931994 
End bp1933616 
Gene Length1623 bp 
Protein Length540 aa 
Translation table11 
GC content66% 
IMG OID644952977 
Productchaperonin GroEL 
Protein accessionYP_003110433 
Protein GI256372609 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0459] Chaperonin GroEL (HSP60 family) 
TIGRFAM ID[TIGR02348] chaperonin GroL 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCCAAGC TCATCGCGTT CGACGAGGAT GCTCGGCGCA AGCTCGAGCA GGGCATGAAC 
AAGCTTGCGG ACGCCGTGCG GGTGACCCTT GGACCGAAGG GCCGTAACGT CGTCCTCGAC
AAGAAGTGGG GCGCACCCAC GATCACCAAT GACGGCGTCT CCATCGCGAA GGAGATCGAG
CTCGAGGAGC CGTTCGAGAA GCTCGGTGCC GACCTCGTGA AGGAGGTCGC CAAGAAGACC
GACGACGTCG CTGGTGACGG TACGACCACG GCCACTGTGC TCGCGTGGGC GATGGTTCGC
GAGGGCCTGA AGAACGTTGC GGCAGGTGCC AACCCGATGT CACTCAAGCG TGGCATCGAG
GAGGCCGTTG CCGACGCAGT CCTCGCGCTC AAGTCCATCG CGAAGGAGAC CGAGACGCGC
GAGCAGGTCG CGCAGGTCGC CGCGATCTCC GCTGCCGATC CTGAGGTCGG AGCCATGATC
TCCGAGGCGA TCGAGCGCGT CGGCAAGGAC GGCGTGATCA CGGTGGAGGA GTCCCAGACC
TTCGGGATGG AGATCGACCT TGTCGAGGGC ATGCGCTTCG ACAAGGGCTA CATCTCGCCG
TACTTCGCCA CCGACACCGA GGCCATGACC GCGATCCTCG ACGACCCGTA CATTCTGCTG
GTCAGCTCGA AGATCTCGTC GGTGCGTGAG CTGTTGCCGG TGCTCGAGAA GGTGATGCAA
GCGGGCAAGC CGCTCCTCAT CATCGCAGAG GATGTCGAGG GCGAGGCGCT CGCGACGCTG
GTCGTGAACA AGATCCGAGG CACCTTCCGG TCGGTCGCGG TCAAGGCTCC TGGCTTTGGT
GAGCGCCGCA AGGCCATGCT GCAGGACATC GCGATCCTCA CCGGTGGCCA GGTCATCTCC
GAGGAGGTCG GGCTCAAGCT CGAGAACACG ACGCTCGACC TGTTGGGTCG CGCGCGCCGC
GTCGAGGTCA CGAAGGACGA GACGACGATC ATCGAGGGCG CGGGGAGCGA CGCCGACATC
AAGGGTCGGA TCCAGCAGAT CCGTAACGAG ATCGAGGCGA CGGACTCCGA CTACGACCGC
GAGAAGCTGC AGGAGCGTCT TGCCAAGCTG TCCGGCGGCG TCGCGATCAT CAAGGTCGGC
GCCGCGACCG AGGTGGAGCT CAAGGAGAAG AAGCACCGCA TCGAGGACGC CGTCTCGACC
ACCAAGGCAG CGATCGAAGA GGGTGTCGTG CCCGGCGGTG GCGTGGCGTT GCTGCGTGCC
CAGCGTGCGG TGCTCGACAA GGCAGAGAAG CTCGAGGGTG ACGAGGCGAC CGGTGCGAGG
ATCGTCGCCA AGGCGGTGGA GGAGCCGTTG CGGCAGATCG CGACCAACGC CGGCCTCGAA
GGTGGGGTCG TGGTCGAGCG AGTCAAGGCA CTCACCAACC CCAACGAGGG CTTGAACGCT
GCGACCGGGA CCTACGAGGA TCTGGTGGCG GCCGGTGTCA TCGACGCCGT GAAGGTCACC
CGTTCGGCGC TGCAGAACGC GGCGTCCATT GCGGCGCTCT TCCTGACCAC CGAGGCGGTC
GTGGTCGACA AGCCGGAGCC CAAGCCGGCG GTCAACCCTG GCGCCGGGAT GGAGGACTTC
TAA
 
Protein sequence
MAKLIAFDED ARRKLEQGMN KLADAVRVTL GPKGRNVVLD KKWGAPTITN DGVSIAKEIE 
LEEPFEKLGA DLVKEVAKKT DDVAGDGTTT ATVLAWAMVR EGLKNVAAGA NPMSLKRGIE
EAVADAVLAL KSIAKETETR EQVAQVAAIS AADPEVGAMI SEAIERVGKD GVITVEESQT
FGMEIDLVEG MRFDKGYISP YFATDTEAMT AILDDPYILL VSSKISSVRE LLPVLEKVMQ
AGKPLLIIAE DVEGEALATL VVNKIRGTFR SVAVKAPGFG ERRKAMLQDI AILTGGQVIS
EEVGLKLENT TLDLLGRARR VEVTKDETTI IEGAGSDADI KGRIQQIRNE IEATDSDYDR
EKLQERLAKL SGGVAIIKVG AATEVELKEK KHRIEDAVST TKAAIEEGVV PGGGVALLRA
QRAVLDKAEK LEGDEATGAR IVAKAVEEPL RQIATNAGLE GGVVVERVKA LTNPNEGLNA
ATGTYEDLVA AGVIDAVKVT RSALQNAASI AALFLTTEAV VVDKPEPKPA VNPGAGMEDF