Gene Afer_0444 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAfer_0444 
Symbol 
ID8322503 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAcidimicrobium ferrooxidans DSM 10331 
KingdomBacteria 
Replicon accessionNC_013124 
Strand
Start bp441656 
End bp443290 
Gene Length1635 bp 
Protein Length544 aa 
Translation table11 
GC content67% 
IMG OID644951596 
Productchaperonin GroEL 
Protein accessionYP_003109085 
Protein GI256371261 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0459] Chaperonin GroEL (HSP60 family) 
TIGRFAM ID[TIGR02348] chaperonin GroL 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCGAAGA TCTTGGAGTT CGACGAGTCC GCTCGTCGTG CGCTCGAGGC GGGCGTCAAC 
AAGCTCGCCG ACACCGTCAA GGTGACGCTC GGCCCGAAGG GCCGCAACGT CGTGCTCGCC
AAGAGCTTCG GCGCGCCCAC GATCACCAAC GACGGCGTCT CCATCGCTCG CGAGATCGAG
CTCGAGGACC CCTTCGAGAA CATGGGTGCC CAGCTCGTCA AGGAGGTCGC CACCAAGACC
AACGACGTGG CTGGTGACGG CACCACGACC GCGACCGTCC TCGCGCAGGC GATGATCCGC
GAGGGGCTTC GCAACGTCGC GGCTGGTGCG AACCCGATGG CCCTCAAGCG CGGCATCGAG
CAGGCGGTCG CGGCCGCCGT CGAGTCCATC GCCGAGCAGG CCAAGCCGGT CGAGGGCCGC
AGTGACTTCG CCCAGGTCGC TGCCATCTCG GCAGCCGATC AGGCGGTCGG CGAGGTCCTC
GCCGAGGCCA TCGACAAGGT CGGCAAGGAC GGCACGGTCA CCGTCGAGGA GTCGAACACC
TTCGGGCTCG AGCTCGAGTT CACCGAGGGC ATGCAGTTCG ACAAGGGCTA CCTGTCGCCG
TACTTCGTGA CCAACCAGGA TCGCCAAGAG GCGGTGCTCG AGAACCCGTA CATCCTCTTC
TACGCGACGA AGATCTCCTC GATCCACGAG CTGCTGCCCG TGCTCGAGAA GGTCATGCAG
GCCGGTCGGT CGTTGCTCAT CGTCGCAGAG GACGTCGAGG GTGAGGCGCT CGCGACGTTG
GTCGTGAACA AGATCCGCGG CACCTTCACC TCGGTGGCGG TCAAGGCCCC CGGCTTCGGC
GAGCGCCGCA AGGCGATGCT GCAGGACATG GCGATCCTCA CGGGTGGCCA GGTCATCTCG
GAAGAGGTCG GCCTCAAGCT CGAGAACGTC ACGCTGGATC TGCTGGGCCA GGCTCGGCGC
ATCGAGGTGA CGAAGGACGA GACGAAGATC ATCGGCGGCG CCGGTCAGAA GGCGGACGTC
GATGGTCGGA TCGCCCAGAT CCGTCGTGAG ATCGAGGAGA CCGACTCCGA CTGGGACCGT
GAGAAGCTCC AGGAGCGCCT CGCCAAGCTC GCCGGTGGTG TCGCGGTCGT CAAGGTCGGC
GCCGCGACCG AGGTCGAGCT CAAGGAGAAG AAGCACCGCA TCGAGGATGC GCTCTCCGCG
ACTCGTGCAG CCATCGAGGA GGGGATCGTT GCCGGCGGTG GCACGGCGCT CATTCGCGCT
CGGGCGCGCG TGAACGACGT GGTCGCCAAG CTCGAGGGTG ACGAGGCGAC GGGCGCGACC
ATCGTGGCTC GTTCGCTCGA GGAGCCGCTC AAGTGGATCG CCTACAACGC GGGCATGGAA
GGCCCGGTGG TGGTCCAGAC GGTCGAGCAC GAGTCGGGCA ACGTTGGCCT GAACGCTCGT
ACCGGTGTGT ACGAGGACCT CGTGAAGGCC GGCGTGATCG ACCCGGCGAA GGTGACGCGC
TCTGCGCTGC AGAACGCAGC GTCCATCGCG GCCCTGCTGC TCACGACCGA GGCACTCGTG
GCCGACAAGC CCGAGGAGCC GGGTCAGGCG GCGGCCGGCG CTGGCGCAGC CGGCGGCATG
GGCGGCATGA TGTAA
 
Protein sequence
MPKILEFDES ARRALEAGVN KLADTVKVTL GPKGRNVVLA KSFGAPTITN DGVSIAREIE 
LEDPFENMGA QLVKEVATKT NDVAGDGTTT ATVLAQAMIR EGLRNVAAGA NPMALKRGIE
QAVAAAVESI AEQAKPVEGR SDFAQVAAIS AADQAVGEVL AEAIDKVGKD GTVTVEESNT
FGLELEFTEG MQFDKGYLSP YFVTNQDRQE AVLENPYILF YATKISSIHE LLPVLEKVMQ
AGRSLLIVAE DVEGEALATL VVNKIRGTFT SVAVKAPGFG ERRKAMLQDM AILTGGQVIS
EEVGLKLENV TLDLLGQARR IEVTKDETKI IGGAGQKADV DGRIAQIRRE IEETDSDWDR
EKLQERLAKL AGGVAVVKVG AATEVELKEK KHRIEDALSA TRAAIEEGIV AGGGTALIRA
RARVNDVVAK LEGDEATGAT IVARSLEEPL KWIAYNAGME GPVVVQTVEH ESGNVGLNAR
TGVYEDLVKA GVIDPAKVTR SALQNAASIA ALLLTTEALV ADKPEEPGQA AAGAGAAGGM
GGMM