Gene TBFG_10445 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTBFG_10445 
SymbolgroEL 
ID5221109 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium tuberculosis F11 
KingdomBacteria 
Replicon accessionNC_009565 
Strand
Start bp531778 
End bp533400 
Gene Length1623 bp 
Protein Length540 aa 
Translation table11 
GC content65% 
IMG OID640605186 
Productchaperonin GroEL 
Protein accessionYP_001286390 
Protein GI148821636 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0459] Chaperonin GroEL (HSP60 family) 
TIGRFAM ID[TIGR02348] chaperonin GroL 


Plasmid Coverage information

Num covering plasmid clones248 
Plasmid unclonability p-value0.00698847 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones228 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCAAGA CAATTGCGTA CGACGAAGAG GCCCGTCGCG GCCTCGAGCG GGGCTTGAAC 
GCCCTCGCCG ATGCGGTAAA GGTGACATTG GGCCCCAAGG GCCGCAACGT CGTCCTGGAA
AAGAAGTGGG GTGCCCCCAC GATCACCAAC GATGGTGTGT CCATCGCCAA GGAGATCGAG
CTGGAGGATC CGTACGAGAA GATCGGCGCC GAGCTGGTCA AAGAGGTAGC CAAGAAGACC
GATGACGTCG CCGGTGACGG CACCACGACG GCCACCGTGC TGGCCCAGGC GTTGGTTCGC
GAGGGCCTGC GCAACGTCGC GGCCGGCGCC AACCCGCTCG GTCTCAAACG CGGCATCGAA
AAGGCCGTGG AGAAGGTCAC CGAGACCCTG CTCAAGGGCG CCAAGGAGGT CGAGACCAAG
GAGCAGATTG CGGCCACCGC AGCGATTTCG GCGGGTGACC AGTCCATCGG TGACCTGATC
GCCGAGGCGA TGGACAAGGT GGGCAACGAG GGCGTCATCA CCGTCGAGGA GTCCAACACC
TTTGGGCTGC AGCTCGAGCT CACCGAGGGT ATGCGGTTCG ACAAGGGCTA CATCTCGGGG
TACTTCGTGA CCGACCCGGA GCGTCAGGAG GCGGTCCTGG AGGACCCCTA CATCCTGCTG
GTCAGCTCCA AGGTGTCCAC TGTCAAGGAT CTGCTGCCGC TGCTCGAGAA GGTCATCGGA
GCCGGTAAGC CGCTGCTGAT CATCGCCGAG GACGTCGAGG GCGAGGCGCT GTCCACCCTG
GTCGTCAACA AGATCCGCGG CACCTTCAAG TCGGTGGCGG TCAAGGCTCC CGGCTTCGGC
GACCGCCGCA AGGCGATGCT GCAGGATATG GCCATTCTCA CCGGTGGTCA GGTGATCAGC
GAAGAGGTCG GCCTGACGCT GGAGAACGCC GACCTGTCGC TGCTAGGCAA GGCCCGCAAG
GTCGTGGTCA CCAAGGACGA GACCACCATC GTCGAGGGCG CCGGTGACAC CGACGCCATC
GCCGGACGAG TGGCCCAGAT CCGCCAGGAG ATCGAGAACA GCGACTCCGA CTACGACCGT
GAGAAGCTGC AGGAGCGGCT GGCCAAGCTG GCCGGTGGTG TCGCGGTGAT CAAGGCCGGT
GCCGCCACCG AGGTCGAACT CAAGGAGCGC AAGCACCGCA TCGAGGATGC GGTTCGCAAT
GCCAAGGCCG CCGTCGAGGA GGGCATCGTC GCCGGTGGGG GTGTGACGCT GTTGCAAGCG
GCCCCGACCC TGGACGAGCT GAAGCTCGAA GGCGACGAGG CGACCGGCGC CAACATCGTG
AAGGTGGCGC TGGAGGCCCC GCTGAAGCAG ATCGCCTTCA ACTCCGGGCT GGAGCCGGGC
GTGGTGGCCG AGAAGGTGCG CAACCTGCCG GCTGGCCACG GACTGAACGC TCAGACCGGT
GTCTACGAGG ATCTGCTCGC TGCCGGCGTT GCTGACCCGG TCAAGGTGAC CCGTTCGGCG
CTGCAGAATG CGGCGTCCAT CGCGGGGCTG TTCCTGACCA CCGAGGCCGT CGTTGCCGAC
AAGCCGGAAA AGGAGAAGGC TTCCGTTCCC GGTGGCGGCG ACATGGGTGG CATGGATTTC
TGA
 
Protein sequence
MAKTIAYDEE ARRGLERGLN ALADAVKVTL GPKGRNVVLE KKWGAPTITN DGVSIAKEIE 
LEDPYEKIGA ELVKEVAKKT DDVAGDGTTT ATVLAQALVR EGLRNVAAGA NPLGLKRGIE
KAVEKVTETL LKGAKEVETK EQIAATAAIS AGDQSIGDLI AEAMDKVGNE GVITVEESNT
FGLQLELTEG MRFDKGYISG YFVTDPERQE AVLEDPYILL VSSKVSTVKD LLPLLEKVIG
AGKPLLIIAE DVEGEALSTL VVNKIRGTFK SVAVKAPGFG DRRKAMLQDM AILTGGQVIS
EEVGLTLENA DLSLLGKARK VVVTKDETTI VEGAGDTDAI AGRVAQIRQE IENSDSDYDR
EKLQERLAKL AGGVAVIKAG AATEVELKER KHRIEDAVRN AKAAVEEGIV AGGGVTLLQA
APTLDELKLE GDEATGANIV KVALEAPLKQ IAFNSGLEPG VVAEKVRNLP AGHGLNAQTG
VYEDLLAAGV ADPVKVTRSA LQNAASIAGL FLTTEAVVAD KPEKEKASVP GGGDMGGMDF