Gene Mkms_1171 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMkms_1171 
SymbolgroEL 
ID4614549 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. KMS 
KingdomBacteria 
Replicon accessionNC_008705 
Strand
Start bp1255814 
End bp1257436 
Gene Length1623 bp 
Protein Length540 aa 
Translation table11 
GC content70% 
IMG OID639790847 
Productchaperonin GroEL 
Protein accessionYP_937174 
Protein GI119867222 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0459] Chaperonin GroEL (HSP60 family) 
TIGRFAM ID[TIGR02348] chaperonin GroL 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.320819 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCAAGC AGATCGAATT CAACGAAACT GCGCGCCGCG CAATGGAGAT CGGCGTAGAC 
AAGCTCGCCG ACGCGGTGAA GGTGACCCTG GGGCCGCGCG GTCGCAACGT GGTGCTCGCC
AAGTCGTGGG GCGGCCCGAC CGTCACCAAC GACGGTGTCA CCATCGCCCG TGAGATCGAC
CTCGAGGATC CGTTCGAGAA CCTCGGCGCC CAGCTGGTCA AGTCGGTGGC GACCAAGACC
AACGACGTCG CCGGCGACGG CACCACCACC GCCACCGTGC TGGCGCAGGC GCTGGTGCGC
GCCGGTCTGC GCAACGTCGC GGCCGGCGCC AACCCGATCG CCCTGGGCGC GGGGATCAGC
AAGGCCGCCG ACGCCGTGTC CGAGGCGCTG CTGGCCGCGG CCACCCCCGT CGACGACAAG
AGCGGTATCG CCCAGGTCGC CACGGTCAGC TCACGCGACG AGCAGATCGG CGAGCTGGTC
GGCGAGGCGA TGACCAAGGT CGGCCACGAC GGTGTCGTGA CCGTCGAGGA GTCCTCGACC
CTGAACACCG AACTCGAGGT CACCGAGGGC GTCGGCTTCG ACAAGGGGTT CATCTCGGCG
TACTTCGTCA CCGACTTCGA CTCCCAGGAA GCGGTGCTCG AGGACGCGCT GGTGCTGCTG
CACCGCGAGA AGGTCAGCTC GCTGCCGGAC CTGCTGCCGC TGCTGGAGAA GGTCGCCGAA
GCCGGTAAGC CGCTGCTGAT CATCGCCGAG GACGTCGAGG GTGAGGCGCT GTCCACACTG
GTGGTCAACG CGATCCGTAA GACGCTCAAG GCCGTCGCCG TCAAGGCGCC GTTCTTCGGT
GACCGTCGCA AGGCGTTCCT CGACGACCTC GCCGTCGTCA CCGGCGGCCA GGTGATCAAC
CCCGACGTCG GCCTGGTGCT GCGCGAGGTC GGCCTCGACG TCCTGGGCAC CGCGCGCCGC
GTGGTGGTCA CCAAGGACAG CACCGTGATC GTCGACGGCG GCGGCAGCGC CGACGCCATC
GCCGACCGGG CCAAGCAGCT GCGGGCCGAG ATCGAGGCGA CCGACTCCGA CTGGGATCGC
GAGAAGCTCG AGGAGCGGCT GGCCAAGCTG GCCGGCGGCG TCGCGGTGAT CAAGGTCGGT
GCGGCCACCG AGACCGATCT GAAGAAGCGC AAGGAAGCCG TCGAGGACGC GGTCTCCGCG
GCCAAGGCGG CCGTCGAGGA GGGCATCGTC ACCGGCGGCG GTGCCGCCCT GGTGCAGGCC
CGCAAGGCGC TGGACAGCCT GCGGGGCTCG GTCTCCGGCG ACGAGGCGCT CGGTGTCGAG
GTGTTCAACT CCGCGCTGTC GGCTCCGCTG TACTGGATCG CCACCAACGC CGGGCTCGAC
GGTTCGGTCG TGGTGAACAA GGTCAGCGAA CTGCCTGCGG GACAGGGCTT CAACGCCGCG
ACGCTCGAGT TCGGCGACCT GCTCGCCGAC GGCGTCGTCG ACCCGGTCAA GGTGACGCGA
TCGGCGGTGC TCAACGCCGC GTCGGTCGCC CGCATGGTGC TCACCACCGA GACCGCGATC
GTCGACAAGC CGGCCGAGGA AGAGGATCAC GGCCACGGCC ACCATCACGG CCACGCTCAC
TGA
 
Protein sequence
MSKQIEFNET ARRAMEIGVD KLADAVKVTL GPRGRNVVLA KSWGGPTVTN DGVTIAREID 
LEDPFENLGA QLVKSVATKT NDVAGDGTTT ATVLAQALVR AGLRNVAAGA NPIALGAGIS
KAADAVSEAL LAAATPVDDK SGIAQVATVS SRDEQIGELV GEAMTKVGHD GVVTVEESST
LNTELEVTEG VGFDKGFISA YFVTDFDSQE AVLEDALVLL HREKVSSLPD LLPLLEKVAE
AGKPLLIIAE DVEGEALSTL VVNAIRKTLK AVAVKAPFFG DRRKAFLDDL AVVTGGQVIN
PDVGLVLREV GLDVLGTARR VVVTKDSTVI VDGGGSADAI ADRAKQLRAE IEATDSDWDR
EKLEERLAKL AGGVAVIKVG AATETDLKKR KEAVEDAVSA AKAAVEEGIV TGGGAALVQA
RKALDSLRGS VSGDEALGVE VFNSALSAPL YWIATNAGLD GSVVVNKVSE LPAGQGFNAA
TLEFGDLLAD GVVDPVKVTR SAVLNAASVA RMVLTTETAI VDKPAEEEDH GHGHHHGHAH