Gene Namu_4827 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_4827 
Symbol 
ID8450457 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp5382729 
End bp5384354 
Gene Length1626 bp 
Protein Length541 aa 
Translation table11 
GC content68% 
IMG OID645043866 
Productchaperonin GroEL 
Protein accessionYP_003204091 
Protein GI258654935 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0459] Chaperonin GroEL (HSP60 family) 
TIGRFAM ID[TIGR02348] chaperonin GroL 


Plasmid Coverage information

Num covering plasmid clones40 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCAAGC TGATCGCTTT CGACGAGGAA GCGCGTCGCG GCCTCGAGCG GGGGATGAAC 
ACCCTCGCCG ACGCCGTCAA GGTGACCTTG GGGCCGCGCG GCCGCAACGT CGTCCTGGAG
AAGAAGTGGG GCGCGCCCAC CATCACCAAC GATGGTGTGT CGATCGCCAA GGAGATCGAG
CTCGAGGACC CGTACGAGAA GATCGGGGCC GAGCTCGTCA AGGAAGTTGC CAAGAAGACC
GACGACGTCG CCGGTGACGG CACCACCACC GCCACCGTGC TGGCCCAGGC GCTGGTCCGC
GAGGGCCTGC GCAACGTGGC CGCCGGCGCC AACCCGATGG GTCTGAAGCG GGGCATCGAG
AAGGCCGTCG AGGCCGTCTC CGCCCAGCTG CTCAAGGACG CCAAGGAGGT CGAGACCAAG
GAGCAGATCG CGGCCACCGC CTCCATCTCC GCGGCCGACT CCTCCATCGG CGAGCTCATC
GCCGAGGCGA TGGACAAGGT CGGCAAGGAA GGCGTCATCA CCGTCGAGGA GAGCAACACC
TTCGGCCTCG AGCTCGAGCT CACCGAGGGC ATGCGCTTCG ACAAGGGCTA CACCTCGCTG
TACTTCGCCA CCGACCAGGA GCGTCAGGAG GCCGTCCTCG AGGATCCCTA CATCCTGCTG
TACGGCTCGA AGATCTCCTC GGTCAAGGAC CTGCTGCCGC TGCTGGAGAA GGTCATCCAG
TCCGGCAAGG CCCTGCTGAT CATCGCCGAG GACGTCGAGG GCGAGGCCCT GGCGACCCTG
GTGGTCAACA AGATCCGTGG CACCTTCAAG TCGGTTGCCG TCAAGGCCCC CGGCTTCGGT
GACCGCCGCA AGGCCATGCT GCAGGACATC GCCATCCTCA CCGGTGGCCA GGTCATCAGC
GAGGATGTCG GCCTCAAGCT GGAGAACACC GACCTGTCCC TGCTGGGTCA GGCCCGCAAG
GTCGTCGTGA CCAAGGACGA GACCACCATC GTCGACGGTT CCGGCGATGC CGAGCAGATC
GCCGGCCGGG TGGCCCAGAT CCGCTCCGAG ATCGAGAAGA GCGACTCGGA CTACGACCGC
GAGAAGCTGC AGGAGCGGCT GGCCAAGCTG GCCGGCGGCG TTGCGGTCAT CAAGGCCGGA
GCGGCCACCG AGGTGGAGCT CAAGGAGCGC AAGCACCGCA TCGAAGATGC CGTGCGCAAC
GCCAAGGCTG CCGTGGAGGA GGGCATCGTC GCCGGTGGCG GCGTCGCCCT GCTGCAGGCC
GCGATCGTGG CCTTCCAGGG CCTGGAGCTG ACCGGGGACG AGGCGACCGG CGCGAACATC
GTGCGCGTGG CCGTCGAGGC TCCGCTCAAG CAGATCGCGA TCAACGCCGG CCTCGAGGGC
GGCGTCGTGG CGGAGAAGGT CAAGGGTCTG CCCGCGGGCG AGGGCCTGGA CGCCGCCACC
GGCGAGTACA AGGACCTGAT CAAGGCCGGC ATCATCGACC CGGCCAAGGT CACCCGGTCC
GCGCTGCAGA ACGCCGCGTC CATCGCCGCG CTGTTCCTGA CCACCGAAGC CGTGGTCGCG
GACAAGCCGG AGAAGGCCTC GGCGCCGGCC GGCGGCGGTA TGCCCGGCGG GGACATGGAC
TTCTGA
 
Protein sequence
MAKLIAFDEE ARRGLERGMN TLADAVKVTL GPRGRNVVLE KKWGAPTITN DGVSIAKEIE 
LEDPYEKIGA ELVKEVAKKT DDVAGDGTTT ATVLAQALVR EGLRNVAAGA NPMGLKRGIE
KAVEAVSAQL LKDAKEVETK EQIAATASIS AADSSIGELI AEAMDKVGKE GVITVEESNT
FGLELELTEG MRFDKGYTSL YFATDQERQE AVLEDPYILL YGSKISSVKD LLPLLEKVIQ
SGKALLIIAE DVEGEALATL VVNKIRGTFK SVAVKAPGFG DRRKAMLQDI AILTGGQVIS
EDVGLKLENT DLSLLGQARK VVVTKDETTI VDGSGDAEQI AGRVAQIRSE IEKSDSDYDR
EKLQERLAKL AGGVAVIKAG AATEVELKER KHRIEDAVRN AKAAVEEGIV AGGGVALLQA
AIVAFQGLEL TGDEATGANI VRVAVEAPLK QIAINAGLEG GVVAEKVKGL PAGEGLDAAT
GEYKDLIKAG IIDPAKVTRS ALQNAASIAA LFLTTEAVVA DKPEKASAPA GGGMPGGDMD
F