Gene Noca_3982 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoca_3982 
SymbolgroEL 
ID4598117 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardioides sp. JS614 
KingdomBacteria 
Replicon accessionNC_008699 
Strand
Start bp4200741 
End bp4202366 
Gene Length1626 bp 
Protein Length541 aa 
Translation table11 
GC content68% 
IMG OID639778587 
Productchaperonin GroEL 
Protein accessionYP_925166 
Protein GI119718201 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0459] Chaperonin GroEL (HSP60 family) 
TIGRFAM ID[TIGR02348] chaperonin GroL 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.455852 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCGAAGC TGATTGCTTT CAACGAGGAG GCCCGGCGCG GTCTCGAGCG TGGCATGAAC 
ACGCTCGCGG ACGCTGTCAA GGTCACGCTC GGGCCCAAGG GTCGCAACGT CGTCCTCGAG
AAGAAGTGGG GCGCCCCCAC GATCACCAAC GATGGTGTGT CCATCGCCAA GGAGATCGAG
CTCGAGGACC CCTACGAGAA GATCGGTGCC GAGCTCGTCA AGGAGGTCGC CAAGAAGACC
GACGACGTCG CCGGCGACGG CACCACCACC GCCACCGTCC TCGCCCAGGC GATGGTCCGC
GAGGGCCTGC GCAACGTGGC CGCCGGTGCG AACCCGATGG GCCTCAAGCG CGGCATCGAG
GCGGCCGTCG AGGCCGTGTC CGGCCAGCTG CTGAGCATGG CCAAGGACGT CGAGACCAAG
GAGCAGATCG CGTCCACCGC GAGCATCTCC GCGGCCGACA CGACCGTCGG CGAGATCATC
GCCGAGGCGA TGGACAAGGT CGGCAAGGAA GGCGTCATCA CCGTCGAGGA GTCGAACACC
TTCGGGCTCG ACCTCGAACT CACCGAGGGC ATGCGCTTCG ACAAGGGCTA CATCTCGGCG
TACTTCGTCA CCGACCCCGA GCGGATGGAG ACGGTCCTCG AGGACCCCTA CGTCCTGATC
GCGAACCAGA AGATCTCCTC GGTCAAGGAC CTGCTGCCGC TGCTCGAGAA GGTCATGCAG
TCCGGCAAGC CGCTGCTGAT CCTGGCCGAG GACGTCGACG GCGAGGCCCT GTCCACGCTG
GTCGTCAACA AGATCCGCGG CACCTTCAAG TCCGTCGCCG TCAAGGCCCC GGGCTTCGGT
GACCGCCGCA AGGCCATGCT GCAGGACATC GCGATCCTCA CCGGCGGCCA GGTCATCTCC
GAGGAGGTCG GCCTCAAGCT CGAGTCGACC GGCATCGAGC TGCTCGGCCA GGCCCGCAAG
GTCGTCATCA CCAAGGACGA GACCACGATC GTCGAGGGTG CCGGCGACGC CGACCAGATC
GCCGGCCGGG TCAACCAGAT CCGCGCCGAG ATCGAGAAGT CGGACTCCGA CTACGACCGC
GAGAAGCTCC AGGAGCGCCT CGCCAAGCTG GCCGGCGGCG TGGCCGTCAT CAAGGTCGGC
GCGGCCACCG AGGTCGAGCT CAAGGAGCGC AAGCACCGCA TCGAGGACGC CGTTCGCAAC
GCGAAGGCGG CCGTCGAGGA GGGCATCGTC GCCGGTGGTG GCGTGGCGCT CGTCCAGGCT
GCCAACGCCG CGTTCGACAA GCTCGACCTC ACCGGTGACG AGGCCGTGGG TGCGCAGATC
GTGCGCTTCG CGACCGATGC CCCGCTCAAG CAGATCGCGA TCAACGCCGG CCTCGAGGGC
GGCGTCGTGG CGGAGAAGGT GCGCGGCCTC ACGGCCGGTC ACGGCCTCAA TGCGGCCACC
GGCGAGTACG TCGACATGAT CGCCTCGGGC ATCATCGACC CCGCCAAGGT GACCCGCAGC
GCGCTGCAGA ACGCCGCCTC CATCGCGGCG CTCTTCCTCA CCACCGAGGC CGTCGTGGCC
GACAAGCCGG AGAAGGCCGC CCCGATGGGC GACCCGTCCG GCGGCATGGG CGGCATGGAC
TTCTGA
 
Protein sequence
MPKLIAFNEE ARRGLERGMN TLADAVKVTL GPKGRNVVLE KKWGAPTITN DGVSIAKEIE 
LEDPYEKIGA ELVKEVAKKT DDVAGDGTTT ATVLAQAMVR EGLRNVAAGA NPMGLKRGIE
AAVEAVSGQL LSMAKDVETK EQIASTASIS AADTTVGEII AEAMDKVGKE GVITVEESNT
FGLDLELTEG MRFDKGYISA YFVTDPERME TVLEDPYVLI ANQKISSVKD LLPLLEKVMQ
SGKPLLILAE DVDGEALSTL VVNKIRGTFK SVAVKAPGFG DRRKAMLQDI AILTGGQVIS
EEVGLKLEST GIELLGQARK VVITKDETTI VEGAGDADQI AGRVNQIRAE IEKSDSDYDR
EKLQERLAKL AGGVAVIKVG AATEVELKER KHRIEDAVRN AKAAVEEGIV AGGGVALVQA
ANAAFDKLDL TGDEAVGAQI VRFATDAPLK QIAINAGLEG GVVAEKVRGL TAGHGLNAAT
GEYVDMIASG IIDPAKVTRS ALQNAASIAA LFLTTEAVVA DKPEKAAPMG DPSGGMGGMD
F