Gene Noca_3646 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoca_3646 
Symbol 
ID4595758 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardioides sp. JS614 
KingdomBacteria 
Replicon accessionNC_008699 
Strand
Start bp3870577 
End bp3872202 
Gene Length1626 bp 
Protein Length541 aa 
Translation table11 
GC content70% 
IMG OID639778254 
Productchaperonin GroEL 
Protein accessionYP_924833 
Protein GI119717868 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0459] Chaperonin GroEL (HSP60 family) 
TIGRFAM ID[TIGR02348] chaperonin GroL 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCCAAGA TCCTGGAGTT CGACGAGCAC GCCCGGCGCG CCCTGGAGCG CGGCGTCGAC 
GCGCTCGCCA ACGCCGTCAA GGTCACGCTC GGCCCGAAGG GCCGCTACGT CGTCCTGGAC
AAGAAGTGGG GCGCCCCGAC CATCACCAAC GACGGCGTGA CCGTCGCGCG TGAGGTCGAG
CTGGACGACC CGTTCGAGAA CCTCGGTGCG CAGCTCACCA AGGAGGTCGC CACCAAGACC
AACGACATCG CGGGCGACGG TACGACGACC GCGACGGTGC TGGCCCAGGC GATGGTCCAC
GAGGGCCTGC GCGCGGTCAC CGCCGGCGCG AACCCGATGG GACTCAAGCG CGGCATGGAC
GCCGCTGCCG AGGCCGTGGG CGACGCCCTC AAGGAGGCGG CCCGCGAGGT CGAGTCGCGC
GAGGACATGG CGTCGGTCGC CACGATCTCG AGCCGTGACA GCGTGATCGG CGACCTGCTC
GCCGAGGCCT TCGACAAGGT CGGCAAGGAC GGCGTGATCA CGGTCGAGGA GTCCAACACC
ATGGGCACCG AGCTCGAGTT CACCGAGGGC ATGCAGTTCG ACAAGGGCTA CATCAGCCAG
TACTTCGTCA CCGACCCGGA GCGGATGGAG GCCGTCCTCG AGGACCCCTA CATCCTGCTG
CACCAGGGCA AGATCTCCGC GGTCGCCGAG CTGCTGCCGC TGCTGGAGAA GGTGATCCAG
TCCGGCAAGC CGCTGTTCAT CCTGGCCGAG GACGTGGAGG GCGAGGCGCT CTCCACCCTG
GTCGTGAACA AGATCCGCGG CACCTTCAAC GCGGTCGCGG TGAAGAGCCC GGCGTTCGGT
GACCGTCGCA AGGCGATGAT GCAGGACATC GCGACCCTGA CCGGCGGTCA GGTCGTCGCC
CCCGAGGTCG GACTCAAGCT CGACCAGGTC GGCCTCGAGG TGCTCGGCCA GGCCCGCCGC
GTCGTCGTCA CCAAGGACGA CACCACGATC GTGGACGGCG CCGGCGACCC CAAGGACGTC
GAGGGCCGGG TCAACCAGAT CAAGGCCGAG GTGGAGAACA CCGACTCCGA CTGGGACCGC
GAGAAGCTCC AGGAGCGGCT CGCGAAGCTG GCCGGCGGCG TGTGCGTGAT CAAGGTCGGC
GCCGCCACCG AGGTGGAGCT GAAGGAGAAG AAGCACCGCA TCGAGGACGC CGTCTCCGCG
ACGCGCGCCG CGATCGAGGA GGGCATCGTC GCCGGCGGCG GCTCCGCCCT CGTCCACGCG
GTCTCGGTGC TCTCCGACGA CCTCGGCCTC ACCGGCGACG AGGCGCTCGG CGTCCGCGTG
GTCCGCAAGG CCGCCGACGA GCCGCTGCGC TGGATCGCCG AGAACGGCGG GGTCAACGGC
TACGTCGTCA CCACCAAGGT CCGCGAGCTC GGCCTCGGCA ACGGCTTCAA CGCCGCCACC
GAGGAGTACG GCGACCTGGT CGCCCAGGGC GTCCTGGACC CGGTGAAGGT CACCCGCTCC
GCGCTGGTCA ACGCCACCTC GATCGCGGGC ATGCTGCTCA CGACCGAGAC CCTGGTCGTC
GACAAGCCCG AGGAGGAGGA GCCCGCGGCA GCCGGTCACG GCCACGGGCA CGGCCACGGC
CACTGA
 
Protein sequence
MPKILEFDEH ARRALERGVD ALANAVKVTL GPKGRYVVLD KKWGAPTITN DGVTVAREVE 
LDDPFENLGA QLTKEVATKT NDIAGDGTTT ATVLAQAMVH EGLRAVTAGA NPMGLKRGMD
AAAEAVGDAL KEAAREVESR EDMASVATIS SRDSVIGDLL AEAFDKVGKD GVITVEESNT
MGTELEFTEG MQFDKGYISQ YFVTDPERME AVLEDPYILL HQGKISAVAE LLPLLEKVIQ
SGKPLFILAE DVEGEALSTL VVNKIRGTFN AVAVKSPAFG DRRKAMMQDI ATLTGGQVVA
PEVGLKLDQV GLEVLGQARR VVVTKDDTTI VDGAGDPKDV EGRVNQIKAE VENTDSDWDR
EKLQERLAKL AGGVCVIKVG AATEVELKEK KHRIEDAVSA TRAAIEEGIV AGGGSALVHA
VSVLSDDLGL TGDEALGVRV VRKAADEPLR WIAENGGVNG YVVTTKVREL GLGNGFNAAT
EEYGDLVAQG VLDPVKVTRS ALVNATSIAG MLLTTETLVV DKPEEEEPAA AGHGHGHGHG
H