Gene Namu_1208 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_1208 
Symbol 
ID8446804 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp1326457 
End bp1328088 
Gene Length1632 bp 
Protein Length543 aa 
Translation table11 
GC content71% 
IMG OID645040344 
Productchaperonin GroEL 
Protein accessionYP_003200603 
Protein GI258651447 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0459] Chaperonin GroEL (HSP60 family) 
TIGRFAM ID[TIGR02348] chaperonin GroL 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value0.730524 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGAAGC AGATCCGCTT CGATACCGAC GCGCGCGCGG CCCTGCAGCG CGGCGTCGAC 
AAGCTCGCCG ACGCGGTGAA GGTCACCCTT GGGCCGCGCG GCCGGTACGT GGTGCTGGAC
AAGAAGTTCG GCGGCCCGAC CATCACCAAT GACGGCGTCA CCATCGCTCG CGACATCGAG
CTGGACGACC CGAACGAGAA CATGGGCGCG CAGCTGGCCA AGACCGCGGC GACCAAGACC
AACGACGTGG CCGGCGACGG CACCACGACC GCGACCATCC TGACCCAGGC GATGATCGCC
GAGGGCCTGC GCAACGTCAC CGCCGGGGCG AACCCGCTGG CGCTGCGCTC GGGCATCACC
CAGGCCGCCG ACCGGGTCAA CGAGCTGCTC ACTGAGTGGG CGACCCCGGT GGCCGGCGAC
CGCGAGGCCA TCGCCCAGGT CGGCACCATC GCCTCCCGCG ACGAGGTGAT CGGCGACCTG
CTGGGCGATG CCCTGCAGCA CGTCGGCGCC GACGGCGTGG TCAGCGTCGA GGAGCACTCC
GGGCTGACCA CCGAGCTCGA GTACACCGAC GGCGTGCAGT TCGACAAGGG CTACCTCTCG
CCGTACTTCG CGACCGACCC GGAGGCCGCC GAGGCCGTCC TGGAGGACGC GCTGGTGCTG
CTGGTGCGTG AGAAGATCTC CGCCCTGGCC GACCTGCTCC CGCTGCTGGA GAAGGTGCTG
GAGGCCAAGA AGCCGCTGCT GATCGTGGCC GAGGACGTCG ACGGCGAGGC GCTGTCCACC
CTGGTGGTCA ACGCCATCCG CAAGACGTTC ACCGTCGTCG CGGTCAAGGC GCCGTTCTTC
GGCGACCGGC GCAAGGCCTT CCAGCAGGAC CTGGCCATCG TCACCGGGGC CGAGGTCGTC
TCCGCCGAGG TCGGTCTCAA GCTGGCCGAG GTCGGCACCG AGGTGCTGGG CCGGGCCCGG
CGCATCACCG TCACCAAGGA CACGACCACG ATCGTGGACG GCGGCGGCTC GGCCGAGGCG
GTGGCCGATC GGGCCGCGCA GCTGCGGGCC GACATCGAGA GCACCGATTC GGACTGGGAT
CGGGAGAAGC TGCAGGAGCG GCTGGCCAAG CTGGCCGGTG GCGTGGCGCT GATCAAGGTC
GGCGCGGCCA CCGAGATCGA GGCCAAGGAG CGCAAGCACC GCATCGAGGA CGCGGTCAAC
GCGACCAAGG CGGCGGTGGC CGAAGGCATC ATCGCCGGCG GTGGATCCGC GCTGGTGCAC
GCCTCGGCCG CGCTGGCCGA GCTGCAGGAG CAGCTGTCCG GCGACGAGGC GCTCGGCGTC
GGCATCGTTC GGCGCGCGCT GTCCGCCCCG GCCTTCTGGA TCGCCGCCAA CGGTGGCCAG
GAGGGCGCCG TCGTGGTCAA CCGCATCGCG GATCTGCCGC GGGGCGAGGG CTATGACGCC
GGCCAGGACC GGTATGTCGA CCTGGTGCAG GCCGGCATCA TCGACCCGGT CAAGGTGACC
AAGTCGGCCG TGTCCAACGC TGCGTCGATC GCCGGCATGG TGCTGACCAC CGAGTCGACC
GTCGTCGACC TCCCGGAGGA CCAGCACGAC CACGGCGCTG ACGGCCACGG CCACCACGGC
CACAGCCACT GA
 
Protein sequence
MAKQIRFDTD ARAALQRGVD KLADAVKVTL GPRGRYVVLD KKFGGPTITN DGVTIARDIE 
LDDPNENMGA QLAKTAATKT NDVAGDGTTT ATILTQAMIA EGLRNVTAGA NPLALRSGIT
QAADRVNELL TEWATPVAGD REAIAQVGTI ASRDEVIGDL LGDALQHVGA DGVVSVEEHS
GLTTELEYTD GVQFDKGYLS PYFATDPEAA EAVLEDALVL LVREKISALA DLLPLLEKVL
EAKKPLLIVA EDVDGEALST LVVNAIRKTF TVVAVKAPFF GDRRKAFQQD LAIVTGAEVV
SAEVGLKLAE VGTEVLGRAR RITVTKDTTT IVDGGGSAEA VADRAAQLRA DIESTDSDWD
REKLQERLAK LAGGVALIKV GAATEIEAKE RKHRIEDAVN ATKAAVAEGI IAGGGSALVH
ASAALAELQE QLSGDEALGV GIVRRALSAP AFWIAANGGQ EGAVVVNRIA DLPRGEGYDA
GQDRYVDLVQ AGIIDPVKVT KSAVSNAASI AGMVLTTEST VVDLPEDQHD HGADGHGHHG
HSH