Gene Franean1_0233 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_0233 
SymboldnaK 
ID5668658 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp285275 
End bp287113 
Gene Length1839 bp 
Protein Length612 aa 
Translation table11 
GC content68% 
IMG OID641239162 
Productmolecular chaperone DnaK 
Protein accessionYP_001504606 
Protein GI158312098 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0443] Molecular chaperone 
TIGRFAM ID[TIGR02350] chaperone protein DnaK 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0417702 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.306693 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCACGAG CGGTCGGTAT CGACCTCGGC ACCACGAACT CCGTCGTGAG CGTGCTCGAG 
GGTGGAGAGC CCACGGTGAT CGCCAATGCC GAGGGTTCCC GGACGACTCC GTCAGTCGTG
GCCTTCGCGA AGAACGGCGA GGTGCTCGTC GGCGAGGTCG CCAAGCGGCA GGCCGTGACG
AACGTCGAGC GGACCATTCG GTCGGTCAAG CGTCACATGG GCACCGACTG GAAGATGACG
GTGGACTCCA AGGAGTTCAC CCCCCAGGAG ATCAGCGCCC GCATCCTGCA GAAGCTCAAG
CGGGATGCCG AGGCATACCT GGGTGAGTCG GTGTCCGACG CCGTCATCAC CGTCCCGGCC
TACTTCGACG ACGCCCAGCG GCAGGCGACC ACGGAGGCCG GCACCATCGC CGGTCTGAAC
GTCCTGCGCA TCGTCAACGA GCCCACCGCG GCGGCGCTGG CCTACGGGCT GGACAAGGGT
GAGAAGGAGC AGACCATCCT GGTCTTCGAC CTCGGTGGCG GCACGTTCGA CGTGTCCCTG
CTCGAGATCG GCGACGGCGT GGTCGAGGTC AAGTCGACCT CCGGCGACAC CCAGCTCGGT
GGTGACGACT GGGACCAGCG CATCACCGAC CACCTGATCA AGACCTTCAA CGGCCAGCAC
GGTGTCGACC TCGGCAAGGA CAAGATGGCG CTGCAGCGCC TGCGCGAGGC CGCCGAGAAG
GCCAAGATCG AGCTGTCGCA GTCCACCCAG ACCTCGATCA ACCTGCCGTA CATCACCGCG
TCGGCCGAGG GCCCGTTGCA CCTGGACGTC TCGCTGTCCC GGGCGGAGCT CCAGAGGATG
ACGCAGGACC TGATCGACCG CTGCAAGGGC CCGTTCCAGC AGGCCGTCAA GGACGCCGGC
ATCAAGATCA GCGACGTCGA CCACGTCGTG CTCGTCGGCG GTTCGACCCG GATGCCCGCC
GTCGTGGACC TCGTCCGCGA GCTGACCGGC GGCAAGGAGC CGAACAAGGG CGTCAACCCG
GACGAGGTCG TCGCGGTCGG CGCGTCCCTG CAGGCCGGTG TCCTCAAGGG TGAGGTCAAG
GACGTCCTGC TGCTCGACGT CACCCCGCTG TCCCTGGGTA TCGAGACCAA GGGCGGCATC
ATGACCAAGC TGATCGAGCG GAACACGACG ATCCCGACGA AGCGCTCGGA GATCTTCACC
ACCGCCGAGG ACAGTCAGCC CTCCGTGCAG ATCCAGGTCT TCCAGGGCGA GCGCGAGATG
GCGGCCTACA ACAAGAAGCT CGGCATGTTC GAGCTGACCG GCCTGCCCCC GGCCCCCCGC
GGCATGCCGC AGATCGAGGT CACCTTCGAC ATCGACGCCA ACGGCATCGT GCACGTGTCC
GCGAAGGACC TGGGCACCGG CAAGGAGCAG TCGATGACGA TCACCGGCGG CTCGGCGCTG
CCGAAGGACG ACATCAACCG CATGGTGCAG GACGCCGAGC AGTACGCCGA GGAGGACCGC
CAGCGCCGGG AGGAGGCCGA GACCCGCAAC CGCGCCGAGT CCCTCGTGTA CTCGACGGAG
CGGTTCCTGG CCGAGAACGG CGACAAGATC GAGGCCGGCG TGAAGTCCGA CGTCGAGGAG
AAGCTCAGCA CGCTGCGCGG CGCGGTCGGC GGCACGGACA TCGCCGCGAT CCGCGAGGCG
GCGGACGAGC TGACCAAGGC CAGCCAGGCC ATGGGCGAGT CGATGTACGC CCAGGCCGCC
GGCGCGCAGG GCGCCCCTGG CGCTCCCGGA GCCGGGGCGG GCCAGGCCGA CGACGACGTG
GTGGACGCCG AGATCGTCGA CGAGGAGGAC AAGAAGTGA
 
Protein sequence
MARAVGIDLG TTNSVVSVLE GGEPTVIANA EGSRTTPSVV AFAKNGEVLV GEVAKRQAVT 
NVERTIRSVK RHMGTDWKMT VDSKEFTPQE ISARILQKLK RDAEAYLGES VSDAVITVPA
YFDDAQRQAT TEAGTIAGLN VLRIVNEPTA AALAYGLDKG EKEQTILVFD LGGGTFDVSL
LEIGDGVVEV KSTSGDTQLG GDDWDQRITD HLIKTFNGQH GVDLGKDKMA LQRLREAAEK
AKIELSQSTQ TSINLPYITA SAEGPLHLDV SLSRAELQRM TQDLIDRCKG PFQQAVKDAG
IKISDVDHVV LVGGSTRMPA VVDLVRELTG GKEPNKGVNP DEVVAVGASL QAGVLKGEVK
DVLLLDVTPL SLGIETKGGI MTKLIERNTT IPTKRSEIFT TAEDSQPSVQ IQVFQGEREM
AAYNKKLGMF ELTGLPPAPR GMPQIEVTFD IDANGIVHVS AKDLGTGKEQ SMTITGGSAL
PKDDINRMVQ DAEQYAEEDR QRREEAETRN RAESLVYSTE RFLAENGDKI EAGVKSDVEE
KLSTLRGAVG GTDIAAIREA ADELTKASQA MGESMYAQAA GAQGAPGAPG AGAGQADDDV
VDAEIVDEED KK