Gene Smed_3389 details


Gene Information

Locus tag: Smed_3389
Symbol: dnaK
ID: 5324273
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sinorhizobium medicae WSM419
Kingdom: Bacteria
Replicon accession: NC_009636
Strand:
Start bp: 3593690
End bp: 3595615
Gene Length: 1926 bp
Protein Length: 641 aa
Translation table: 11
GC content: 60%
IMG OID: 640792340
Product: molecular chaperone DnaK
Protein accession: YP_001329045
Protein GI: 150398578
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0443] Molecular chaperone
TIGRFAM ID: [TIGR02350] chaperone protein DnaK
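
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end coordinates are 1-based and inclusive (an assumption, not stated in the record); the variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 3593690
end_bp = 3595615
gene_length_bp = 1926
protein_length_aa = 641

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# A complete CDS of n codons encodes n - 1 residues, because the final codon is the stop codon.
codon_count = gene_length_bp // 3
encoded_residues = codon_count - 1

print(span_bp == gene_length_bp)              # span agrees with the stated gene length
print(gene_length_bp % 3 == 0)                # length is a whole number of codons
print(encoded_residues == protein_length_aa)  # residues agree with the stated protein length
```

All three checks hold for this record under that assumption.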


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 30
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCTAAAG TTATTGGTAT TGACCTGGGA ACGACCAACT CCTGCGTCTC CGTCATGGAC 
GGAAAGGACG CGAAGGTAAT CGAGAACGCC GAGGGCGCGC GCACGACCCC CTCGATGGTG
GCATTCACCG AAGACGGCGA ACGTCTCGTC GGACAGCCGG CCAAGCGCCA GGCCGTGACC
AATCCCGAGA ACACGCTTTT TGCGATCAAG CGCCTGATCG GCCGCACCTT CGAGGATCCG
ACGACCCAGA AGGACAAGGG GATGGTCCCC TATAAGATCG TCAAGGCCGA TAATGGCGAC
GCCTGGGTCG AAGCCCATGA CAAGAGCTAT TCTCCCTCGC AGATTTCCGC GATGATCCTT
CAGAAAATGA AGGAAACGGC CGAGTCCTAT CTCGGTGAAA AGGTTGAGAA AGCGGTTATC
ACCGTTCCGG CCTACTTCAA CGACGCCCAA CGACAGGCCA CCAAGGACGC CGGCAAGATC
GCCGGCCTGG ACGTGTTGCG CATCATCAAC GAGCCGACCG CGGCCGCGCT TGCCTATGGC
CTCGACAAGA AGGAAGGCAA GACGATCGCC GTTTACGACC TTGGCGGCGG CACCTTCGAT
ATTTCGGTTC TCGAAATCGG CGACGGCGTC TTTGAAGTGA AGTCGACCAA TGGCGACACC
TTCCTTGGCG GCGAAGACTT CGACATGCGT CTGGTCGAAT ATCTTGCATC CGAGTTCAAG
AAGGAGCAGG GAATCGACCT GAAGAACGAT AAGCTCGCTT TGCAGCGCCT GAAGGAAGCT
GCCGAAAAGG CGAAGATCGA GCTTTCGTCC TCGCAGCAGA CCGAAATCAA CCTGCCGTTC
ATCACCGCGG ACGCTTCCGG TCCGAAGCAC CTGACGATGA AGCTGTCACG CGCCAAATTC
GAGAGCCTCG TCGACGACCT GATCCAGAAG ACCATCGCGC CGTGCAAGGC CGCGCTCAAG
GATGCAGGCG TTTCGGCTGC CGAGATCGAC GAAGTCGTTC TCGTCGGCGG CATGACCCGC
ATGCCGAAGG TCCAGGAAAC GGTGAAGCAA CTTTTCGGCA AGGAGCCACA TAAGGGCGTC
AACCCGGATG AGGTCGTCGC CATGGGCGCC GCCATCCAGG CCGGCGTTCT GCAGGGCGAC
GTCAAGGACG TGCTGCTGCT GGACGTTACC CCGCTCTCGC TCGGCATCGA AACGCTGGGC
GGCGTCTTCA CCCGCCTGAT CGAGCGCAAC ACCACGATCC CGACCAAGAA GAGCCAGGTC
TTCTCGACGG CCGACGACAA CCAGTCCGCC GTGACGATCC GCGTTTCCCA GGGCGAACGT
GAAATGGCGG CCGACAACAA GCTGCTCGGC CAGTTCGATC TCGTCGGCAT TCCGCCGGCG
CCTCGCGGCG TGCCGCAGAT CGAAGTCACC TTCGACATCG ATGCGAACGG CATCGTGCAG
GTGTCTGCCA AGGACAAGGG CACGGGCAAG GAGCACCAGA TCCGCATTCA GGCCTCTGGT
GGTCTTTCCG ACGCCGAGAT CGAGAAGATG GTCAAGGATG CCGAGGCCAA TGCGGAAGCC
GACAAGAAGC GCCGCGAAGG CGTAGAGGCC AAGAACCAGG CCGAAAGCCT GGTTCATTCC
TCGGAGAAAT CGCTACAGGA ACATGGCGAC AAGGTTTCCG AGACGGACCG GAAGGCCATC
GAGGATGCGA TTGCAGCGCT GAAGAGCGCC GTCGAAGCTT CCGAGCCGGA CGCCGAGGAC
ATCAAGGCCA AGACCAATAC GCTCATGGAA GTCTCCATGA AGCTCGGTCA GGCGATCTAT
GAGGCTCAGC AGACGGAATC CGCCCATGCT GATGCCGCCG CCGACGCTAA GCGCTCCGGG
GACGACGTGG TCGACGCCGA CTACGAGGAA GTCAAGGACG AGGACGACCG TAAGCGGTCG
GCGTAA
 
Protein sequence
MAKVIGIDLG TTNSCVSVMD GKDAKVIENA EGARTTPSMV AFTEDGERLV GQPAKRQAVT 
NPENTLFAIK RLIGRTFEDP TTQKDKGMVP YKIVKADNGD AWVEAHDKSY SPSQISAMIL
QKMKETAESY LGEKVEKAVI TVPAYFNDAQ RQATKDAGKI AGLDVLRIIN EPTAAALAYG
LDKKEGKTIA VYDLGGGTFD ISVLEIGDGV FEVKSTNGDT FLGGEDFDMR LVEYLASEFK
KEQGIDLKND KLALQRLKEA AEKAKIELSS SQQTEINLPF ITADASGPKH LTMKLSRAKF
ESLVDDLIQK TIAPCKAALK DAGVSAAEID EVVLVGGMTR MPKVQETVKQ LFGKEPHKGV
NPDEVVAMGA AIQAGVLQGD VKDVLLLDVT PLSLGIETLG GVFTRLIERN TTIPTKKSQV
FSTADDNQSA VTIRVSQGER EMAADNKLLG QFDLVGIPPA PRGVPQIEVT FDIDANGIVQ
VSAKDKGTGK EHQIRIQASG GLSDAEIEKM VKDAEANAEA DKKRREGVEA KNQAESLVHS
SEKSLQEHGD KVSETDRKAI EDAIAALKSA VEASEPDAED IKAKTNTLME VSMKLGQAIY
EAQQTESAHA DAAADAKRSG DDVVDADYEE VKDEDDRKRS A
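
The gene and protein sequences above are printed in blocks of ten characters, and the protein corresponds to a translation of the gene. The sketch below shows one way to strip that spacing, translate a stretch of the gene, and compute a GC percentage. It is an illustration rather than the method used by the source database. The helper names (`clean`, `translate`, `gc_percent`) are hypothetical. The codon table is the standard NCBI amino acid assignment, which translation table 11 shares; table 11 differs only in which codons may act as start codons, and this gene begins with ATG.

```python
from itertools import product

# NCBI codon ordering: bases T, C, A, G, with the third position varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the block spacing and line breaks used in the listing."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna):
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First and last lines of the gene sequence, copied from the listing above.
first_line = "ATGGCTAAAG TTATTGGTAT TGACCTGGGA ACGACCAACT CCTGCGTCTC CGTCATGGAC"
last_line = "GCGTAA"

print(translate(first_line))  # MAKVIGIDLGTTNSCVSVMD
print(translate(last_line))   # A
```

The first output matches the first two blocks of the protein sequence (MAKVIGIDLG TTNSCVSVMD), and the last line of the gene yields the final residue A followed by the TAA stop codon. Passing the full gene sequence to `gc_percent` gives a value to compare with the GC content listed in the record.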