Gene Gdia_1973 details


Gene Information

Locus tag: Gdia_1973
Symbol: dnaK
ID: 6975399
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Gluconacetobacter diazotrophicus PAl 5
Kingdom: Bacteria
Replicon accession: NC_011365
Strand:
Start bp: 2188981
End bp: 2190897
Gene Length: 1917 bp
Protein Length: 638 aa
Translation table: 11
GC content: 67%
IMG OID: 643391502
Product: molecular chaperone DnaK
Protein accession: YP_002276348
Protein GI: 209544119
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0443] Molecular chaperone
TIGRFAM ID: [TIGR02350] chaperone protein DnaK
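
The coordinates and lengths listed above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon; neither is stated in the record, and the function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length for 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # drop the terminal stop codon


length = gene_length_bp(2188981, 2190897)
print(length, protein_length_aa(length))  # prints: 1917 638
```

Under these assumptions the result matches the listed gene length (1917 bp) and protein length (638 aa).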


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value: 0.730312
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 45
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCAAGG TTATCGGTAT CGACCTCGGC ACGACCAACT CGTGCGTCGC CATCCGCGAA 
GGGGACGAAA CCCGCGTCAT CGAGAACAGC GAAGGCGCGC GCACGACCCC GTCGATGGTC
GCCTTCGCCG ACAACGGCGA AATGCTGGTC GGCCAGTCCG CCAAGCGCCA GGCCGTCACC
AACCCCGCCA ACACGCTGTA TGCCGTCAAG CGCCTGATCG GCCGTCGCTA TGACGACCCG
ACGGTGACCA AGGACAAGGC GCTGGTCCCC TATTCGATCT TCGCGGGTGA CAACGGCGAC
GCGTGGGTCG AGGCGCAGGG CAAGAAATAC GCGCCGTCGC AGATCGCCGC CTTCGTGCTG
GGCAAGATGA AGGAAACCGC CGAGGCCTAT CTGGGCGAGC CGGTGTCGCA GGCCGTCATC
ACGGTTCCGG CCTACTTCAA CGACGCGCAG CGCCAGGCGA CCAAGGATGC CGGCCGCATC
GCCGGCCTGG AAGTCCTGCG CATCATCAAC GAGCCGACGG CCGCGGCCCT GGCCTATGGC
CTGCAGAAGA AGAACGGCGG CACCATCGCG GTCTATGACC TGGGCGGCGG CACGTTCGAC
GTGTCGATCC TGGAGATCTC GGACGGCGTG ATCGAGGTGA AGTCCACCAA CGGCGACACG
TTCCTGGGCG GCGAGGATTT CGACGCGCGG GTCATCAGCT ATCTGGCCGA CGAGTTCAAG
CGCGAGCAGG GCATCGACCT GCGCGCCGAC AAGCTGGCGC TGCAGCGCCT GAAGGAAGCG
GCGGAAAAGG CGAAGATCGA GCTGTCCTCC TCGAAGGAGA CCGAGATCAA CCTGCCCTTC
ATCACCGCCG ACGCCTCGGG CCCGAAGCAT CTGGTGCTGA AGCTGAGCCG CGCGAAGCTG
GAAAGCCTGG TCGACGACCT GATCCAGCGC ACGATGGAGC CCTGCCGCGC CGCCCTGAAG
GACGCGTCGG TCTCGGCGGC GGAAATCGAT GAGGTGATCC TGGTCGGCGG CATGACCCGC
ATGCCCAAGG TCATCGAGGC GGTGAAGCAG TTCTTCGGCA AGGAGCCGGC CCGCAACGTG
AACCCCGACG AGGTCGTGGC GATCGGCGCC GCGGTGCAGG GCGCGGTGCT GAAGGGTGAC
GTCAAGGACG TCCTGCTGCT GGACGTGACG CCGCTGTCGC TGGGCATCGA GACGCTGGGT
GGCGTGTTCA CCCGCCTGAT CGACCGCAAC ACCACGATCC CGACCAAGAA GAGCCAGACC
TTCTCGACCG CCGAGGACAA CCAGGGCGCC GTGACCATCA AGGTGTTCCA GGGCGAGCGC
GAGATGGCGG CGGACAACAA GCTGCTGGGC AATTTCGACC TGACCGGCAT CGCCCCCGCC
CCGCGCGGCG TGCCGCAGAT CGAGGTGACG TTCGACATCG ACGCCAACGG CATCGTGTCG
GTGTCGGCCA AGGACAAGGC GACCAACAAG GAACAGCAGA TCAAGATCCA GGCGTCGGGC
GGCCTGTCCG AGTCCGACAT CGAGAAGATG GTCAAGGACG CCGAGGCGAA CGCCACGGCC
GACAAGGCCA AGAAGGAACT GGTCGAGGCC CGCAACCAGG CCGAAAGCCT GGCGCACCAG
GTCGAGAAGT CGCTGGCCGA GGCGGGCGAC AAGGTTCCGG CGTCCGACAA GGCCGAAGCC
GAGGCCGCGA TCGCCGCTGT CCGCACGGCG CTGGAAGGCA GTGACGCGGC CGCGCTGAAG
AGCGCGTCCG AGACCCTGTC GCAGGCGGCG ATGAAGATCG GCGAGGCCGT CTACAAGGCC
GGTCAGGCCG AGGGCGCGGC GGGTGCCGGT GCCCCGGGTG CCGGCGCCAC CGGCCCGAAC
GGCGAGAAGG TCGTGGACGC CGATTTCGAG GACGTGGACG ACAAGAAGTC GGCCTGA
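
The GC content reported in the gene information can be recomputed from a sequence laid out like the one above, in space-separated blocks of ten bases. This is a minimal sketch; it assumes the figure is the plain fraction of G and C bases over the full gene sequence, which the record does not state, and `gc_content` is an illustrative helper, not part of any database tool.

```python
def gc_content(seq: str) -> float:
    """Fraction of G/C bases in a sequence; whitespace and line breaks are ignored."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


# Toy input in the same blocked layout as the record.
print(gc_content("ATGC GGCC"))  # prints: 0.75
```

Pasting the full gene sequence into the call, blocks and line breaks included, gives the value to compare against the listed percentage.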
 
Protein sequence
MSKVIGIDLG TTNSCVAIRE GDETRVIENS EGARTTPSMV AFADNGEMLV GQSAKRQAVT 
NPANTLYAVK RLIGRRYDDP TVTKDKALVP YSIFAGDNGD AWVEAQGKKY APSQIAAFVL
GKMKETAEAY LGEPVSQAVI TVPAYFNDAQ RQATKDAGRI AGLEVLRIIN EPTAAALAYG
LQKKNGGTIA VYDLGGGTFD VSILEISDGV IEVKSTNGDT FLGGEDFDAR VISYLADEFK
REQGIDLRAD KLALQRLKEA AEKAKIELSS SKETEINLPF ITADASGPKH LVLKLSRAKL
ESLVDDLIQR TMEPCRAALK DASVSAAEID EVILVGGMTR MPKVIEAVKQ FFGKEPARNV
NPDEVVAIGA AVQGAVLKGD VKDVLLLDVT PLSLGIETLG GVFTRLIDRN TTIPTKKSQT
FSTAEDNQGA VTIKVFQGER EMAADNKLLG NFDLTGIAPA PRGVPQIEVT FDIDANGIVS
VSAKDKATNK EQQIKIQASG GLSESDIEKM VKDAEANATA DKAKKELVEA RNQAESLAHQ
VEKSLAEAGD KVPASDKAEA EAAIAAVRTA LEGSDAAALK SASETLSQAA MKIGEAVYKA
GQAEGAAGAG APGAGATGPN GEKVVDADFE DVDDKKSA
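
The protein sequence can be cross-checked against the gene sequence by translating it codon by codon. The sketch below builds the codon table in the usual NCBI ordering (bases T, C, A, G). Translation table 11 assigns the same amino acids to codons as the standard code and differs in the permitted start codons. This sketch does not handle alternative starts, which is harmless here because the gene begins with ATG. The names `CODON_TABLE` and `translate_cds` are illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TTT, TTC, TTA, TTG, TCT, ... order.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_cds(seq: str) -> str:
    """Translate a CDS up to (not including) the first stop codon."""
    dna = "".join(seq.split()).upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(dna), 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence from the record.
print(translate_cds("ATGAGCAAGG TTATCGGTAT CGACCTCGGC"))  # prints: MSKVIGIDLG
```

The output matches the first block of the protein sequence; running the full gene sequence through the same function gives a string to compare against the complete listing.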