Gene GM21_3574 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_3574 
SymboldnaK 
ID8138947 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp4155147 
End bp4157069 
Gene Length1923 bp 
Protein Length640 aa 
Translation table11 
GC content64% 
IMG OID644871194 
Productmolecular chaperone DnaK 
Protein accessionYP_003023353 
Protein GI253702164 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0443] Molecular chaperone 
TIGRFAM ID[TIGR02350] chaperone protein DnaK 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones124 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCAGAG TAATAGGAAT AGACCTCGGG ACCACCAACT CCTGCGTGGC AGTTATGGAA 
GGTGGTGAAC CGGTTGTCAT AGCGAACGCA GAAGGTAGCC GCACCACTCC TTCCATGATC
GCCTTCGCCG AAAGCGGCGA GCGTCTGGTG GGGCAGCAGG CCAAGCGCCA GGCGGTCACC
AACCCGGAAA ACACCCTGTA CGCCATCAAG CGCCTGATCG GCCGCAAGTT CGATACCGAG
GCGGTCAAGA AGGACATCGC CATCTCCCCG TTCAAGATCG TCAAGGCGGA CAACTCCGAC
GCCTGGGTCG AGGTGCGGGG CCAGAAGTAC TCGCCCCCCG AGATCTCGGC GATGGTGCTG
CAGAAGATGA AGAAGACCGC CGAGGACTAC CTGGGCGAGA CCGTCACCGA CGCGGTCATC
ACCGTCCCGG CTTACTTCGA CGACTCCCAG CGCCAGGCGA CCAAGGACGC CGGCAAGATC
GCGGGCCTCA ACGTGCTCCG CATCATCAAC GAGCCGACCG CGGCAGCACT CGCCTACGGC
CTGGACAAGA AGAAGGACGA GAAGATCGCC GTGTTCGACC TGGGCGGCGG CACCTTCGAC
GTCTCCATCC TGGAGCTGGG CGAGGGGGTC TTCGAGGTGA AGTCCACCAA CGGCGACACC
TTCCTGGGCG GCGAGGACTT CGACCAGAAG ATCATCGACC ACATAGCCGA CGAGTTCAAA
AAGGACCAGG GGATCGATCT CAGGGGCGAC AAGATGGCCC TGCAGAGGCT GAAAGAGGCG
GGCGAGAAGG CGAAGTGCGA GCTTTCCACC TCGCTTGAGA CCGACATCAA TCTCCCCTTC
ATCACGGCCG ACGCCTCGGG TCCCAAGCAC CTGACCATGA AGCTCACCCG CGCGAAGCTG
GAGTCCATCT GCGCCGAGCT GATCGCCAAC CTGGAAGGCC CCTGCCGCAC CGCCTTGAAA
GACGCCGGGC TCTCCGCCTC GGACATCGAC GAAGTCATCC TGGTGGGGGG CATGACCCGC
ATGCCGATCG TGCAGAAGAA GGTGCAGGAC ATCTTCGGCA AGGTCCCCAA CCGCGGCGTG
AACCCGGACG AGGTGGTCGC CATCGGCGCA GCCATCCAGG GCGGCGTTCT GCGCGGCGAC
GTGAAGGACG TGCTCCTTCT CGACGTCACC CCGCTTTCCC TGGGCATCGA GACCCTGGGG
GGCGTGTTGA CCAAGCTGAT CGACAAGAAC TCCACCATCC CCTGCCGGAA GAGCCAGGTC
TTCTCCACCG CCGCCGACAA CCAGCCCGCG GTCAGCATCC ACGTGCTGCA GGGCGAGCGC
GAGATGGCGG CCGACAACAA GACGCTCGGC AACTTCGAGC TCTCCGGCAT CCCGTCGGCT
CCCCGCGGCG TCCCGCAGAT CGAGGTGACC TTCGACATCG ACGCCAACGG CATCGTCCAC
GTCTCCGCCA AGGACCTCGG CACCGGCAAG GAGCAGTCCA TCCGCATCAC CGCTTCCTCG
GGTCTTTCCA AGGAAGAGGT CGAGAAGATG GTGCGCGAGG CCGAGGCGCA CGCGGCCGAC
GATAAGAAAA AGCGCGAGCT GATCGAGGCG AAGAACCAGG CGGACAACCT GATCTACCAG
ACCGAGAAGT CGCTCACCGA GTTCGGCGAC AAGATCGACG CCTCCGAGAA GCAGAAGATC
GAGGAAGGTG TCGCCGCCCT CAAGAAGGCA CTGGAAGGAA GCGACGCCGA CGAGATCAAG
AAGGCGAGCG ACTCCCTGAT GCAGGCTTCC CACAAGCTGG CCGAGGCGGT CTACGCGAAG
ACTCAGGGTG CTGGCGCCGA GGGTAGCGAG CAGCCGCACG GCGAGCAGGA GGCAGGCGGC
GCGGCCAAGG GGGAGACGGT CGTCGACGCC GACTTCGAGG AAGTGAAGGA CGACAAGAAG
TAA
 
Protein sequence
MSRVIGIDLG TTNSCVAVME GGEPVVIANA EGSRTTPSMI AFAESGERLV GQQAKRQAVT 
NPENTLYAIK RLIGRKFDTE AVKKDIAISP FKIVKADNSD AWVEVRGQKY SPPEISAMVL
QKMKKTAEDY LGETVTDAVI TVPAYFDDSQ RQATKDAGKI AGLNVLRIIN EPTAAALAYG
LDKKKDEKIA VFDLGGGTFD VSILELGEGV FEVKSTNGDT FLGGEDFDQK IIDHIADEFK
KDQGIDLRGD KMALQRLKEA GEKAKCELST SLETDINLPF ITADASGPKH LTMKLTRAKL
ESICAELIAN LEGPCRTALK DAGLSASDID EVILVGGMTR MPIVQKKVQD IFGKVPNRGV
NPDEVVAIGA AIQGGVLRGD VKDVLLLDVT PLSLGIETLG GVLTKLIDKN STIPCRKSQV
FSTAADNQPA VSIHVLQGER EMAADNKTLG NFELSGIPSA PRGVPQIEVT FDIDANGIVH
VSAKDLGTGK EQSIRITASS GLSKEEVEKM VREAEAHAAD DKKKRELIEA KNQADNLIYQ
TEKSLTEFGD KIDASEKQKI EEGVAALKKA LEGSDADEIK KASDSLMQAS HKLAEAVYAK
TQGAGAEGSE QPHGEQEAGG AAKGETVVDA DFEEVKDDKK