Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GM21_3574 |
Symbol | dnaK |
ID | 8138947 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | - |
Start bp | 4155147 |
End bp | 4157069 |
Gene Length | 1923 bp |
Protein Length | 640 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 644871194 |
Product | molecular chaperone DnaK |
Protein accession | YP_003023353 |
Protein GI | 253702164 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0443] Molecular chaperone |
TIGRFAM ID | [TIGR02350] chaperone protein DnaK |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 124 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCAGAG TAATAGGAAT AGACCTCGGG ACCACCAACT CCTGCGTGGC AGTTATGGAA GGTGGTGAAC CGGTTGTCAT AGCGAACGCA GAAGGTAGCC GCACCACTCC TTCCATGATC GCCTTCGCCG AAAGCGGCGA GCGTCTGGTG GGGCAGCAGG CCAAGCGCCA GGCGGTCACC AACCCGGAAA ACACCCTGTA CGCCATCAAG CGCCTGATCG GCCGCAAGTT CGATACCGAG GCGGTCAAGA AGGACATCGC CATCTCCCCG TTCAAGATCG TCAAGGCGGA CAACTCCGAC GCCTGGGTCG AGGTGCGGGG CCAGAAGTAC TCGCCCCCCG AGATCTCGGC GATGGTGCTG CAGAAGATGA AGAAGACCGC CGAGGACTAC CTGGGCGAGA CCGTCACCGA CGCGGTCATC ACCGTCCCGG CTTACTTCGA CGACTCCCAG CGCCAGGCGA CCAAGGACGC CGGCAAGATC GCGGGCCTCA ACGTGCTCCG CATCATCAAC GAGCCGACCG CGGCAGCACT CGCCTACGGC CTGGACAAGA AGAAGGACGA GAAGATCGCC GTGTTCGACC TGGGCGGCGG CACCTTCGAC GTCTCCATCC TGGAGCTGGG CGAGGGGGTC TTCGAGGTGA AGTCCACCAA CGGCGACACC TTCCTGGGCG GCGAGGACTT CGACCAGAAG ATCATCGACC ACATAGCCGA CGAGTTCAAA AAGGACCAGG GGATCGATCT CAGGGGCGAC AAGATGGCCC TGCAGAGGCT GAAAGAGGCG GGCGAGAAGG CGAAGTGCGA GCTTTCCACC TCGCTTGAGA CCGACATCAA TCTCCCCTTC ATCACGGCCG ACGCCTCGGG TCCCAAGCAC CTGACCATGA AGCTCACCCG CGCGAAGCTG GAGTCCATCT GCGCCGAGCT GATCGCCAAC CTGGAAGGCC CCTGCCGCAC CGCCTTGAAA GACGCCGGGC TCTCCGCCTC GGACATCGAC GAAGTCATCC TGGTGGGGGG CATGACCCGC ATGCCGATCG TGCAGAAGAA GGTGCAGGAC ATCTTCGGCA AGGTCCCCAA CCGCGGCGTG AACCCGGACG AGGTGGTCGC CATCGGCGCA GCCATCCAGG GCGGCGTTCT GCGCGGCGAC GTGAAGGACG TGCTCCTTCT CGACGTCACC CCGCTTTCCC TGGGCATCGA GACCCTGGGG GGCGTGTTGA CCAAGCTGAT CGACAAGAAC TCCACCATCC CCTGCCGGAA GAGCCAGGTC TTCTCCACCG CCGCCGACAA CCAGCCCGCG GTCAGCATCC ACGTGCTGCA GGGCGAGCGC GAGATGGCGG CCGACAACAA GACGCTCGGC AACTTCGAGC TCTCCGGCAT CCCGTCGGCT CCCCGCGGCG TCCCGCAGAT CGAGGTGACC TTCGACATCG ACGCCAACGG CATCGTCCAC GTCTCCGCCA AGGACCTCGG CACCGGCAAG GAGCAGTCCA TCCGCATCAC CGCTTCCTCG GGTCTTTCCA AGGAAGAGGT CGAGAAGATG GTGCGCGAGG CCGAGGCGCA CGCGGCCGAC GATAAGAAAA AGCGCGAGCT GATCGAGGCG AAGAACCAGG CGGACAACCT GATCTACCAG ACCGAGAAGT CGCTCACCGA GTTCGGCGAC AAGATCGACG CCTCCGAGAA GCAGAAGATC GAGGAAGGTG TCGCCGCCCT CAAGAAGGCA CTGGAAGGAA GCGACGCCGA CGAGATCAAG AAGGCGAGCG ACTCCCTGAT GCAGGCTTCC CACAAGCTGG CCGAGGCGGT CTACGCGAAG ACTCAGGGTG CTGGCGCCGA GGGTAGCGAG CAGCCGCACG GCGAGCAGGA GGCAGGCGGC GCGGCCAAGG GGGAGACGGT CGTCGACGCC GACTTCGAGG AAGTGAAGGA CGACAAGAAG TAA
|
Protein sequence | MSRVIGIDLG TTNSCVAVME GGEPVVIANA EGSRTTPSMI AFAESGERLV GQQAKRQAVT NPENTLYAIK RLIGRKFDTE AVKKDIAISP FKIVKADNSD AWVEVRGQKY SPPEISAMVL QKMKKTAEDY LGETVTDAVI TVPAYFDDSQ RQATKDAGKI AGLNVLRIIN EPTAAALAYG LDKKKDEKIA VFDLGGGTFD VSILELGEGV FEVKSTNGDT FLGGEDFDQK IIDHIADEFK KDQGIDLRGD KMALQRLKEA GEKAKCELST SLETDINLPF ITADASGPKH LTMKLTRAKL ESICAELIAN LEGPCRTALK DAGLSASDID EVILVGGMTR MPIVQKKVQD IFGKVPNRGV NPDEVVAIGA AIQGGVLRGD VKDVLLLDVT PLSLGIETLG GVLTKLIDKN STIPCRKSQV FSTAADNQPA VSIHVLQGER EMAADNKTLG NFELSGIPSA PRGVPQIEVT FDIDANGIVH VSAKDLGTGK EQSIRITASS GLSKEEVEKM VREAEAHAAD DKKKRELIEA KNQADNLIYQ TEKSLTEFGD KIDASEKQKI EEGVAALKKA LEGSDADEIK KASDSLMQAS HKLAEAVYAK TQGAGAEGSE QPHGEQEAGG AAKGETVVDA DFEEVKDDKK
|
| |