Gene Anae109_4474 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAnae109_4474 
SymboldnaK 
ID5373851 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaeromyxobacter sp. Fw109-5 
KingdomBacteria 
Replicon accessionNC_009675 
Strand
Start bp5240952 
End bp5242778 
Gene Length1827 bp 
Protein Length608 aa 
Translation table11 
GC content70% 
IMG OID640846002 
Productmolecular chaperone DnaK 
Protein accessionYP_001381636 
Protein GI153007311 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0443] Molecular chaperone 
TIGRFAM ID[TIGR02350] chaperone protein DnaK 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.306982 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones40 
Fosmid unclonability p-value0.169072 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGGAAGG TCATCGGCAT CGACCTCGGC ACCACGAACT CCTGCGTCTC GGTGATGGAG 
GGGGGCGACG CCGTCGTCAT CCCGAACAGC GAGGGCTCGC GCACCACCCC GTCCATGGTG
GCCTTCACCG AGGGCGGCGA GCGGCTCGTC GGGCAGATCG CGAAGCGCCA GGCCATCACC
AACCCCGAGG CCACCGTCCA CGGCGTGAAG CGGCTCATCG GGCGCAAGTT CGACGACGCG
GAGGTGAGGC GCTCCGTCGG GCTCGTGCCG TACCGCATCG CGGCGGCCGA GAACGGCGAC
GCCTGGGTGG AGGCCGCCGG CAAGCCGCAC TCGCCGGCCG AGATCTCGGC GATGGTCCTC
GCCAAGATGA AGCAGACCGC GGAGGACTAC CTCGGCGAGC CCGTCTCCGA GGCGATCGTC
ACCTGCCCCG CCTACTTCAA CGACGCCCAG CGCCAGGCGA CCAAGGACGC GGGGCGCATC
GCCGGGCTGA ACGTCCTCCG CATCATCAAC GAGCCGACCG CGGCGGCCCT CGCCTACGGC
ATCGACAAGC AGAAGGCGGG CGCGACCGAG CGCGTCGCGG TGTACGACCT CGGGGGCGGC
ACCTTCGACA TCACGGTCCT CGAGCTCAAC CTCGGCGTGT TCGAGGTGAA GGCGACGAAC
GGCGACACGT TCCTCGGCGG CGAGGACTTC GACCAGCGGC TCATCGACTG GATGGCGCGG
CGCTTCCGCG AGCAGACCGG CGTCGACCTC ACCCGCGACC GCATGGCGCT CCAGCGGCTC
AAGGAGGCCG CGGAGCGCGC CAAGCACGAG CTCTCCAGCG CGATCGAGAC GGAGGTGAAC
CTCCCGTTCA TCACCGCCGA CGCCACCGGC CCCAAGCACC TCGCCGAGAC CATCGACCGC
GCCACGCTCG AGGAGCTGTG CGGCGACCTG ATCGAGCGGA CGCTGGAGCC GTGCCGCACC
GCCCTCGAGG ACGCGGGCGT CTCGGTGCAG CAGATCGACA CCGTCATCCT GGTCGGCGGC
ATGACCCGCA TGCCGAAGGT GCAGGAGGTG GTGAAGCGCT TCTTCGGCAG GGAGCCGCAC
AAGGGCGTGA ACCCGGACGA GGTCGTCGCG GTGGGCGCGG CGATCCAGGG CGGCGTGCTG
AAGGGCGAGG TGAAGGACGT CCTGCTCCTC GACGTGACGC CCCTCTCGCT GGGCGTCGAG
ACCGCGGGCG GCGTCTTCAC CAAGATCATC GAGAAGAACA CCACCGTCCC CTGCAAGAAG
TCGCAGGTGT TCTCGACGGC GGTGGACAAC CAGCCGCTCG TGTCCGTCCA CGTGCTGCAG
GGCGAGCGCG GCATGGCCGC GGACGACAAG ACCCTGGGGC GCTTCGAGCT CGTCGGCATC
CCGCCCGCGC CCCGCGGCGT GCCGCAGATC GAGGTCACCT TCGACATCGA CGCGAACGGG
ATCGTGCACG TCTCCGCGAA GGACCTCGGC ACCGGCAAGC AGCAGCAGAT CCGGATCACC
GGCTCCTCCG GCCTCACGGA GGCGGAGATC CAGCGGATGA TCCGCGACGC GGAGGCGAAC
CGCGCCGACG ACGCCGCGAA GAAGGAGCTC GCCGATCTCA AGAACAACGC CGAGGGGCTC
GTCTACACGA CCGAGAAGAG CCTCGAGGAG TACGCGAGCG CGCTCGCCGC GGACGACCTC
GCCGAGATCC GCGCCGACCT CGAGCTGCTG AAGGGCGTGC TGCAGGGCTC CGACGGCGGG
GCGATCAAGG AAGCGCTCAC CCGCCTGGAG GGCAGCGCCT ACCGGATCGC CGACGCCATC
TACGCGCAGC AGGGCGGCGG GACGTAG
 
Protein sequence
MGKVIGIDLG TTNSCVSVME GGDAVVIPNS EGSRTTPSMV AFTEGGERLV GQIAKRQAIT 
NPEATVHGVK RLIGRKFDDA EVRRSVGLVP YRIAAAENGD AWVEAAGKPH SPAEISAMVL
AKMKQTAEDY LGEPVSEAIV TCPAYFNDAQ RQATKDAGRI AGLNVLRIIN EPTAAALAYG
IDKQKAGATE RVAVYDLGGG TFDITVLELN LGVFEVKATN GDTFLGGEDF DQRLIDWMAR
RFREQTGVDL TRDRMALQRL KEAAERAKHE LSSAIETEVN LPFITADATG PKHLAETIDR
ATLEELCGDL IERTLEPCRT ALEDAGVSVQ QIDTVILVGG MTRMPKVQEV VKRFFGREPH
KGVNPDEVVA VGAAIQGGVL KGEVKDVLLL DVTPLSLGVE TAGGVFTKII EKNTTVPCKK
SQVFSTAVDN QPLVSVHVLQ GERGMAADDK TLGRFELVGI PPAPRGVPQI EVTFDIDANG
IVHVSAKDLG TGKQQQIRIT GSSGLTEAEI QRMIRDAEAN RADDAAKKEL ADLKNNAEGL
VYTTEKSLEE YASALAADDL AEIRADLELL KGVLQGSDGG AIKEALTRLE GSAYRIADAI
YAQQGGGT