Gene Mbar_A3433 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMbar_A3433 
SymboldnaK 
ID3624720 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethanosarcina barkeri str. Fusaro 
KingdomArchaea 
Replicon accessionNC_007355 
Strand
Start bp4400267 
End bp4402129 
Gene Length1863 bp 
Protein Length620 aa 
Translation table11 
GC content48% 
IMG OID637702261 
Productmolecular chaperone DnaK 
Protein accessionYP_306886 
Protein GI73670871 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0443] Molecular chaperone 
TIGRFAM ID[TIGR02350] chaperone protein DnaK 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.0457031 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.0513788 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAAAAA TACTTGGTAT TGACCTTGGT ACTACGAACT CATGCATGGC AGTAATGGAA 
GGCGGAGAAG CTGTCGTGAT TCCAAATGCC GAAGGCGCCA GAACAACCCC ATCAGTGGTC
GGATTTTCCA AAAAAGGAGA GAAACTTGTA GGACAGGTTG CAAAGAGGCA GGCCATTTCG
AATCCTGAAA ATACTGTTTA TTCCATTAAA AGGCATATGG GAGATGCCAA TTACACGGTG
ACTCTTCAGG GAACACAGTA TAAGCCGCAG GAAATTTCTG CAATGATTCT TCAGAAGCTC
AAGACCGATG CAGAAGCTTA TCTTGGAGAG ACTATCAAAC AGGCTGTTAT CACGGTTCCT
GCTTATTTCA ATGACGCCCA GAGACAGGCT ACAAAGGACG CAGGGGCAAT TGCAGGCCTT
GATGTCCTGA GAATTATCAA TGAACCAACT TCCGCATCCC TGGCTTATGG TCTCGATAAG
GGAGATATAG AACAAAAAAT TCTTGTTTAT GACCTGGGTG GCGGAACCTT TGATGTATCT
ATTCTCGAAC TCGGAGGCGG AGTCTTTGAG GTGAAATCTA CAAGTGGTGA CACTCGCCTT
GGAGGAGATG ACTTCGACCA GCGTATAGTT AATTACTTAC TTGCTGAGTT CAGGAAAATC
GAGGGAATCG ACCTCTCCAA AGACAAGGCT GTACTCCAGC GTTTAACTGA TGCTGCGGAA
AAAGCCAAGA TCGAGCTGTC TGGAGTTGCA AGTACAAATA TCAACCTTCC CTTCCTAACA
GTTGGTGCGG ACGGAGAACC AAAGCACCTT GATATTGACC TGACAAGAGC TCAGTTCCAG
AAGATGACTG AGGACCTCCT TGAAAAGACC CTTGTATCTA TGCGCCAGGC TCTCAGCGAT
GCAAAGCTTA CTCCAAATGA TCTTGACAAA GTGATTCTCG TCGGAGGTGC CACAAGGATG
CCTGCAGTAG TCGAGCTTGT GGAAAACTTT ACAGGCAAAA AGCCCTACAA GAACATCAAC
CCTGATGAAG CCGTTGCAAT TGGAGCAGCC ATCCAGGCCG GTGTGCTCGG TGGCGAAGTC
AAAGATGTCC TGCTGCTTGA TGTCACTCCT CTGACTCTCG GAATAGAAAC GCTCGGAGGT
ATAGCAACCC CACTTATCCC GAGAAACACG ACAATTCCTA CCAAGAAAAG TCAGGTTTTC
TCAACTGCTG CTGATAACCA GCCTTCAGTA GAGATTCATG TCCTTCAGGG AGAAAGAGGA
GTTGCTTCCG AAAACAAGAC TCTCGGTCGC TTTACTCTGG ACGGTATACC ACCAGCTCCA
AGAGGCATCC CGCAGATTGA AGTTACATTT GATATCGACG CAAACGGAAT CCTGCATGTA
GGTGCAAAAG ACCTCGGAAC CGGTAAGGAA CAGTCCATTT CCATCCAGAA GCCGGGTGGT
CTCTCGGATG ACGAAATCGA TCGCATGGTC AAAGACGCGG AATTGCATGC TGAAGAAGAC
AAAAAGCGCA AAGAAGATGT CGAAACCAGG AACAATGCCG AAGCCCTGAT CAATGCTGCG
GAAAAGACTT TGAAGGAAGC CGGAGATGCG GCAACCGAAG ACCAGAAGTC AAAGGTAACT
GCCGCAATCG ACGACCTTAA GAAAGCTCTT GAAGGTAAGG ACAGCGAAGA TATTAAATCA
AAAACCGAAG CTCTTCAGGA AGCCGTATAC CCAATTTCCA CTGCAATGTA CCAGAAAGCC
CAGCAGCAGG CTCAGCAAGC CCAGCAGGCT GAAGGTGAAG CAGGAAGCCA TGATGCACAA
GGTCCTGATG AGACAGTCGT TGATGCCGAT TATGAGGTAG TTGACGACGA AAAGCGTAAA
TAA
 
Protein sequence
MAKILGIDLG TTNSCMAVME GGEAVVIPNA EGARTTPSVV GFSKKGEKLV GQVAKRQAIS 
NPENTVYSIK RHMGDANYTV TLQGTQYKPQ EISAMILQKL KTDAEAYLGE TIKQAVITVP
AYFNDAQRQA TKDAGAIAGL DVLRIINEPT SASLAYGLDK GDIEQKILVY DLGGGTFDVS
ILELGGGVFE VKSTSGDTRL GGDDFDQRIV NYLLAEFRKI EGIDLSKDKA VLQRLTDAAE
KAKIELSGVA STNINLPFLT VGADGEPKHL DIDLTRAQFQ KMTEDLLEKT LVSMRQALSD
AKLTPNDLDK VILVGGATRM PAVVELVENF TGKKPYKNIN PDEAVAIGAA IQAGVLGGEV
KDVLLLDVTP LTLGIETLGG IATPLIPRNT TIPTKKSQVF STAADNQPSV EIHVLQGERG
VASENKTLGR FTLDGIPPAP RGIPQIEVTF DIDANGILHV GAKDLGTGKE QSISIQKPGG
LSDDEIDRMV KDAELHAEED KKRKEDVETR NNAEALINAA EKTLKEAGDA ATEDQKSKVT
AAIDDLKKAL EGKDSEDIKS KTEALQEAVY PISTAMYQKA QQQAQQAQQA EGEAGSHDAQ
GPDETVVDAD YEVVDDEKRK