Gene Information |
Locus tag | Tery_1306 |
Symbol | dnaK |
ID | 4242457 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Trichodesmium erythraeum IMS101 |
Kingdom | Bacteria |
Replicon accession | NC_008312 |
Strand | - |
Start bp | 1990542 |
End bp | 1992458 |
Gene Length | 1917 bp |
Protein Length | 638 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 638106494 |
Product | molecular chaperone DnaK |
Protein accession | YP_721105 |
Protein GI | 113475044 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0443] Molecular chaperone |
TIGRFAM ID | [TIGR02350] chaperone protein DnaK |
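The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (the record does not say so explicitly) and that the gene length includes the stop codon; the variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 1990542
end_bp = 1992458
gene_length_bp = 1917
protein_length_aa = 638

# Assumption: coordinates are inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# An unspliced CDS should be a whole number of codons.
codons, remainder = divmod(gene_length_bp, 3)

# Assumption: the last codon is the stop codon, which encodes no residue.
expected_protein_aa = codons - 1

print(span, codons, remainder, expected_protein_aa)
```

Under these assumptions the span equals the listed gene length, and the codon count minus the stop codon equals the listed protein length.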
| |
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.35463 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCAAAAG TAGTCGGAAT TGATTTAGGC ACAACAAACT CTTGTGTTGC AGTTATGGAA GGTGGTAAGC CTACAGTTAT AGCTAATGCA GAAGGTTTTC GTACAACGCC TTCGGTAGTA GCTTATGGCA AAAAAGAAGA ACAACTGGTA GGTCAAATTG CCAAGCGTCA AGCGGTGATG AACACCCAAA ATACTTTCTA TTCGGTAAAG CGGTTTATCG GGCGTAAATT TGATGAAGTA ACCCATGAAA CCACTGAAGT TTCCTATAAA GTTCTCAACA TTAACGGTAA CGTTAAATTA GATTGCCAAG CACTAAAAAA ACAATTTGCC TCTGAAGAAA TTTCAGCCCA AGTACTTCGT AAACTAATAG ATGATGCTAG TAAATATCTT GGTGAACAAG TAACGCAAGC CGTAATTACC GTACCTGCTT ACTTCAATGA CTCCCAAAGA CAAGCAACCA AGGATGCTGG TAAAATTGCT GGTGTAGAAG TACTGCGAAT TATCAACGAA CCAACAGCAG CTTCTCTAGC TTATGGTTTA GATAAAAAAA GTAACGAAAC TATTTTAGTA TTTGACCTTG GTGGTGGTAC ATTCGACGTT TCTATTCTAG AAGTGGGTGA TGGTGTCTTT GAAGTATTAG CTACTTCTGG TGATACTCAC CTTGGTGGTG ATGACTTTGA TAAGAAAATT GTTGATTATC TCGCTGCAGA ATTCAACAAA GTAGAAGGCA TTGATCTGCG TAAAGATAAA CAAGCACTAC AACGTTTAAC AGAAGCAGCA GAAAAAGCTA AGATTGAACT TTCTAGTGTT ACCCAGGCAG AAATTAACTT ACCATTTATT ACTGCTACTC AAGATGGGCC CAAGCATTTA GATCTAACTC TTACTAGAGC TGAATTTGAA GGTCTATGTT CCGACCTCAT TGATCGCTGT CGTATTCCTG TAGAAAATGC CATTCGGGAT GCGAAGTTAG ATAAAAAAGC CATCGATGAA GTAGTATTGG TTGGTGGTTC CACTCGAATT CCAGCAGTTC AAGAAATAGT CAAAAAAGTC CTAGGTAAAG AACCAAATCA AAGTGTTAAC CCTGATGAAG TTGTAGCTGT TGGAGCTGCT ATTCAAGCTG GTGTGTTAGC AGGTGAAGTT AAAGATATTC TCTTATTAGA TGTTACTCCT CTATCTCTGG GTGTGGAAAC TCTTGGTGGA GTAATGACCA AAATTATTCC GCGCAACACT ACTATTCCTA CTAAAAAATC AGAAGTATTT TCAACTGCTG TAGATGGTCA GACAAATGTA GAAATTCATG TACTGCAGGG CGAACGAGAA ATGTCTAGCG ATAACAAGAG TCTGGGAACT TTCCGCTTAG ATGGTATCCC CGCAGCACCT CGTGGCGTAC CTCAAATTGA AGTTACCTTT GATATTGATG CTAATGGTAT TCTGAATGTA ACTGCGAAGG ATAAAGGTAC TGGAAAAGAA CAATCTATTA GTATTACAGG TGCTTCGACT CTTCCTGATA CAGAAGTCGA TCGGATGGTT AAAGAAGCAG AAGCAAATGC TACTGCAGAC AAAGAGCGCC GTGATAAAAT TGACCGTAAG AACCAAGCTG ACTCTCTAGC ATATCAGGCT GAGAAGCAAA TTCAGGAGTT GGGGGATAAG GTTCCTGAAG CGGATAAGAC TAAAATTGAA GGCTTGATTA AAGACTTGCG CGATGCTGTC GCTCAAGAAG ACGATGAGAA GATTACTTCT CTGACTACTG AGTTACAACA AGCTCTCTAC AGTGTTAGCA GCAATATGTA TCAACAATCT 
GGTGGTGCAC CTGGTGAAGG TTCTACACCA AATAGTGATG CTGGTTCTGG TGCTTCTTCT TCAAATGCTA CAGGTGGAGA TGATGTAATT GATGCTGATT TTACTGAAAC AAAGTAA
|
Protein sequence | MAKVVGIDLG TTNSCVAVME GGKPTVIANA EGFRTTPSVV AYGKKEEQLV GQIAKRQAVM NTQNTFYSVK RFIGRKFDEV THETTEVSYK VLNINGNVKL DCQALKKQFA SEEISAQVLR KLIDDASKYL GEQVTQAVIT VPAYFNDSQR QATKDAGKIA GVEVLRIINE PTAASLAYGL DKKSNETILV FDLGGGTFDV SILEVGDGVF EVLATSGDTH LGGDDFDKKI VDYLAAEFNK VEGIDLRKDK QALQRLTEAA EKAKIELSSV TQAEINLPFI TATQDGPKHL DLTLTRAEFE GLCSDLIDRC RIPVENAIRD AKLDKKAIDE VVLVGGSTRI PAVQEIVKKV LGKEPNQSVN PDEVVAVGAA IQAGVLAGEV KDILLLDVTP LSLGVETLGG VMTKIIPRNT TIPTKKSEVF STAVDGQTNV EIHVLQGERE MSSDNKSLGT FRLDGIPAAP RGVPQIEVTF DIDANGILNV TAKDKGTGKE QSISITGAST LPDTEVDRMV KEAEANATAD KERRDKIDRK NQADSLAYQA EKQIQELGDK VPEADKTKIE GLIKDLRDAV AQEDDEKITS LTTELQQALY SVSSNMYQQS GGAPGEGSTP NSDAGSGASS SNATGGDDVI DADFTETK
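As a rough consistency check between the two sequences, the ends of the gene sequence can be translated and compared with the ends of the protein sequence. This is a minimal sketch, not the database's own procedure: the helper `translate` and the constant names are hypothetical. The codon table is built from the standard amino-acid assignments, which translation table 11 shares with the standard code (the two differ in permitted start codons, which this sketch does not model).

```python
from itertools import product

# Standard codon ordering (T, C, A, G) and the matching amino-acid string;
# these assignments apply to translation table 11 as well as table 1.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string; spaces between 10-base blocks are ignored."""
    dna = "".join(dna.split()).upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


# First 30 and last 27 bases of the gene sequence, copied from the record.
first_30 = "ATGGCAAAAG TAGTCGGAAT TGATTTAGGC"
last_27 = "GATGCTGATT TTACTGAAAC AAAGTAA"

print(translate(first_30))  # MAKVVGIDLG
print(translate(last_27))   # DADFTETK*
```

The first ten residues and the last eight residues agree with the protein sequence above, and the final codon is a stop codon (`*`).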