Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hmuk_3239 |
Symbol | |
ID | 8409317 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halomicrobium mukohataei DSM 12286 |
Kingdom | Archaea |
Replicon accession | NC_013201 |
Strand | + |
Start bp | 28381 |
End bp | 31857 |
Gene Length | 3477 bp |
Protein Length | 1158 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 645018178 |
Product | Heat shock protein 70 |
Protein accession | YP_003175699 |
Protein GI | 257372925 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0443] Molecular chaperone [COG0576] Molecular chaperone GrpE (heat shock protein) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.573429 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGAGGGA CACAGCCAGG AGCGACGCTG GCCGCGAGCG TCGGCATCGA GATCGAGGGC GGCCACTGCG AGCAGTTGCT CTCCCGAGGG CAGTCGCTGC CGGCGTCGGC GACAGAGACC TTCACGACGG CCGAAGATAA CCAGCAGACG GTCCAGATTC GTCTGTTCCA GGGCGATCAG GAACGTGTCG AGGGAAACGA GCTGCTCGGA GAGTGTACGG TCTCCGGCCT CGTACCTGCC CCCGCGGGCG TGCCGGACGT AGCAGTCACG TTCACCGTCG ACAGAGCGGG AGTCCTGCAG GTCTCGGTCC AGAACGGCAC CGGCGACGCG ACACTCCAGA TCGAGGGACA GCGTGAGTTC GACGGCGTCG CGGTCGAGAC GGCGGGAAGG ACCGACGCAG ATCCGACCAA CGACGTGATA CTCGGCGTCG ATCTGGGGAC GACCGCCAGC GTCTGTGCAG TTCCGGTGGA CGGCGAGCCC GAAATCGTCG TCAACAGCGA GGGAGACCGT GCGACACCCT CCGTGCTGTC CGTCGACGAC GACGGAACCC TGCTGGTGGG CAAAGCGGCC CGGAAGCGAG CGATCTCTCG ACCGGAACAG ACCATCGCGT CGGTCAAGCG TGTCCTCTTG GGCGAGGACG GGACGGTCGA GTTAGGCGAG CGAGAGTACT CGACCGTGGA GCTGGCGGGG ATGCTGTTCG AGAAGTTGCG CTCTGACGCC GAGAGCGCCG TCGGGCGGCC CGTCGAGAAG GCGGTCGTCA CCGTCCCCGC GGTGGCGTCG GTTCGCCAGC GGGGCCGCAT CGACCGAGCC GGCGAGATCG CCGGCCTGGA GATCGAACGG ACGATCGGCG ACGCGGCGGC CGCGGTGATG GGATACGCTT ACGGGAGCGA CGGAGAACAG ACCGTGCTGG TCTGCGATCT CGGCGGTGGC TCGCTGAGCG TCTCGCTGCT GGACGTGGAG AACGACATCT ACGAGATCGT GGCCAACGGC GGAGACGACG AGCTGGGGGG CAACGAGTGG GACGCGGCGA TCGTCGACCA CCTCGCCGAC CAGTTCGAGG CGGACCACGG CATCGATCTG CGCGAGGACC CACAGGCCCG CAGACGGCTG GCCGACGCAG CGGCGGCCGC GAAGATCGAA CTCGCCTCGC GCGAGCGGAC GCGGATCGAC GTACCTTACG TCGCGGCGAC GGACGACGGC CCGCTGGATC TGAACGCGAC GCTCACCCGC GATACCGTCG CGTCACTGAC CGAACCCCTC GTCGAGCGCG TCGTCGAGTC GGTCAGGGCA GTCCTGTCCG GAAGTCGACA CGGCGCGGAC GACATCGACG AGATCCTCCT CGTCGGCGGC GCGAGCCGGC TCCCGCAGGT CCGAAACAGA ATCGAGACGC TGGTCGGACA GCAGACGGCC AGACGCGTCA ACGAGGAGGT CACCGCACTC GGGACGGCCG TGCAGGCGGG CATCCTCTCG GGCGCGGTCG ACGACGTGGT GTTGCTGGAC GTGACCTCAC TGTCGCTGGG CATCGCGGTC GAGGGCGGTG GGTTCGAGCG CATCGTCGAG CGAGGCGAGA CCATCCCCAC CGTCGAGTCA GTGGAGTTCA CCACGACGGT GGACGACCAG ACGGCGGTGC CGATACGGGT GTTCCAGGGC GAACACGAAA CCGCCACACG GAACGAGTTG CTCGCCGAGT TCGTACTGAC CGACGTGCCG CCGACCTCGG CCGGGACGCC GGCGATCGAG GTCACGGTGA GCATCGACGA GAACCGGTGG ATCAACGTCG AGGCAGACCA CGACGGCGAC AGTGAAGCGA TCACGATCGA TCCCGCGACG GAGCGATCGC TGCGGGATGG AACGAGACGC CAGTCCGAGC AGGAACGGCC CGCACCCCGA CTGGTTACCA CTCGGGACGA AGACGAGGCG ACGGCATCCG ACGAAAGCCC GGAGACGAGC GCCGCGAGGG ATCTGGAGAC GCTGATCGAC GAGGCCCTCG ACGTACGCAA CCAGCTCCAT CGTGGCGTCC TCTACTGTCG CCGGTCGATC GACAACGAGG TGGCGGATCT CCGCGACAAC CTCGAAAAGA CGACCCGAAA CATCGAGTCG ATCGTCGGGA CGAACACGGA GAGCGCGGAC AAACGCGACG AGGACGGCGA GGAGGTGGGC GACCGGCTCG TCGGCGAGCT GGCGGCCGTC GAGGACTCGC TGCGGGCACT GCTCGACGAG GACACGTCGA CGGGGCTGGC GTTCGACGAC CTCGAAGCGC TGGTCGACCG GATCGACCGC GGGCTCGAAG CGGCGGGACT CGTGCTCGTC GATCCCGACG GCGGGGCCGA GACGGACCCG TACCGCCATC GAGTGGTCTC CTCAACAGAG AGCTCCGTCC CGGAGGGGCG AATCGTCTCC GTCCAGCAGA TCGGATACGA GCGCGACGGT GCGGTGTGCC GGGAGGCGAC GGTGGTCGTC AGCGCCGGTT CTCCAGCGGA ACGTGTCACA GCGGCCACGG GGCAGACCGA TCCCGACCGG TCGGGCGATA CACTCGATCG AGCGATCAGA GACGCACGCG CCCCGGCCGT CGAGCAGTTC CCGAGCCCAC CACGACGGTC GGTGAGCTAC GAGGACTTCG AGATCAGCGC GACGCTGGGA ACCGGCGGCC AGGCAGTCGT GTACGAGACG ACCCACCCGG CGATCGAGTC GCCGGCCCGG ATCGCGCTAA AAGAACCGGC TCGCGGCGAG CGGACCCTGA CACACGACGC GATCAGTTCG TTTCTACAGC GGGCACAGAC GTGGCGGACC GTCGACGACC GCGAGCGAGA GAGTCGCCGG TGGCGCTCCC ACGAGTACAT CGTCGGCGTC GTCGACGTCG GGGACACCCG ACCGTGGATC GCCATGGAGT ACATGGACGG CGGTGACCTC GCCGAGCGAC TCGACGGCAC CGAGGGGCTG CCGGTCGAGG AGGCCGTGTG GATCGGACAG TGTCTCTGCC GCGCGCTCGA AGTCGCACAC GAGTACGGGA TCGCCCACCT GGACATCAAG CCGGGGAACG TCCTCTTCAC CGAAACGACA CCGTCACGCT GGGACCTGCC GAAGCTCGCC GACTGGGGCC TCGCGAGAAA GCTGCTGGAC CGAGCCGACT CGATGGAAGG CTTCTCCCGA CACTTCGCCG CGCCCGAACA GTTCGACGCC GAGACCTTCG GCGAACCGGA CGCCGTCACG GACATCTACC AGGTCGGCGC TGTGCTGTAC GCGATGCTAC GCGGCGAGCC ACCCGTGAGC GGCGGCCGCC TCGCCGTCCG CAAGCGAATC GTCCGCGACG ACGGGCCACC GCCAGCCCCG AGTGCCGATC GTGACGACGT GCCGCCCGAA CTCGACGAGA TCGTACGCAC GGCGCTGGCG ACGGCAAAAC GGGATCGGTA CGAATCGATC CGGTATCTGC GAGACGATCT GGAAGCCCTG TGGAAGTCGC CGGACGATAC CGTCTAG
|
Protein sequence | MGGTQPGATL AASVGIEIEG GHCEQLLSRG QSLPASATET FTTAEDNQQT VQIRLFQGDQ ERVEGNELLG ECTVSGLVPA PAGVPDVAVT FTVDRAGVLQ VSVQNGTGDA TLQIEGQREF DGVAVETAGR TDADPTNDVI LGVDLGTTAS VCAVPVDGEP EIVVNSEGDR ATPSVLSVDD DGTLLVGKAA RKRAISRPEQ TIASVKRVLL GEDGTVELGE REYSTVELAG MLFEKLRSDA ESAVGRPVEK AVVTVPAVAS VRQRGRIDRA GEIAGLEIER TIGDAAAAVM GYAYGSDGEQ TVLVCDLGGG SLSVSLLDVE NDIYEIVANG GDDELGGNEW DAAIVDHLAD QFEADHGIDL REDPQARRRL ADAAAAAKIE LASRERTRID VPYVAATDDG PLDLNATLTR DTVASLTEPL VERVVESVRA VLSGSRHGAD DIDEILLVGG ASRLPQVRNR IETLVGQQTA RRVNEEVTAL GTAVQAGILS GAVDDVVLLD VTSLSLGIAV EGGGFERIVE RGETIPTVES VEFTTTVDDQ TAVPIRVFQG EHETATRNEL LAEFVLTDVP PTSAGTPAIE VTVSIDENRW INVEADHDGD SEAITIDPAT ERSLRDGTRR QSEQERPAPR LVTTRDEDEA TASDESPETS AARDLETLID EALDVRNQLH RGVLYCRRSI DNEVADLRDN LEKTTRNIES IVGTNTESAD KRDEDGEEVG DRLVGELAAV EDSLRALLDE DTSTGLAFDD LEALVDRIDR GLEAAGLVLV DPDGGAETDP YRHRVVSSTE SSVPEGRIVS VQQIGYERDG AVCREATVVV SAGSPAERVT AATGQTDPDR SGDTLDRAIR DARAPAVEQF PSPPRRSVSY EDFEISATLG TGGQAVVYET THPAIESPAR IALKEPARGE RTLTHDAISS FLQRAQTWRT VDDRERESRR WRSHEYIVGV VDVGDTRPWI AMEYMDGGDL AERLDGTEGL PVEEAVWIGQ CLCRALEVAH EYGIAHLDIK PGNVLFTETT PSRWDLPKLA DWGLARKLLD RADSMEGFSR HFAAPEQFDA ETFGEPDAVT DIYQVGAVLY AMLRGEPPVS GGRLAVRKRI VRDDGPPPAP SADRDDVPPE LDEIVRTALA TAKRDRYESI RYLRDDLEAL WKSPDDTV
|
| |