Gene Information |
Locus tag | Hmuk_1951 |
Symbol | |
ID | 8411479 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halomicrobium mukohataei DSM 12286 |
Kingdom | Archaea |
Replicon accession | NC_013202 |
Strand | + |
Start bp | 1859934 |
End bp | 1862858 |
Gene Length | 2925 bp |
Protein Length | 974 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 645020282 |
Product | hypothetical protein |
Protein accession | YP_003177771 |
Protein GI | 257387998 |
COG category | [R] General function prediction only |
COG ID | [COG3889] Predicted solute binding protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 37 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 36 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACAGGAA CTAACGATAA GGTTCGCAGC CTGTTCCTCT CTGCGCTGAT GGTCATGTCC GTCGTCGCGA TGGGCACGGC TTTCACTGCA GGAGCAGTTG CGGACACTAG CGGTAATGCG ACTGTCGAAA GCCTCGAGGT TACGCAAAAC CGCGTCCTCG AAGCTGACGA CTCTCGTAGT AGCGACATTC AAACACAGAC CGTCGAATTC GACGCATATC TCGAATCCGG CGACACGGAA GAGATCACGA TCGACACGGC CGACGCTGAA GACAACGGCC TGACGATCGA CGACGTTTCC GTCGACGGCT CCACGTCTAG CGACGTTTCC GTCGAGGACG TCGACTTCGA CGACGACCAG ATCACCTTCG ATCTGAGCGA GGACGCCGGT GGCTCCACGG TCAATGATGC CACCGTCGCC CTCAACGTCG ACTTCGACGT GAGTGACGGT GAGAACGCAA ACCGTCTCGA ACACGTCGTC ACGGCAACTG GTGACGACGG GTCCACGACG CTCTCCCCGG TCACCGCCAC GTACAAGATT GCCGGCGATT ACTCGAAGAC CGTCAACGGC GAGGATTCGG TCTTCGCTGG TGAGACGATC CGATTCACCC CCGGTGCCGA AAACACCCAG ATCGAGGTCT ACACGACCGA CGACGAGGGT GCTCCGACGA ACAACCGTGT TGCAAACTTC AACACGGCGT TCGACACCGC GATCAACTTC GACACGTCGA ACCTCGACAC TGGCGAGACG TACAACGTTC AGATCAGTGG CAGTGGTGTC ACCGATACTG ACTACGATCT CGCTGTCAAC GACCTCGGCC TCGACGCCGA AGCACGTGAC ACGCAGATCT CCACGGAAGA CCGCATCGTC GCGGACATCA CGTCGAACGC GAACCTCGAC GACCCGTGGA CGGCTACGCT CCTCGACAGC GACGGTGAGG CCGTCGAGGA CGACAACGGT GATGCAATCG AACGAACTGG TAGCTTCTCC GGCTCCGGTG CAGCAACCGT TCGATTCACC GCTCCGACCG ACTCTGACCA CTACGGCACT GGCAACTACT CGGTCGAAGT CACGCACGAC GACACCGGCA TCACGTCGGA AACCGACTCG ATCGAAGTCT CGGAAGCCGG TGACGGCGAC GTCTCCTTCG CTGGCGACGG CGTCTTCACC GAAGAAGCCG GTGACGTCGC GAACGTCACG GTCGAGATGA CCAACACCGA CGAAGCGACG GTCACCATCG GTACCGAAGA TCAGGGTTAC TACATCGTCT ACCAGGTCTC TGACGAGAGC GGTGACGGTC AGGTCTCCCT CGAATTCAAC AGCTACACCG CAGGCCGCAC GTCCCAGGAC AACGTCGTCA GCGTCGCTAA CTCTGACGAC GAGATCGAGT TCCTCACGCA GGGTGGCGAC TTCACCAACA CCTCCGAGGC TGTCGGCTCC GACACCCTCG ACCCGACGGA ATACGAGATG AACGCTTCCG TCGGCCACGT CGACATGGGC ACCGACGACT ACACTGACTC CGACGCGCTC GGCTCGCTCT CGCTGCAGCC GCGCTCGACC GACGGCATGC AGTCGTGGGT CACGCCGAAG GGCACCGACA TCAGCGACCT CAGTGATGAC GTTGGCAACA TCTACGACGA GATCGGCTCG TCGATCACCG AGTCCGGCGA AGTCGCCTAC GACGACGGTG ACGCCGACGG CGACACCGTC ATCCTCCAGC TCGAAGCTGG CGGTATCGAG GGTGCCATCG ACCACGCAAA TGGTCTGAAG ACCCTCATCG ACAACAACCA GGACAACGAC 
GACGATGAAG CGATCGACCT CACGATCGAG CAGACCAACG AGGATGCAAA CCGGAACGCA AAGACGCTCG ACTTCGTGAA CGACGACGCG ATCTCGGTCA TCGAAGACCC CGAGAACAAC TCGTACTTCA TCGCCGTTGA GCTGTCGAAG GTCGACCAGC CGACCCGCGA CCTCGCGGAA GGCGACAACT TCGAGGCGAC ATTCACCATC GAGGACGACA ACGTCCTCAA CACGATCGAC GACAACGAGG AAGTCACCAG TGACTTCACG CTCGTCGAAC CGAACACTGA GCTCGTCACG AACTCTGACG ACCTCATCCT GATCGAGTCC GCGTCCGGTC AGACCATCTC GGGTAGCACC AACTACGCGC CTGGCACCGA CCTCAACGTT CGGGTCAAGT CCAGCGACAC CGCGAGTCCG TTCCAGACTC GCCCCGAGGC AACCGTCCAG ACGGACGGTA CCTTCACGAC CGAAGGTGCT GACTTCAGCG AGGTCTCGCC CGGGACCAAC CTGACGCTGC AGACGCGTCG CGGTGGCTCC GCGGTCGGTG ACGAGTACGA CGGTCGCATC GGCGAAGTCC CGACTGCCTC CGTCAGCATC AGCAACCAGA CCACCGACGG CTCCACCGTG ACGGTCGACT CGGTTACGAC CGAGAACGGT GGCTTCGTCG CGATCCACCT CAACAACGCC AGCGGTGAGG TCATCGGCAA CTCCGAGTAC CTCGACTCTG GCACCCAGCA GGACGTCGAG ATCAGCCTCG ACTCCGCTCT GGACGAGAAC GCGACGGTCG TCGCGATGCC CCACCAGGAC ACCAACGACA ACGAAGAGTA CGACTTCGGC GATGGTGACG GTGCGGACGG TCCGTACACC GAGAACGGCT CGGCAGTGAC CGACAGCGCG TCGATCACGA TCCAGACCAC CGAGGACACG CCGACGCCGA CGGACACGCC GGAAGACACG CCGACGGACA CGGCGACGCC TGACGAAGGC GAAGACATGA CGGACACGCC GACTGACGAC GGCGGCGACG AGACCACCAC CGGCGACGGT GCTGGCTTCG GCGCAGTCGT CGCACTCGTC GGCCTCCTCG CTGCTGCGCT GCTCGCAACG CGGCGCAACA ACTGA
|
Protein sequence | MTGTNDKVRS LFLSALMVMS VVAMGTAFTA GAVADTSGNA TVESLEVTQN RVLEADDSRS SDIQTQTVEF DAYLESGDTE EITIDTADAE DNGLTIDDVS VDGSTSSDVS VEDVDFDDDQ ITFDLSEDAG GSTVNDATVA LNVDFDVSDG ENANRLEHVV TATGDDGSTT LSPVTATYKI AGDYSKTVNG EDSVFAGETI RFTPGAENTQ IEVYTTDDEG APTNNRVANF NTAFDTAINF DTSNLDTGET YNVQISGSGV TDTDYDLAVN DLGLDAEARD TQISTEDRIV ADITSNANLD DPWTATLLDS DGEAVEDDNG DAIERTGSFS GSGAATVRFT APTDSDHYGT GNYSVEVTHD DTGITSETDS IEVSEAGDGD VSFAGDGVFT EEAGDVANVT VEMTNTDEAT VTIGTEDQGY YIVYQVSDES GDGQVSLEFN SYTAGRTSQD NVVSVANSDD EIEFLTQGGD FTNTSEAVGS DTLDPTEYEM NASVGHVDMG TDDYTDSDAL GSLSLQPRST DGMQSWVTPK GTDISDLSDD VGNIYDEIGS SITESGEVAY DDGDADGDTV ILQLEAGGIE GAIDHANGLK TLIDNNQDND DDEAIDLTIE QTNEDANRNA KTLDFVNDDA ISVIEDPENN SYFIAVELSK VDQPTRDLAE GDNFEATFTI EDDNVLNTID DNEEVTSDFT LVEPNTELVT NSDDLILIES ASGQTISGST NYAPGTDLNV RVKSSDTASP FQTRPEATVQ TDGTFTTEGA DFSEVSPGTN LTLQTRRGGS AVGDEYDGRI GEVPTASVSI SNQTTDGSTV TVDSVTTENG GFVAIHLNNA SGEVIGNSEY LDSGTQQDVE ISLDSALDEN ATVVAMPHQD TNDNEEYDFG DGDGADGPYT ENGSAVTDSA SITIQTTEDT PTPTDTPEDT PTDTATPDEG EDMTDTPTDD GGDETTTGDG AGFGAVVALV GLLAAALLAT RRNN
|
| |
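The coordinate and length fields in the Gene Information table can be cross-checked against each other. The following is a minimal sketch, not part of the original record: it assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon (the gene sequence above ends in TGA). The function names are hypothetical helpers introduced only for illustration.

```python
# Values copied from the Gene Information table above.
START_BP = 1859934
END_BP = 1862858
GENE_LENGTH_BP = 2925
PROTEIN_LENGTH_AA = 974


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, excluding the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


def gc_percent(seq: str) -> float:
    """GC percentage of a nucleotide string; spaces and line breaks are ignored."""
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no nucleotides found")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


# The table values are mutually consistent under the stated assumptions.
assert gene_length(START_BP, END_BP) == GENE_LENGTH_BP
assert protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
```

Passing the full gene sequence from the Sequence section to `gc_percent` would give a value to compare with the reported GC content; the sequence is written in space-separated blocks of ten bases, which the helper skips.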