Gene Information |
Locus tag | Hmuk_3174 |
Symbol | |
ID | 8412727 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halomicrobium mukohataei DSM 12286 |
Kingdom | Archaea |
Replicon accession | NC_013202 |
Strand | + |
Start bp | 3065499 |
End bp | 3067085 |
Gene Length | 1587 bp |
Protein Length | 528 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 645021521 |
Product | glycoside hydrolase family 3 domain protein |
Protein accession | YP_003178986 |
Protein GI | 257389213 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1472] Beta-glucosidase-related glycosidases |
TIGRFAM ID | |
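
The coordinate and length fields above can be cross-checked against each other. The sketch below is illustrative only: it assumes that Start bp and End bp are 1-based inclusive positions and that Gene Length counts the stop codon. The record does not state either convention, but its numbers are consistent with both. The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int, includes_stop: bool = True) -> int:
    """Number of residues encoded by an unspliced CDS of the given length.

    Assumes the CDS is a whole number of codons; if includes_stop is True,
    the final codon is a stop codon and contributes no residue.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    return codons - 1 if includes_stop else codons


# Values copied from the record for Hmuk_3174.
length = gene_length_bp(3065499, 3067085)
print(length, protein_length_aa(length))
```

With the values in this record, the computed gene length and protein length agree with the listed 1587 bp and 528 aa.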
| |
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 0.108856 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 0.164805 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCTCTGC CAGACGTGGC CGAACTCGAT CTCGCCACCA AAGTCGGACA GCTGTTCGTC GTCGGCTTCG AGGGACCGGA GCCGACGGAG GACCTCCGCG AACTCCTCAC CGACTATCGC TGTGGCAACG TCATCTACTT CAGCCGGAAC ATCGACTCGC CCGAGCAGGT CGCCGAGCTC AGTCGCGAGC TACAGACCAT CGCCACCGAA GCGGGACCCG AGATCCCGCT GTTCGTGACG GCCGACCAGG AGGGCGGCGT CGTCTCACGG ACCGACTGGG GGACCGAACC CCCGAGTCAG ATGAGCATCG GTGCGGGCCG GGACGCCGAC CTCGCCCGTT CGGTGGGTGG CGCTGTCGGG GCGGAACTCG CGTCGATCGG CGTCAACTTC GATCTGACGC CGGTGCTGGA CGTGAACAAC AACCCGGACA ACCCCGTCAT CGGCGTCCGG TCGTTCGGCG AGGAGCCCGA ACTCGTCGGC GATCTCGGCG CGGCGATGGC CGACGGCATG CAGTCCGAGG GCGTGCTGGC CTGTGGGAAG CACTTCCCGG GCCACGGCGA CACCAGCGCC GACTCGCACC ACTCGCTACC CGTCGTCGAC CACGACCGCG AGCGTCTCGA CGCCGTCGAA CTCGCCCCCT TCCGCCGGGC GATCGACGCC GGGATCGACG CGATCATGAC GACACACGTC TCGTTCCCGA CGATCACCGG CGACGACGAA CTCCCGGCGA CCGTCTCCCG AGACGTACAG ACCGGACTCC TCCGCGAGCA GTTAGGGTTC GATGGCCTGG TCGTCACCGA CGGAATGGAG ATGAACGCCA TCGCCGACGA GATGGGGACG CCGGAAGGGT GTGTCCAGGC CGTCGAAGCG GGCTGTGATC TCCTCCTGGT CTGTCACACC CCCGCGGTCC AGAAAGACTC GGTCGAGGCG GTCATCGACG CCGTCGAGTC CGGCCGCATC GACGAGTCGC GGATCGACGA CGCTGTCGAG CGCGTCCTCG AGTACAAGGA GCGACGCGGC GTCGGCCAGC AGACGCCCTC GCTCGACCGC TGGGAGGCGA CCAGCGATCG CTCCCGCGAG GTCGGTCGTG AGGTCGCCGC GGCCGGCATC ACGGTCGCTC GGGATCGAAA CGAGACGATT CCCTTCGACA CCGGACGACC GCTTCACCTC GTCGGCTTCC CCGGCGGACG CGCTTCGCCG GCAGAGGACG ACCGCTACGA GCCGACGCTG GTCGCCGACG CACTCGAAGC CGGCGGCTTC GACGTGGAAC TGCACGAGGT GGAGACCGCC GACGCCCTGC CGTCGTTCGA GGGCGACGAA CAGGTCGTGC TGGCGACCTA CAACGCCGCT GGCGACGACG AACAGGTGCG AGCGGTCGAG CGACTGGACG AGGCCGTCGA CGCCTTCGCC GCGCTGGTGG TGCGCAACCC CTACGACCTC GCTCGCTTCC CCGACGTGTC GACGGCGGTG TCGACCTACG ACTACACGCC CGCGACGCTG TCGGTGACCG GCGAGATCCT GGCCGGCCGG CGTCGGGCCA GCGGTCGGCT GCCGGTGACG ATCGCGGGCT TCGAGGCCGA GAACTGA
|
Protein sequence | MSLPDVAELD LATKVGQLFV VGFEGPEPTE DLRELLTDYR CGNVIYFSRN IDSPEQVAEL SRELQTIATE AGPEIPLFVT ADQEGGVVSR TDWGTEPPSQ MSIGAGRDAD LARSVGGAVG AELASIGVNF DLTPVLDVNN NPDNPVIGVR SFGEEPELVG DLGAAMADGM QSEGVLACGK HFPGHGDTSA DSHHSLPVVD HDRERLDAVE LAPFRRAIDA GIDAIMTTHV SFPTITGDDE LPATVSRDVQ TGLLREQLGF DGLVVTDGME MNAIADEMGT PEGCVQAVEA GCDLLLVCHT PAVQKDSVEA VIDAVESGRI DESRIDDAVE RVLEYKERRG VGQQTPSLDR WEATSDRSRE VGREVAAAGI TVARDRNETI PFDTGRPLHL VGFPGGRASP AEDDRYEPTL VADALEAGGF DVELHEVETA DALPSFEGDE QVVLATYNAA GDDEQVRAVE RLDEAVDAFA ALVVRNPYDL ARFPDVSTAV STYDYTPATL SVTGEILAGR RRASGRLPVT IAGFEAEN
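
The sequences above are printed in space-separated blocks of ten characters. As a minimal sketch, assuming plain Python without any bioinformatics library, the code below strips that grouping, computes a GC fraction, and translates nucleotides to amino acids. The codon-to-amino-acid mapping used here is the standard one, which translation table 11 shares; table 11's alternative start codons are not modelled. All function names are hypothetical.

```python
from itertools import product

# Codons in NCBI order (first base T, C, A, G; then second; then third),
# paired with the standard amino-acid string. Table 11 uses the same
# codon-to-amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace between ten-character blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate a nucleotide sequence codon by codon; '*' marks a stop codon."""
    seq = clean(seq)
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))


# First three blocks of the gene sequence in this record.
prefix = "ATGTCTCTGC CAGACGTGGC CGAACTCGAT"
print(translate(prefix))
```

Translating the first three blocks of the gene sequence reproduces the first block of the protein sequence, MSLPDVAELD. Applying `gc_fraction` to the full gene sequence would allow a comparison with the GC content field.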