Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hmuk_3226 |
Symbol | |
ID | 8409301 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halomicrobium mukohataei DSM 12286 |
Kingdom | Archaea |
Replicon accession | NC_013201 |
Strand | - |
Start bp | 4681 |
End bp | 7782 |
Gene Length | 3102 bp |
Protein Length | 1033 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 645018162 |
Product | glycoside hydrolase family 2 TIM barrel |
Protein accession | YP_003175687 |
Protein GI | 257372913 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3250] Beta-galactosidase/beta-glucuronidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.553706 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCCGACT GGACAGATCC GCGAGTAGTC GGTCGTAACA GGCTCGCACC ACACACTGAC GTGTTCCCGT TCTCGGACGA GCAATCTGCA CGCCGGGACA GCGTGACTGC CTCGCCGTGG GTTCGGCGAT TGAACGGCGA GTGGCAGTTT CACCTCGCCG AGACCCCTGC GGACGCGCCA GCGATCCCGG GCGCGACGGA CGACGTCGAC TGGGACCGAA TCGAGGTCCC GCTGAACTGG CAACTCGACG GCCACGGACA CCCACACTAC ACGAACGTCG TCTACCCGTT CCCGGTCGAT CCGCCCCACG TTCCGACCGA GAACCCGACG GGGACGTACC GGCGCTCCGT CCACGTCGAC GAGGACTGGG ACGGTCGACA GATCCGCCTG CGCTTCGAGG GGGTCGACTC CGCCTTCCAC CTCTGGGTCA ACGGCGAGCG CGTCGGCTAC AGCGAAGGCG CGCGTCTCCC CGCGGAGTTC GACGTGAGCG ACTACGTCGA GCCGGGGGAG AACACGGTCA CTGCACGGGT CTACAAGTGG TCTAACGGGA GCTACCTCGA AGATCAGGAC ATGTGGTGGC TCAGCGGCAT CTTCCGTGAC GTCACCCTGT ACGCCGTGCC GGAGACTCAC GTCGCGGACG TCGACGTCCG TACCGAACTG GACGACGACT ACGTCGACGC CCGGTTCTCC GCGGCGGTCG ACATCGAATC GGTAGCCGGT GGCGACGCGA CGGGCCGTCT CCGCGCCTCT CTGGTCGACG AAGCGGGCGA CACTGCCGCC ACCTTCGCGG AGTCGTACGC GCTGACCGAC GGTGAGACGA CGCTCTCGCT GTCGACGACC GTCGAGGACC CGGACAAGTG GACCGCAGAG ACGCCGGACC GGTACACGCT CCTGGTGACG CTACTGGACG ACGGAACGCC CGTCGAGACC GTTCGCGAGA CGGTCGGCTT TCGCGAGGTC GAGATCGACG GCGGCCAGTT CCTGGTCAAC GGCGAAGCCG TGACGATCCG CGGCGTGAAC CGCCACGACT TCGATCCCGA CCGGGGGCGG GCCGTCACCG TCGACCAGAT GCGCGCGGAC ATCGAGCTGA TGAAACGGCA CAACGTCAAC GCCGTCCGGA CCGCTCACTA CCCCAACGAC ACGCGGTTCT ACGACCTCTG TGACGAGTAC GGGCTCTACG TCATGGACGA GACCGACATC GAGTGTCACG GGATGGAACG CATCGACGCC GTCCAGCATC TCAGTGACGA CCCCGCGTGG GAAGACACCT ACGTCGATCG GATGGTCCGG ATGCTCGAAC GAGACAAGAA CCACCCGAGC GTCGTGATCT GGTCGCTGGG CAACGAGTCC GCAGTCGGGG CCCACCACGA GACGATGTAC GAGCTGACAC GCGAGCGCGA TCCGACGCGG CCGGTCCACT ACGAGCAGGA CCACGACCAG CGGGTCAGCG ACATCGTCGG GCCCATGTAC ACCCCGCCCG AGGACATCGA GGCCCTGGCA GTCGACGATC CGGACCACCC CGTGATCCTC TGTGAGTACG CCCACGCCAT GGGGAACGGT CCGGGCGGGT TCGAGGAGTA CTGGGACGTA TTCGACGGCC ACGAGCGACT GCAGGGCGGG TTCGTCTGGG ACTGGATCGA CCAGGGCATC CGCCAGACGA CCGAGGACGG AGCGGAGTGG TTCGCCTACG GCGGCGACTT CGGCGACGAG CCCAACACGG GGAACTTCAA CATCAACGGT CTCGTCTTTC CCGATCGAAC GCCCTCGCCG GGCCTCACCG AGTTCAAGAA CGTCGTCGAG CCGGTCACGT TCGAGCCCGC AGCCCTCGAA CGGGGCGAAC TCGTCGTCGA GAACCGCTAC GACTTCCGCG GGCTAGATCA CCTCCGCGCC CGGTGGCGCG TCGAGGCCGA CGGCCGGGTC GTCCAGAGTG GCACGCTCGA CCTCCCAGCC GTCGCACCCG GCGACAGCGA ACCCGTCACG GTGCCGTTCG AGGATGCTGG CCGGGGCGAG CGGTTCCTCG TCGTCGAGGT CTCGCTGGCC AGTGACACGT CGTGGGCACC GGCCGGCCAC ACGGTCGCAA CGAGCCAGCA CGAACTCCCG ACGAGCGAGG CACCGACGGT GCCGGCCGCG ACGCTTCCCG CACCGCTGTC CTGTGAACGC AGCGACGACG AGATCGTCGT CTCGAACGCG GAGTTCGAGC TCCGGTTCGA CGCCCGATAC GGCGTCGTCG ACTCGCTGTC CTACCGCGGT CGGTCGGTCG TCACCGAGGG TCCCGAGGTC GGACTCTGGC GTGCCCCGAC GGACAACGAC AGAGGGTTGC CGCTGGTCCC GACTTTCTTC ACGCGCTTCC TCGAACTACA CGAAAACGAG GAACCGATCG CCGACTGGGA CGCCCGGACG GTCGGCTTCG CACAGATCTG GCGCGAGCAC GGTCTCGACA GCTTGCAGTC CCGCGTCGAC GCCGTCGACA CCGAGGTCGG GGAGGAGACG GTCACGATCA CCGTCGACGG TCGCCTCGCG CCGCCGATGT TCGCCCACGG GTTCGCGACG ACGCAGACGT ACACGATCCA CCCCACCGGA GCCGTCGAGA TCGAGACGGA TCTCGACCCT GAGGGCGACC TCTCGATGCT CCCGTCGCTG CCACGGATCG GACTCGACCT GACGCTCGAC GGGGACTTCG ACCACGTCAC GTGGTACGGC CGCGGCCCGG GCGAGTCCTA CGCCGACAGC GAGCAGGCGT CCCCCGTCGG TCGGTACGAG GCCGACGTTG CCGACCTCCA CACGCCCTAC GTCAGGCCCC AGGCGAACGG CACTCGCAGC GACACCCGCT GGGTGGCCGT TACCGACCGC AACGGTACCG GGCTCCTCGC GACCGGGGAC TCGCTGCTCG ACGTGACCGC ACACCACTAC TCGACCGAGC AGCTGGAGGC CGCCGATCAC GAACACGAAC TGTCCCGCGA GGACGACGTG TTCCTCTCGC TGGACCACGC ACACTCCGGA CTCGGGTCGG GCAGCTGTGG GCCCGAGACG TTCGAGTCCG ACCGCGTCCA GCCCGAGCGG ACCTCGTTCA CGGTCACGCT CCACCCGTAC GTCGACGACT GA
|
Protein sequence | MPDWTDPRVV GRNRLAPHTD VFPFSDEQSA RRDSVTASPW VRRLNGEWQF HLAETPADAP AIPGATDDVD WDRIEVPLNW QLDGHGHPHY TNVVYPFPVD PPHVPTENPT GTYRRSVHVD EDWDGRQIRL RFEGVDSAFH LWVNGERVGY SEGARLPAEF DVSDYVEPGE NTVTARVYKW SNGSYLEDQD MWWLSGIFRD VTLYAVPETH VADVDVRTEL DDDYVDARFS AAVDIESVAG GDATGRLRAS LVDEAGDTAA TFAESYALTD GETTLSLSTT VEDPDKWTAE TPDRYTLLVT LLDDGTPVET VRETVGFREV EIDGGQFLVN GEAVTIRGVN RHDFDPDRGR AVTVDQMRAD IELMKRHNVN AVRTAHYPND TRFYDLCDEY GLYVMDETDI ECHGMERIDA VQHLSDDPAW EDTYVDRMVR MLERDKNHPS VVIWSLGNES AVGAHHETMY ELTRERDPTR PVHYEQDHDQ RVSDIVGPMY TPPEDIEALA VDDPDHPVIL CEYAHAMGNG PGGFEEYWDV FDGHERLQGG FVWDWIDQGI RQTTEDGAEW FAYGGDFGDE PNTGNFNING LVFPDRTPSP GLTEFKNVVE PVTFEPAALE RGELVVENRY DFRGLDHLRA RWRVEADGRV VQSGTLDLPA VAPGDSEPVT VPFEDAGRGE RFLVVEVSLA SDTSWAPAGH TVATSQHELP TSEAPTVPAA TLPAPLSCER SDDEIVVSNA EFELRFDARY GVVDSLSYRG RSVVTEGPEV GLWRAPTDND RGLPLVPTFF TRFLELHENE EPIADWDART VGFAQIWREH GLDSLQSRVD AVDTEVGEET VTITVDGRLA PPMFAHGFAT TQTYTIHPTG AVEIETDLDP EGDLSMLPSL PRIGLDLTLD GDFDHVTWYG RGPGESYADS EQASPVGRYE ADVADLHTPY VRPQANGTRS DTRWVAVTDR NGTGLLATGD SLLDVTAHHY STEQLEAADH EHELSREDDV FLSLDHAHSG LGSGSCGPET FESDRVQPER TSFTVTLHPY VDD
|
| |