Gene Information |
Locus tag | Hmuk_2040 |
Symbol | |
ID | 8411571 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halomicrobium mukohataei DSM 12286 |
Kingdom | Archaea |
Replicon accession | NC_013202 |
Strand | + |
Start bp | 1940986 |
End bp | 1941936 |
Gene Length | 951 bp |
Protein Length | 316 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 645020374 |
Product | NAD-dependent epimerase/dehydratase |
Protein accession | YP_003177860 |
Protein GI | 257388087 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1087] UDP-glucose 4-epimerase |
TIGRFAM ID | |
|
|
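The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming 1-based inclusive coordinates and a gene length that includes the terminal stop codon. The function names `gene_length` and `protein_length` are hypothetical helpers introduced for illustration, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length.

    Assumes the CDS length is a whole number of codons and, by default,
    that the final codon is a stop codon that is not translated.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    return codons - 1 if has_stop_codon else codons


# Values taken from the record above (Start bp, End bp).
length_bp = gene_length(1940986, 1941936)
print(length_bp)                  # 951
print(protein_length(length_bp))  # 316
```

Both results agree with the Gene Length (951 bp) and Protein Length (316 aa) fields listed in the record.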
Plasmid Coverage information |
Num covering plasmid clones | 31 |
Plasmid unclonability p-value | 0.68178 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 30 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCTGAAC CCAACCTCAG CGACAGATCG GTGCTGGTCA CGGGCGGTGC CGGCTTCGTC GGCGGCCAGC TCGTCCAGAC GCTCGCCCCG GACAACGACG TGACCGTCCT CGACGACCTC TCGACGGGCG AACGCGACCG CGTTCCCGAC GACGTGACCT TCGTCCACGG CGACGTACGC GACCAGCGCA AGCTCAAGCA GGAGATCGAG GCGGCAGACG TCGTCTTCCA CGAGGCGGCC GTCGTCGGCG TCCCGGCCTC GTTGCGAGAT CCGCCCCGGA GCAACCACGT CAACACCGGC GCGACCGTCC AGTTGCTCGA CTACGCTCGC CAGTACGACA CGCGGGTCGT CCTCGCCTCC AGCGCCGCGA TCTACGGCGA GCCCGAGTCG GTTCCCATCG AGGAGGACCA CCCCCTCGAA CCGACCTCAC CGTACGGCGT CGACAAGCTC GCAGTCGATC ACTACGCCCG CGTGTTCGCC CAGCAGTACG ATCTCCCTGT CGTTCCGCTG CGGTACTTCA ACATCTACGG GCCGCGGACC GGCCCCAATC CCTACAGTGC CGTGGTCGAC GTCTTCCTCG AACAGGCTCG GAGCGGCGAC CCGATCACGG TCCACGGGAC CGGCGAGCAG ACCCGTGACT TCGTCCACGT CGACGACGTG GTCCAGGCGA ACCTCCGGGC CGCGACCACC GACGAGGTCG GCGTGGCCTA CAACGTCGGC ACCGGCTCGT CGGTCTCTAT CGCGGAGCTC GCCGAGCTGA TCCGGACGGC GACGGACAGC GACTCGCCGA TCACACACAC CGACGAGCGA CCGGGCGACA TCAGCGACAG CGAGGCGGAC ATCTCTCGCG CACGCGAACG ACTCGGCTAC GAGCCGACGG TCGATCTCCG TTCTGGCATC GACCGGCTCG TCGACGCGGC CGCCCCGGCG GATCCCCCCT CCTCCTCGTA G
|
Protein sequence | MSEPNLSDRS VLVTGGAGFV GGQLVQTLAP DNDVTVLDDL STGERDRVPD DVTFVHGDVR DQRKLKQEIE AADVVFHEAA VVGVPASLRD PPRSNHVNTG ATVQLLDYAR QYDTRVVLAS SAAIYGEPES VPIEEDHPLE PTSPYGVDKL AVDHYARVFA QQYDLPVVPL RYFNIYGPRT GPNPYSAVVD VFLEQARSGD PITVHGTGEQ TRDFVHVDDV VQANLRAATT DEVGVAYNVG TGSSVSIAEL AELIRTATDS DSPITHTDER PGDISDSEAD ISRARERLGY EPTVDLRSGI DRLVDAAAPA DPPSSS
|
| |
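The sequence fields above are grouped in space-separated blocks of ten characters. As a rough sketch of how the GC content and protein translation could be re-derived from the gene sequence, the code below strips the spacing and translates codon by codon. It assumes the codon assignments of translation table 11, which match the standard genetic code, and it does not special-case alternative start codons (the gene here starts with ATG). The names `clean`, `gc_content`, and `translate` are hypothetical helpers, not part of any library.

```python
# Codon table built from the conventional TCAG ordering of the genetic code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block spacing and normalize to upper case."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence (0.0 to 1.0)."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence from the record above.
first_blocks = "ATGTCTGAAC CCAACCTCAG CGACAGATCG"
print(translate(first_blocks))  # MSEPNLSDRS
```

The translated prefix matches the first block of the protein sequence in the record. Passing the full gene sequence to `gc_content` and `translate` would allow the GC content and Protein sequence fields to be checked in the same way.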