Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hmuk_2329 |
Symbol | |
ID | 8411870 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halomicrobium mukohataei DSM 12286 |
Kingdom | Archaea |
Replicon accession | NC_013202 |
Strand | - |
Start bp | 2244963 |
End bp | 2246372 |
Gene Length | 1410 bp |
Protein Length | 469 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 645020672 |
Product | glucose sorbosone dehydrogenase |
Protein accession | YP_003178148 |
Protein GI | 257388375 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2133] Glucose/sorbosone dehydrogenases |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 33 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 0.485013 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGTGGCA ATCGAGCCAG CAGCAGGGGG ATCAGTCGCC GTCGGTTTCT CCGACGGAGC GGTGCGATCG TCGGCGTCGG CCTCGTCGCC GGTTGTACGG GATCGAATCC GAACGACGGC ACTCGAACGA CGAGCGAGGG CGGTACCGAC ACGCCGGCCG GCCCCGCGGC GCTCGGATAC GATCTCTCGG TGACCCACGA GCTCACAGAG TGGGACCGCT ACGATCCCGA CTGGGAACCG CCGAGCGACT CGCCCCGAGA GGAGTACACC GCGGAGGTGC TGGCGACCGG GCTCGAAGTG CCCTGGGACC TCTCGTTTGC GGGCGAAGAC ACGCTGTTCG TGACAGAGCG GACCGGCCGT ATCACCGAGT TCGACAGCGG GACGCTGCGG ACGGTCGCCG AGCCGTCCGA CATCATCGAC GCAGCCGCGA TCGAGGCGGG CTCCGACGAG AGTCGGTGGC GGCTCACCGG CGGAGAAGGC GGTCTGCTCG GCGTCGCCGC CCACCCGTCG TATCCGGATC CGCCGGTCGT CTACGTCTAC TACACGGCCG AGACGAGCGA GGGGAAGCGC AATCGGGTGG TCGCCTTCGA CGCCAGCGCA CCAGCTCCCG ACGAGACTGT CGTCCCGGTC GTCGACGAGA TCCCGGCCGA CACCTACCAC AACGGCGGTC GGATCGCGTT CGGACCGGCC GACTACCTGT GGATTACGAC CGGCGACGCC GATCCCGGAC TCGAACACAC GGAACAGACG AGAGACCCCG CCTCCCTGGC CGGGAAAGTC CTCCGCGTTC GGCCCGACGG GTCGCCACCA CCGGACAACC CCGACAGCAC GTCGGACGCC GACCCGCGCG TGTTCACCTA CGGCCACCGG AACCCCCAGG GTATCGACTG GCTCCCGGAC GGGACCCCGA TAATCACCGA GCACGGACCC GGCGCTGGAG ACGAGCTCAA CGTCCTCAGG CCGGGCGTCG ACTACGGCTG GCCAGTGGTC CGGAACAGCG GCGATCACGA GCGATACCCA GAGACCGAGT TCCAGTCGCC GGTCGCGGAC GCCTCGTCGT GGGCCCCGGC CGGGGGCGTG TTCTACACCG GCGAGAGCGT TCCGAGCCTG CGGAACCGGT TCGTGTTCGG TGGCCTGATC AGTCAGCGAG TCACGGCCGC GACGATCACG CCCGCCGACG GGCCACAGCC CGCAGACGGA CACGAGCGAC GCCACGATGC CTCGTGGTAC GACGCCGACT ACCGGGCCGG GACCAGCGGG CTCCTGAGCG AGGAACTCGG CCGTGTCCGC CACGTCGAAC AGGGACCGGA GGGCGATCTC TACGCGATCA CGTCGAACCG TGACGGCCGC GCGAACGGAC CGTTCCCGCG CGACGACGAC GATCGACTGG TCCGGATCCG TCCGGCCTGA
|
Protein sequence | MGGNRASSRG ISRRRFLRRS GAIVGVGLVA GCTGSNPNDG TRTTSEGGTD TPAGPAALGY DLSVTHELTE WDRYDPDWEP PSDSPREEYT AEVLATGLEV PWDLSFAGED TLFVTERTGR ITEFDSGTLR TVAEPSDIID AAAIEAGSDE SRWRLTGGEG GLLGVAAHPS YPDPPVVYVY YTAETSEGKR NRVVAFDASA PAPDETVVPV VDEIPADTYH NGGRIAFGPA DYLWITTGDA DPGLEHTEQT RDPASLAGKV LRVRPDGSPP PDNPDSTSDA DPRVFTYGHR NPQGIDWLPD GTPIITEHGP GAGDELNVLR PGVDYGWPVV RNSGDHERYP ETEFQSPVAD ASSWAPAGGV FYTGESVPSL RNRFVFGGLI SQRVTAATIT PADGPQPADG HERRHDASWY DADYRAGTSG LLSEELGRVR HVEQGPEGDL YAITSNRDGR ANGPFPRDDD DRLVRIRPA
|
| |