Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Huta_2106 |
Symbol | |
ID | 8384400 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhabdus utahensis DSM 12940 |
Kingdom | Archaea |
Replicon accession | NC_013158 |
Strand | - |
Start bp | 2143123 |
End bp | 2144133 |
Gene Length | 1011 bp |
Protein Length | 336 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 644973175 |
Product | NAD-dependent epimerase/dehydratase |
Protein accession | YP_003131006 |
Protein GI | 257053173 |
COG category | [G] Carbohydrate transport and metabolism [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0451] Nucleoside-diphosphate-sugar epimerases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.0557769 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGACGTAC TGATTATCGG CGGTACTGGC GTCATCTCGA CCGGCATCAC GCGACAACTC GTCGAGGCGG GCCACGACGT GACCATCTTC AACCGCGGGG AGACTGACAT CGACATCCCC GAGGCAGTCG CAGAGATCCA CGGAGATCGC TTCGATCACG ATGCGTTCGA ATCCACAGTC GCGGACGTCG ACGTGGACGT CGTGATCGAC ATGATGTGTT TCAGCGTGGA GGACGCCAAA AGCGACATCC GGGCCTTCGC CGGGGAGATC GAGCAGTGCA TCTTCACCAG TACGGTCGAC GTCTATCACC GGCCGCCCGA GCGTAATCCG GTCACTGAGG ACGCCGCCCG CGAGCCGCCG GTCAGCGACT ACGCCGAGGG CAAGGCCGCC GCCGAAGATC GCTTCCGTGA GGCAGAGGCC GAGGGTGCCT TCGACGTGAC GATCATCCGC CCGTGGAGCA CCTACGGCGA GGGCGGGAGT ATTTTCCACA CCTTCGGCGG CGACACCTAC TACATCGAAC GAATCCGGCA GGGCAAACCC ATCGTCGTCC ACGGTGACGG GACCTCGCTG TGGGGCTCGT GTCACCGTGA CGACGTCGCC GCCGCGTACG TCAACGCCGT CGGCAACGAG ACGGCCTACG GCGAGACCTA TCACGTCACC AGCGAGGAGG TCATCACCTG GAACCAGTAC CACCGCCGGG TCGCGGCCGC ACTCGACGCA CCGGAACCCG ACCTCGTCCA CATCCCGACC GACGAGCTCC GGGACGTCGC ACCGGAGCGC ACGGAGATGC TCCGGGATCA CTTCCAGTAC AGCACCGTCT TCGACAACAG CAAGGCCAAA CGCGATCTGG ACTTCGAGTA CACGGTTTCC TTCGAGGACG GCGTCGAACG GACCGTCGCA TGGCTGGACG AACACGACGG GATCGAGGTC GGCGAGGGTG ACGCCTTCGA GGACGACCTC GTCGCCGCCT GGCGGGAGTC GACCGACGAC TTCGTCGCCG ACTTCGAGTG A
|
Protein sequence | MDVLIIGGTG VISTGITRQL VEAGHDVTIF NRGETDIDIP EAVAEIHGDR FDHDAFESTV ADVDVDVVID MMCFSVEDAK SDIRAFAGEI EQCIFTSTVD VYHRPPERNP VTEDAAREPP VSDYAEGKAA AEDRFREAEA EGAFDVTIIR PWSTYGEGGS IFHTFGGDTY YIERIRQGKP IVVHGDGTSL WGSCHRDDVA AAYVNAVGNE TAYGETYHVT SEEVITWNQY HRRVAAALDA PEPDLVHIPT DELRDVAPER TEMLRDHFQY STVFDNSKAK RDLDFEYTVS FEDGVERTVA WLDEHDGIEV GEGDAFEDDL VAAWRESTDD FVADFE
|
| |