Gene Huta_2106 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHuta_2106 
Symbol 
ID8384400 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorhabdus utahensis DSM 12940 
KingdomArchaea 
Replicon accessionNC_013158 
Strand
Start bp2143123 
End bp2144133 
Gene Length1011 bp 
Protein Length336 aa 
Translation table11 
GC content64% 
IMG OID644973175 
ProductNAD-dependent epimerase/dehydratase 
Protein accessionYP_003131006 
Protein GI257053173 
COG category[G] Carbohydrate transport and metabolism
[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0451] Nucleoside-diphosphate-sugar epimerases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.0557769 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGACGTAC TGATTATCGG CGGTACTGGC GTCATCTCGA CCGGCATCAC GCGACAACTC 
GTCGAGGCGG GCCACGACGT GACCATCTTC AACCGCGGGG AGACTGACAT CGACATCCCC
GAGGCAGTCG CAGAGATCCA CGGAGATCGC TTCGATCACG ATGCGTTCGA ATCCACAGTC
GCGGACGTCG ACGTGGACGT CGTGATCGAC ATGATGTGTT TCAGCGTGGA GGACGCCAAA
AGCGACATCC GGGCCTTCGC CGGGGAGATC GAGCAGTGCA TCTTCACCAG TACGGTCGAC
GTCTATCACC GGCCGCCCGA GCGTAATCCG GTCACTGAGG ACGCCGCCCG CGAGCCGCCG
GTCAGCGACT ACGCCGAGGG CAAGGCCGCC GCCGAAGATC GCTTCCGTGA GGCAGAGGCC
GAGGGTGCCT TCGACGTGAC GATCATCCGC CCGTGGAGCA CCTACGGCGA GGGCGGGAGT
ATTTTCCACA CCTTCGGCGG CGACACCTAC TACATCGAAC GAATCCGGCA GGGCAAACCC
ATCGTCGTCC ACGGTGACGG GACCTCGCTG TGGGGCTCGT GTCACCGTGA CGACGTCGCC
GCCGCGTACG TCAACGCCGT CGGCAACGAG ACGGCCTACG GCGAGACCTA TCACGTCACC
AGCGAGGAGG TCATCACCTG GAACCAGTAC CACCGCCGGG TCGCGGCCGC ACTCGACGCA
CCGGAACCCG ACCTCGTCCA CATCCCGACC GACGAGCTCC GGGACGTCGC ACCGGAGCGC
ACGGAGATGC TCCGGGATCA CTTCCAGTAC AGCACCGTCT TCGACAACAG CAAGGCCAAA
CGCGATCTGG ACTTCGAGTA CACGGTTTCC TTCGAGGACG GCGTCGAACG GACCGTCGCA
TGGCTGGACG AACACGACGG GATCGAGGTC GGCGAGGGTG ACGCCTTCGA GGACGACCTC
GTCGCCGCCT GGCGGGAGTC GACCGACGAC TTCGTCGCCG ACTTCGAGTG A
 
Protein sequence
MDVLIIGGTG VISTGITRQL VEAGHDVTIF NRGETDIDIP EAVAEIHGDR FDHDAFESTV 
ADVDVDVVID MMCFSVEDAK SDIRAFAGEI EQCIFTSTVD VYHRPPERNP VTEDAAREPP
VSDYAEGKAA AEDRFREAEA EGAFDVTIIR PWSTYGEGGS IFHTFGGDTY YIERIRQGKP
IVVHGDGTSL WGSCHRDDVA AAYVNAVGNE TAYGETYHVT SEEVITWNQY HRRVAAALDA
PEPDLVHIPT DELRDVAPER TEMLRDHFQY STVFDNSKAK RDLDFEYTVS FEDGVERTVA
WLDEHDGIEV GEGDAFEDDL VAAWRESTDD FVADFE