Gene Huta_2347 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHuta_2347 
Symbol 
ID8384646 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorhabdus utahensis DSM 12940 
KingdomArchaea 
Replicon accessionNC_013158 
Strand
Start bp2389985 
End bp2391724 
Gene Length1740 bp 
Protein Length579 aa 
Translation table11 
GC content67% 
IMG OID644973420 
Productdihydroxy-acid dehydratase 
Protein accessionYP_003131246 
Protein GI257053413 
COG category[E] Amino acid transport and metabolism
[G] Carbohydrate transport and metabolism 
COG ID[COG0129] Dihydroxyacid dehydratase/phosphogluconate dehydratase 
TIGRFAM ID[TIGR00110] dihydroxy-acid dehydratase 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCCAACG ACGACAGGTT CTCGCGTGAC AAAGACGAGG ACCTTCCGAG CACCGACGTC 
ACTGAAGGAC CCGACAAGGC ACCCCACCGG GCGATGTTCC GCGCGATGGG CTACGACGAT
GCCGACTTCG ACTCGCCGCT GGTGGGCATC GCCAACCCCG CTGCCGACAT CACGCCCTGT
AACGTCCATC TCGACGACGT GGCCGAGACG GCCTGGGACG CCACCGACGA AGCGGGCGGG
ATGCCCGTCG AGTTCGGGAC GATCACCATC TCCGACGCCA TCTCGATGGG CACCGAGGGG
ATGAAGGCCT CCCTGATCTC CCGGGAGGTC ATCGCTGACT CCGTCGAACT CGTCGCGTTC
GGCGAGCGCG TCGACGGCCT CGTCACCATC GGCGGCTGCG ACAAGAACAT GCCCGGGATG
ATGATGGCGA TGATCCGGAC GGATCTACCG TCTGTGTTCC TCTATGGCGG CTCGATCATG
CCCGGCGAGC ACGACGGGAG AGACGTCACC ATCGTCCAGG TGTTCGAGGG TGTCGGCGCC
TACGCCACCG GCGACATGGA CGCCGACGAA CTCGACGACC TCGAACGCAA CGCCTGCCCC
GGCGCGGGGG CCTGTGGCGG GATGTTCACC GCCAACACGA TGGCCTCCAT CTCGGAGGTC
ATCGGGCTGG CACCGCTGGG CAGCGCAAGC CCGCCCGCCG AAGAGGAAAG CCGTTACGAC
GTGGCCCGCG AGACCGGAGA ATTGGCCGTC GAAGTCATCG AAGAGCGCCG CCGACCGTCG
GACATCCTCA CGCGGGAGTC CTTCGAGAAC GCCATCGCGC TGCAAACGGC GATCGGTGGG
TCGACCAACG CCGTCCTGCA CCTGCTGGCG ATGGCCGCCG AGGCCGGCGT CGAGCTGGAC
ATCGAGGACT TCGACGAGAT CAGCCGTCGG ACGCCGAAGA TCGCCGACCT CCAGCCTGGC
GGCGAGAGCG TGATGAACGA CCTCCACGAG ATCGGCGGCG TCCCGGTCGT GCTCCGCCGG
TTGCTGGAGG CCGACCTGCT GCACGGCGAT GCGATGACGA TCACCGGCCG GACGCTCGCC
GAGGAGATCG AGCACTTAGA AGAGAAGGGG CGACTCCCGC CCGAGGAAGA GATCGACGCC
GACTTCCTCT ACTCGATCGA CGACCCGAAG GAACCCGAGG GCGCGATCAA GATCCTGACG
GGCAACCTCG CGCCCGACGG CGCGGTCCTG AAGGCGACGG GCAACGACGA GTTCTACCAC
CAGGGGCCGG CGCGGATCTT CGAGGACGAG GAAGACGCAA TGGCGTACGT CCAGGAGGAT
CGCATCGAGT CCGGCGACGT GATCATCATC CGCGGTGAGG GGCCCAAGGG TGGCCCCGGA
ATGCGGGAGA TGCTCGGCGT CACCGCCGCC GTGGTCGGCC AGGGCCACGA GGACGACGTG
GCGTTGCTGA CTGACGGCCG GTTCTCCGGC GGGACGCGCG GGCCGATGAT CGGCCACGTC
GCCCCCGAGA GTTTCGTCGG CGGGCCGATC GGCGCGCTCG AAGACGGCGA CACCGTGACG
GTGGACATTC CCGAGCGCTC GCTCGACGTT GACCTTAGCG ACGCGGAGAT CCAACAGCGT
CTCGACGAGC GCGACGATCC CGAGCCGACC TACGAGAATG GTGTGCTGGC GAAGTACCAC
CGGGACTTCG ACTCGGCGGC CAACGGTGCG GTGAGCAACC CCGGTGTCAA GCGGGAATAA
 
Protein sequence
MSNDDRFSRD KDEDLPSTDV TEGPDKAPHR AMFRAMGYDD ADFDSPLVGI ANPAADITPC 
NVHLDDVAET AWDATDEAGG MPVEFGTITI SDAISMGTEG MKASLISREV IADSVELVAF
GERVDGLVTI GGCDKNMPGM MMAMIRTDLP SVFLYGGSIM PGEHDGRDVT IVQVFEGVGA
YATGDMDADE LDDLERNACP GAGACGGMFT ANTMASISEV IGLAPLGSAS PPAEEESRYD
VARETGELAV EVIEERRRPS DILTRESFEN AIALQTAIGG STNAVLHLLA MAAEAGVELD
IEDFDEISRR TPKIADLQPG GESVMNDLHE IGGVPVVLRR LLEADLLHGD AMTITGRTLA
EEIEHLEEKG RLPPEEEIDA DFLYSIDDPK EPEGAIKILT GNLAPDGAVL KATGNDEFYH
QGPARIFEDE EDAMAYVQED RIESGDVIII RGEGPKGGPG MREMLGVTAA VVGQGHEDDV
ALLTDGRFSG GTRGPMIGHV APESFVGGPI GALEDGDTVT VDIPERSLDV DLSDAEIQQR
LDERDDPEPT YENGVLAKYH RDFDSAANGA VSNPGVKRE