Gene Hmuk_1347 details


Gene Information

Locus tag: Hmuk_1347
Symbol:
ID: 8410867
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halomicrobium mukohataei DSM 12286
Kingdom: Archaea
Replicon accession: NC_013202
Strand:
Start bp: 1271902
End bp: 1273635
Gene Length: 1734 bp
Protein Length: 577 aa
Translation table: 11
GC content: 68%
IMG OID: 645019678
Product: dihydroxy-acid dehydratase
Protein accession: YP_003177175
Protein GI: 257387402
COG category: [E] Amino acid transport and metabolism
              [G] Carbohydrate transport and metabolism
COG ID: [COG0129] Dihydroxyacid dehydratase/phosphogluconate dehydratase
TIGRFAM ID: [TIGR00110] dihydroxy-acid dehydratase
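
The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the CDS length includes a single terminal stop codon; neither convention is stated in the record. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS that ends in one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length(1271902, 1273635)
print(length, protein_length(length))  # prints "1734 577"
```

Under these assumptions the result matches the listed Gene Length (1734 bp) and Protein Length (577 aa).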


Plasmid Coverage information

Num covering plasmid clones: 28
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value: 0.0142863
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCAAGC AGGAGCGGAC ACCTGATGCC GACGAGGGGA AGGACCCGGA ACTCCGGAGT 
TCTGAAGTCA CGGAGGGCTA CGAGAAGGCA CCCCACCGCG CGATGTTCCG TGCGATGGGG
TACGACGACG AAGACCTCTC CTCGCCGATG ATCGGCGTCG CCAACCCCGC GGCCGACATC
ACGCCGTGTA ACGTCCACCT GGACGACGTG GCCGACGCGG CCTACGAGGG GATCGACGAG
ACCGAGGGGA TGCCCATCGA GTTCGGGACG ATCACGATCT CTGACGCCAT CTCCATGGGG
ACCGAGGGGA TGAAGGCGTC GCTGATCTCC CGGGAGGTCA TCGCCGACTC GGTCGAGCTC
GTCTCTTTCG GCGAGCGCAT GGACGGTCTG GTCACCATCG GTGGCTGTGA CAAGAACATG
CCCGGGATGA TGATGGCCGC GATCCGGACG GACCTGCCAA GCGTCTTCCT CTACGGGGGG
TCGATCATGC CCGGCGAGCA CGACGGCCGC GAGATCACGA TCCAGAACGT CTTCGAGGGC
GTCGGCGCGG TCGCCGACGG CGACATGAGC GAGGACGAAC TCGACGAGAT GGAGCGCAAC
GCCTGCCCCG GCGCGGGCTC CTGTGGCGGG ATGTTCACCG CCAACACGAT GGCCTCCATC
TCGGAGGCGC TCGGGTTTGC GCCGCTGGGC TCGGCCTCGC CGCCCGCGGA ACACGAGTCC
AGATACGAGG AGGCACGCCG GGCCGGCGAA CTCGCGGTCG AGGCAGTCCA GGAGCAGCGT
CGCCCCTCCG ACTTTCTCAG CCGCGAGTCC TTCGAGAACG CCATCGCGCT GCAGGTCGCG
GTCGGCGGCT CGACCAACGC CGTGCTCCAC ATCCTGGCGC TCGCGGCGGA GGCCGGCGTC
GACCTCGACA TCGAGGCGTT CAACGAGATC AGCAAGCGCA CGCCGAAGAT CGCCGACCTC
CAGCCCGGCG GCGAGCGCGT CATGAACGAC CTCCACGAGG TCGGCGGCGT CCCGGTCGTC
CTGAACGCCC TCTACGAGGC GGACCTGCTC CACGGCGACG CGCTGACGGT GACCGGCGAC
ACGCTCGGAG ACGCCCTCGA AGCCTACGAC CCGCCCGCGA TCGAGGACGT CGACGTCGAC
TACCTCTACA GCGTCGACGA ACCCAAGAAC GAGCAGGGTG CCATCCGCAT CCTCACGGGG
AACCTCGCGC CCGACGGCGC GGTCATCAAG ATCTCGGGCG AGGAGTACCT CCGCCACGAG
GGACCGGTCC GCATCTTCGA CGAGGAGTCG GCGGCGATGA AGTACGTCCA GGAGGGCCAC
ATCGAGTCCG GCGACGTGAT CGGCATCCGC AACGAGGGCC CCCGCGGCGG TCCCGGAATG
CGGGAGATGC TGGGGGTCAC GAGCGCCGTC GCCGGCCAGG GTCACGCCGA CGACGTGGCG
CTCTTTACCG ACGGCCGCTT CTCCGGTGCG ACCCGTGGCT TCTCGATCGG CCACGTCGCC
CCCGAGGCCT ACGTCGGTGG TCCCATCGCG GCGCTGGAGG ACGGCGATAC GATCACGATC
GACATCGAGA ATCTCGAACT CTCCGTGGAC CTCTCCGACG AGGAGATCGA ACAGCGCCTC
GAAGAGTACG ACCCCGACCC CAACTACGAC AGCGGCGTGC TGGCGAAGTA CCACCGCGAC
TTCGGCTCCG CGGCCAACGG CGCGGTGACC AACCCCGGCG CGAAGTGGGA CTGA
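
The gene sequence is printed in space-separated blocks of 10 bases, so the whitespace has to be stripped before any calculation. The following is a minimal sketch of that cleanup plus a GC-fraction calculation. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the example input is only the first line of the listing, so its GC fraction will differ from the 68% reported for the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence listing only, not the full gene.
first_line = "ATGAGCAAGC AGGAGCGGAC ACCTGATGCC GACGAGGGGA AGGACCCGGA ACTCCGGAGT"
seq = clean_sequence(first_line)
print(len(seq), round(gc_fraction(seq), 2))
```

Passing the full 1734 bp listing through the same two functions would be the way to compare against the GC content field.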
 
Protein sequence
MSKQERTPDA DEGKDPELRS SEVTEGYEKA PHRAMFRAMG YDDEDLSSPM IGVANPAADI 
TPCNVHLDDV ADAAYEGIDE TEGMPIEFGT ITISDAISMG TEGMKASLIS REVIADSVEL
VSFGERMDGL VTIGGCDKNM PGMMMAAIRT DLPSVFLYGG SIMPGEHDGR EITIQNVFEG
VGAVADGDMS EDELDEMERN ACPGAGSCGG MFTANTMASI SEALGFAPLG SASPPAEHES
RYEEARRAGE LAVEAVQEQR RPSDFLSRES FENAIALQVA VGGSTNAVLH ILALAAEAGV
DLDIEAFNEI SKRTPKIADL QPGGERVMND LHEVGGVPVV LNALYEADLL HGDALTVTGD
TLGDALEAYD PPAIEDVDVD YLYSVDEPKN EQGAIRILTG NLAPDGAVIK ISGEEYLRHE
GPVRIFDEES AAMKYVQEGH IESGDVIGIR NEGPRGGPGM REMLGVTSAV AGQGHADDVA
LFTDGRFSGA TRGFSIGHVA PEAYVGGPIA ALEDGDTITI DIENLELSVD LSDEEIEQRL
EEYDPDPNYD SGVLAKYHRD FGSAANGAVT NPGAKWD
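
The protein sequence can be reproduced from the gene sequence by codon translation. The sketch below is one way to do that, not the database's own method. It relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard genetic code (the two differ only in permitted start codons). It assumes the input contains only A, C, G and T, and it does not handle alternative start codons, which is sufficient here because the gene begins with ATG. The `translate` helper is hypothetical.

```python
from itertools import product

# Codon table in the conventional TCAG order; assignments are shared by
# translation tables 1 and 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(text: str) -> str:
    """Translate a (possibly blocked) DNA listing, stopping at the first stop codon."""
    seq = "".join(text.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence listing.
print(translate("ATGAGCAAGC AGGAGCGGAC ACCTGATGCC"))  # prints "MSKQERTPDA"
```

The output matches the first 10 residues of the protein sequence listed above.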