Gene Hmuk_0122 details

Gene Information

Locus tag: Hmuk_0122
Symbol:
ID: 8409619
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halomicrobium mukohataei DSM 12286
Kingdom: Archaea
Replicon accession: NC_013202
Strand:
Start bp: 123147
End bp: 124301
Gene Length: 1155 bp
Protein Length: 384 aa
Translation table: 11
GC content: 68%
IMG OID: 645018447
Product: galactonate dehydratase
Protein accession: YP_003175967
Protein GI: 257386194
COG category: [M] Cell wall/membrane/envelope biogenesis; [R] General function prediction only
COG ID: [COG4948] L-alanine-DL-glutamate epimerase and related enzymes of enolase superfamily
TIGRFAM ID:
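
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residues encoded by a complete CDS: codon count minus one stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


# Values taken from the record above.
length = gene_length_bp(123147, 124301)
print(length)                     # 1155
print(protein_length_aa(length))  # 384
```

Both results agree with the Gene Length and Protein Length fields listed in the record.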


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value: 0.041758
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 27
Fosmid unclonability p-value: 0.798356
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCGAAA TTACCGACTA CGAGCTGTTC GAAGTCCCGC CGCGGTGGCT GTTCCTCCGG 
ATCGAGACCA GCGACGGCCT CGTCGGCTGG GGCGAACCCG TCGTCGAAGG CCGGGCGAGG
ACCGTCCGCA CCGCCGTCGA GGAACTGATG GACAACTACC TGCTGGGCGA GGACCCGAGT
CGGATCGAAG ACCACTGGCA GACGATGTAC CGCGGCGGCT TCTACCGCGG CGGGCCCGTG
CTCATGTCGG CGATCGCCGG GATCGACCAG GCGCTGTGGG ACATCAAGGG CAAGCACTTC
GACGCCCCGG TCCACGAGCT GCTGGGCGGC AAGGCCCGCG ACCGCATTCG CGTCTACCAG
TGGATCGGCG GGGACCGTCC CTCCGACGTG GCCGAGCAGG CCCGCGAGCA GGTCGAAGCG
GGCTTCACGG CGCTGAAGAT GAACGCCACC GAGGAGATCG AGCGCGTCGA CGACCCCGCC
ACCATCCAGG CCGCCGTCGA CCGGCTCCGG CAGGTCAGAG AAGCGGTCGG CGACGAGGTC
GACATCGGCG TCGACTTCCA CGGCCGCGTC ACGAAACCGA TGGCCAAGCG CCTGGTCGAG
GAGCTGGCTC CCTACGAACC GATGTTCGTC GAGGAGCCCG TGTTGCCCGA ACACAACGAC
GCCCTCCCCG AGATCGCCCA GCACACGACG ACGCCCATCG CCACGGGCGA GCGAATGTTC
TCCCGGTGGG ACTACAAGTC GCTGTTCGAG AACGGCACCG TCGACGTGAT CCAGCCGGAC
CTCTCTCACG CGGGCGGTAT CACCGAGGTC AAGAAGATCG CGGCGATGGC CGAGGCCTAC
GACGTGGCGA TGGCCCCCCA CTGCCCGCTG GGACCGATCG CGCTGGCGTC GTGTCTCCAG
GTCGACGCCA CCGCCGCCAA CGCCTTCATC CAGGAGCAGA GCCTCGACAT CCACTACAAC
GAGACCAGCG ACGTGCTCGA CTACCTCGCC GATCCCGCGG TCTTCGACTA CGACGACGGC
TACGTCGAGA TCCCCGACGA CCCCGGTCTG GGCGTCGAGA TCGACGAGGA GTACGTCCGC
GAGCAGGCCG AGCTCGACCA CGACTGGCAC AACCCCGTCT GGCGACACGA CGACGGCAGC
GTCGCCGAGT GGTAG
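
The gene sequence above is printed in blocks of ten bases, so the spacing has to be removed before any calculation. The following is a minimal sketch of how the GC content field could be recomputed from that text. The helper names are hypothetical, and the short demo string is only the first ten-base block of the sequence, not the full gene.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_fraction(text: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence (0.0 for empty input)."""
    seq = clean_sequence(text)
    if not seq:
        return 0.0
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# Demo on the first ten-base block only; paste the full gene sequence
# in place of this string to compare against the GC content field.
first_block = "ATGAGCGAAA"
print(gc_fraction(first_block))  # 0.4
```

The same cleaned string can be used to confirm that the sequence begins with the start codon ATG and ends with the stop codon TAG, as the listing shows.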
 
Protein sequence
MSEITDYELF EVPPRWLFLR IETSDGLVGW GEPVVEGRAR TVRTAVEELM DNYLLGEDPS 
RIEDHWQTMY RGGFYRGGPV LMSAIAGIDQ ALWDIKGKHF DAPVHELLGG KARDRIRVYQ
WIGGDRPSDV AEQAREQVEA GFTALKMNAT EEIERVDDPA TIQAAVDRLR QVREAVGDEV
DIGVDFHGRV TKPMAKRLVE ELAPYEPMFV EEPVLPEHND ALPEIAQHTT TPIATGERMF
SRWDYKSLFE NGTVDVIQPD LSHAGGITEV KKIAAMAEAY DVAMAPHCPL GPIALASCLQ
VDATAANAFI QEQSLDIHYN ETSDVLDYLA DPAVFDYDDG YVEIPDDPGL GVEIDEEYVR
EQAELDHDWH NPVWRHDDGS VAEW