Gene Hmuk_0941 details


Gene Information

Locus tag: Hmuk_0941
Symbol:
ID: 8410457
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halomicrobium mukohataei DSM 12286
Kingdom: Archaea
Replicon accession: NC_013202
Strand:
Start bp: 903025
End bp: 904194
Gene Length: 1170 bp
Protein Length: 389 aa
Translation table: 11
GC content: 67%
IMG OID: 645019276
Product: Mandelate racemase/muconate lactonizing protein
Protein accession: YP_003176777
Protein GI: 257387004
COG category: [M] Cell wall/membrane/envelope biogenesis; [R] General function prediction only
COG ID: [COG4948] L-alanine-DL-glutamate epimerase and related enzymes of enolase superfamily
TIGRFAM ID:
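
The Start bp, End bp, Gene Length and Protein Length fields above are related by simple arithmetic. The following is a minimal Python sketch of that relationship, assuming the coordinates are 1-based and inclusive (which is consistent with the values in this record) and that the final codon of the CDS is a stop codon. The function names are illustrative and are not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def gc_fraction(seq: str) -> float:
    """Fraction of G and C among the A/C/G/T characters of seq.

    Spaces and line breaks, as used in the sequence listing, are ignored.
    """
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no nucleotides found")
    return sum(b in "GC" for b in bases) / len(bases)


length = gene_length(903025, 904194)
print(length, protein_length(length))  # prints "1170 389"
```

Applying `gc_fraction` to the full gene sequence listed under Sequence should give a value close to the reported GC content, subject to rounding.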


Plasmid Coverage information

Num covering plasmid clones: 30
Plasmid unclonability p-value: 0.748604
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 25
Fosmid unclonability p-value: 0.464463
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGGAGATCA CCGACGTGAC AGCGACGACT GTCGACGTAC CGCTGGTGGA CCTCGACGAA 
CGTCTGGGCA TCGGCCCGTA CGTGACGAAC CACGGACGCG TCGAATCGAT GGAACGTGTC
CTCGTCCGCG TGGACACCGA CGAGGGCATC GCCGGCTGGG GCGAGATGCG GACCTTCCTC
TCACCGGCGG CCACGGAGTC GATCATCGAA GACGGAATCG GCCCGCTGAT CGAAGGCCAG
TCGCCCTTCG AGGTCGAACG GCTCCGCCGA CAGGTGTTCA TCGAGTACAC CAACATCGAA
CTGTTCTTCG CCGCCGTCGA GACTGCCTGC TGGGACATCG CCGGCAAGGC GCTCGACAAG
CCGGTCTTCG AACTGCTCGG CGGGGCGACC GCACCGTACC AGACGACCGC GATGAACGCC
GCCGCTGGCG GTGCCGACGC CGCCGACGAC CGCGCGGTCG AGTTCGCCTT CTGTCTGGGT
ATCCTCTCTC CGGAAGAGTC CCGCGTGAAG GCCCAGGAAG CCCTCGACGC CGGCTTCTCC
GTCCTCAAGA CCAAGGCCGG CCGTGACTGG CAACAGGACG TGGCCCGCAT CCAGGCGATG
CACGACGAGG TCGACGGGCA ACTGGAGTTC CGGCTGGACC CCAATCAGGG GTGGTCGCTC
GATCAGGCGG TTCGCGTCGG GGCCGCCCTC GAAGAGTCCG GCATCTTCCT GCAGTACATG
GAACAGCCCA TCCGCGTCAA CGCCCACGAC TCGCTGGCGA CACTGCGCCA GCGTCTCCGC
CAGCCGATCG CTCCCAACGA GGACACGTAC ATCCCCAACA ACCTCCGCTC GCTCGTCGAG
GCCGGTGCGA TGGACGTGGC GGTCCTCGAT CTGACGCCAG CGGGCGGCAT CGCGGGACTG
CGACAGCAGG CGGCCATCGT CGAGGACGCC GGCGTTCCCT TCACCCACCA CTGCGCGTTC
GATCTCGGGA TCCGGACGGC CGCAATCTTG CACGCGGTCC ACGGGATCCC CGGATTTTCC
CTCCCGCCGG ACACGACCTA CTACGGCTGG GAAGACGACG TCATCGCGGA CCCCTTCACG
GTCGAGGAGG GCCGGATGAC CGTGCCGGAC GAACCCGGCC TCGGAATCGA CGTGGACCTC
GACACCGTCG AGGAGTACCG CGTCCGATAG
 
Protein sequence
MEITDVTATT VDVPLVDLDE RLGIGPYVTN HGRVESMERV LVRVDTDEGI AGWGEMRTFL 
SPAATESIIE DGIGPLIEGQ SPFEVERLRR QVFIEYTNIE LFFAAVETAC WDIAGKALDK
PVFELLGGAT APYQTTAMNA AAGGADAADD RAVEFAFCLG ILSPEESRVK AQEALDAGFS
VLKTKAGRDW QQDVARIQAM HDEVDGQLEF RLDPNQGWSL DQAVRVGAAL EESGIFLQYM
EQPIRVNAHD SLATLRQRLR QPIAPNEDTY IPNNLRSLVE AGAMDVAVLD LTPAGGIAGL
RQQAAIVEDA GVPFTHHCAF DLGIRTAAIL HAVHGIPGFS LPPDTTYYGW EDDVIADPFT
VEEGRMTVPD EPGLGIDVDL DTVEEYRVR
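
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below is one hedged way to do this in plain Python. It builds the codon table from the standard NCBI amino-acid string, which translation table 11 shares with the standard code (the two differ only in which codons may act as start codons). The `translate` helper is illustrative, not a database function, and it assumes the sequence is given in the coding orientation, as the listing above appears to be.

```python
from itertools import product

# Amino acids for codons enumerated in TCAG order (first base varies slowest).
# Translation table 11 uses the same codon assignments as the standard code.
_AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product("TCAG", repeat=3), _AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace (the 10-base grouping of the listing) is removed first.
    Trailing bases that do not form a full codon are ignored.
    """
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        residues.append(aa)
    return "".join(residues)


# First six codons of the gene sequence above.
print(translate("ATGGAGATCA CCGACGTG"))  # prints "MEITDV"
```

The first six residues agree with the start of the listed protein sequence. Passing the full gene sequence would allow the same comparison over all 389 residues.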