Gene Hmuk_0120 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHmuk_0120 
Symbol 
ID8409617 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalomicrobium mukohataei DSM 12286 
KingdomArchaea 
Replicon accessionNC_013202 
Strand
Start bp120533 
End bp121666 
Gene Length1134 bp 
Protein Length377 aa 
Translation table11 
GC content71% 
IMG OID645018445 
ProductMandelate racemase/muconate lactonizing protein 
Protein accessionYP_003175965 
Protein GI257386192 
COG category[M] Cell wall/membrane/envelope biogenesis
[R] General function prediction only 
COG ID[COG4948] L-alanine-DL-glutamate epimerase and related enzymes of enolase superfamily 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value0.102708 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value0.754762 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCCGGA TCGTCGACTA CGAACTGTTC GAGGTGCCGC CGCGCTGGCT GTTCCTGAAG 
CTCGAAACGG CCGACGGAAC CACGGGCTGG GGCGAACCCA TCCTCGAAGG GCGTGCCGCG
ACCGTCCGCA CCGCCGTCGA GGAGCTACTG GAGGGGTACC TGCTGGGCGA GGCGGCGGGC
CGGATCGAGG ATCACTGGCA GACGATGTAT CGCGGCGGCT TCTACCGCGG TGGCCCCGTG
CTCATGTCGG CGATCGCCGG GATCGACCAG GCGCTGTGGG ACATCGAGGG CAAGCGGACG
GATCGGTCGG TCGCCGACCT GCTGGGCGGT TCCACGCGCG AGCACGTCCC GGTCTACAAG
AAGCTCGTCC CGGAGCGCGT CGACAGGATC CCCGAACTGG CGACCGACGC CGTCGAGGCG
GGCTACGAGA CGCTCAAGCT ACTGACGACC TACCAGACCG CCCCGCTGGA GTCGGGGGCC
GACGTCGACG CGATCTGTGA GCACCTGTCG CTCGCTCGCG ACGCGGTCGG TCGGGCGGTC
GACATCGGGG TCGACCTCCA CGGTCACGTC TCGGCGAGCA TGGCCCCGCG GGTGTGCGCC
CGGCTCGCGG CGGACGACCC CGCGTTCGTC GAGGAGCCCG TCCGGCCCGA GCACCTGCGG
ACGCTGGATC GGTCGGCCAC CCACGACGTT CCGGTGGCGT TCGGCGAACG GCTCTACTCG
CGCTGGGAGT TCCGTCCGCA CCTCGAAGCG GGGCGGGTCG ACATCGTCCA GCCCGACGTC
AGTCACGCGG GCGGGATCAC CGAGATCGCG AAGATCGCGT CGATGGCCGA GACCTACGGG
GCGCGGGTGA TGCCGAGTTG CTCGGTCGGG CCGATCGCCC ACGCGGCCAG CACGCAGCTC
AGCCACCACC TCCCGAACGC CGTCACGCAG CCCGACCTGG GCGAGCACTA CGTCGACGCC
TACGTCGACA ACGCCGACGA ACTGCGCAGC GAGAACGGGC GGGTAACCCT TCCTGATCGG
CCCGGCCTCG GCGTCGACGT GAACGAGGCG GGCGTCCGGG ACCACGCCGG CACGGGAAGC
GACTGGCGAC CGCCGGTCCG ACGGTACGAC GACGGGAGTT TCGTCGAGTG GTGA
 
Protein sequence
MTRIVDYELF EVPPRWLFLK LETADGTTGW GEPILEGRAA TVRTAVEELL EGYLLGEAAG 
RIEDHWQTMY RGGFYRGGPV LMSAIAGIDQ ALWDIEGKRT DRSVADLLGG STREHVPVYK
KLVPERVDRI PELATDAVEA GYETLKLLTT YQTAPLESGA DVDAICEHLS LARDAVGRAV
DIGVDLHGHV SASMAPRVCA RLAADDPAFV EEPVRPEHLR TLDRSATHDV PVAFGERLYS
RWEFRPHLEA GRVDIVQPDV SHAGGITEIA KIASMAETYG ARVMPSCSVG PIAHAASTQL
SHHLPNAVTQ PDLGEHYVDA YVDNADELRS ENGRVTLPDR PGLGVDVNEA GVRDHAGTGS
DWRPPVRRYD DGSFVEW