Gene Hmuk_2547 details

Gene Information

Locus tag: Hmuk_2547
Symbol:
ID: 8412091
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halomicrobium mukohataei DSM 12286
Kingdom: Archaea
Replicon accession: NC_013202
Strand:
Start bp: 2450933
End bp: 2454004
Gene Length: 3072 bp
Protein Length: 1023 aa
Translation table: 11
GC content: 69%
IMG OID: 645020888
Product: D-lactate dehydrogenase (cytochrome)
Protein accession: YP_003178362
Protein GI: 257388589
COG category: [C] Energy production and conversion
COG ID: [COG0277] FAD/FMN-containing dehydrogenases
TIGRFAM ID:
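
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is only an illustration and rests on two assumptions that the record does not state: that Start bp and End bp are 1-based, inclusive coordinates, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The variable names are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2450933
end_bp = 2454004
gene_length_bp = 3072
protein_length_aa = 1023

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is excluded from the protein length.
codons = span_bp // 3
expected_protein_aa = codons - 1

print("span matches gene length:", span_bp == gene_length_bp)
print("span is a whole number of codons:", span_bp % 3 == 0)
print("protein length matches:", expected_protein_aa == protein_length_aa)
```

Under those assumptions the span is 3072 bp, which is 1024 codons, consistent with the listed 1023 aa protein plus one stop codon.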


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value: 0.846231
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value: 0.179542
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGAACAC GACACGATCG GGGTGGGCCG GACCCGGCTA CCGACGATCG GGCGGACTAC 
GACTACGAGG GGGGTTCCGT CGAGCGCCCC GCGTTCGTCG CCGCCCTCGA AGACCGCATC
GACGGGACGG TCCGGTTCGA CGAGTACACG CGACAGTTGT ACGCGACCGA CGCGAGCGCC
TACGAGGTGA CGCCGATCGG CGTCGTCTAT CCGACCTCGA CGGCCGACGT GGCAGCGGTC
GTCGACTACT GCGCCGAGCG GGGAACGCCG GTCCTGCCGC GAGGCGGGGG GACGAGCCTC
GCGGGACAGG CGGTCAACGA GGCGGTCGTC GTCGACTGCT CGCGCCACAT GGACGCGATC
GAGTCGGTCG ATCCGGCGGG ACACACGGCA CGCGCACAGG TCGGGGTCAC CCTCGGGGCG
TTGAACGACA GGCTCGCCGA CCACGGCCTG AAGTTCGCCC CCGACCCGGC GTGGGGCGAC
AAGAGCGTCC TCGGTGGCGC GATCGGCAAC AACTCCACGG GCGCACACTC CCTGAAGTAC
GGCAAGACCG ACGCGTACAT CGAGGACTGC GAGGTCGTCC TCGCGGACGG GACCGTGACG
ACGTTCGGCG AGGTCACGCT CGACGAACTG CGATCGCGGG CCGACCGGGA CGGGCTCGAA
GGCCAGATAT ACGCGGCGAT CGCCCGGCTC GTCGACGACG AGCGCGAGGC GGTCACCGCG
GCGTTTCCCG ACCTCAAGCG AAACGTCTCC GGCTACAATC TCGATCGACT CCTCTCGGAG
GCCGAAGACG GGTCGGTCAA CGTCGCGCGG CTGCTCGCGG GCAGCGAGGG GACCCTGGCC
ATCGTGACCG AGGCGACGGT CTCGCTCGAA CCCCTCCCGG AAACGAAGTC GCTGGCGCTG
CTCTCCTATC ACGACCTCAT CGACGCGATG GCCGACGTGC CGGCGATCCT CGAACACGAT
CCCGCGGCGG TCGAAGTCCT CGACGACGTG TTGCTGGAGC TGGCCGCCGA CACCGAGGAG
TTCGGCGACC TGGTCGACCA GCTCCTCCCG GCGGACACCG GCGCGGTGCT GCTCGTCGAG
TTCTACGCCG AGAACGACCC GCAGGGCAAA CAGCGGGTCG CGGACCTCCT CGCGGATCGG
GTCGGGAACG TCGGTACCGA CGCCGTCGCA CAGAGCGGGG CCGACTCTCT GACCGACGCG
CCGCGGGAGG CGTTTCACGG CCAGGAGGCC CACGAGGAGA GCGAGCGAAA GCGATTCTGG
AAGCTCCGCA AGAGCGGGCT CCCGATCCTC CTCGGGCGGA CCTCGGACGC GAAACACATC
AGCTTCATCG AGGACACCGC CGTCCCGCCC GAGAACCTCG CGGACTACGT CGCCGAGTTC
CAGGAGTTGC TGGCCGACAA CGACACGTTC GCGAGCTTCT ACGCTCACGC TGGACCGGGC
TGTCTCCACA TCCGACCCCT GGTGAACACG AAGACCGTCG AGGGGGTCGA GCAGATGGCA
GCTATCGCGG ACGGCGCGAC CGACCTCGTC ACGACCTACG GCGGCAGCGT CTCGGGCGAG
CACGGCGACG GGCGTGCGCG CACCCAGTGG AACCGAAAGC TGTACGGACA GGACGTGTGG
GAGGTGTTCC GAGAGCTGAA AGCGGCCTTC GATCCGGACT GGCTGTTGAA CCCCGGACAG
GTGTGTGGCT ACGCCGCCGA CGAGGCGATT CCAGAGGGCG TCCCCGCGCG GGCCCGCGCC
GTCGACATGA CCGACGACCT GCGGTTCGAT CCCGACTACG AGTTCGAGAT GGCGTTCGAG
CCGGCCATGG AGTGGGACAA CGAGAACGGG TTCCAGGGGA TGGTCGAGCT CTGTCACGGC
TGTGGGGGCT GTCGCGGCCC ACAGGAGACG ACCGGCGGCG TCATGTGTCC GACCTACCGG
GCCGCGGGCG AGGAGTCGAC CGCGACGAGG GGGCGAGCCA ACGCCCTCCG CCAGGCGATG
AGCGGCGACC TGCCGGCAGA TCCGACGGAC GAGGCGTTCG TCGACGAGAT CATGGACCTC
TGTATCGGCT GCAAGGGATG TGCGAAAGAC TGCCCGAGCG AGGTCGACAT GGCCAAGCTC
AAGACCGAAG TCGAACACGC ACATCATCAG GAACACGGCG CGAGCCTCCG GTCGAAGCTG
TTCGCACACG TCGAGACCCT CAGCGCCTGG GGGAGTCGTC TCGCACCGCT GTCGAACTGG
CTGGCCGGGG CACCGGGCAG TGACAGACTG GCCGAGCGCC TGGTCGGGAT CGCCAGCGAG
CGATCGCTCC CGACGTTCAA ACGCGAGTCC TTCGAGGACT GGTTCGCCCA GCGCGGCCCC
GCCGTCGACC CGGAGGACGC ACAGCGCCGG GCGCTGCTGG TCCCCGACAC GTACAACAAC
TACAGCAACC CCGACGTGCT CCGTGCCGCC GTCCGCGTCC TCGAAGCCGC CGACGTACAC
GTCGCCGTCC CGGACGACGC GACCAGCAGC GGCCGCGCCG CCCACTCGAA GGGCTTCGTC
GACGTGGCTC GCGAACGCGC ACGGACGAAC GTCGACGCGC TGGACGGCCG CGTGGCCGAC
GGGTGGGACG TGGTCCTCGT CGAGCCGTCC GACGCCGTGA TGTTCCAGTC GGACTACCGT
GACCTCCTGG GCTCGGACGC CGCGCCCGTC GCCGACAACG CGTACGGCCT CTGTGAGTAC
CTCGATCGGT TCCGCCTCGA CGAGCGCGTC GACTGGACGG GCGGCGAGGA GACGCTGACC
TACCACGGCC ACTGCCACCA GAAGGCCGTC TCGCGGGACC ACCACGCCGT CGGCGTCCTC
AGGCGGGCGG GCTACGCCGT CGACCCGCTG GATTCGGGCT GCTGTGGGAT GGCCGGCAGC
TTCGGCTACG AGGCCGAACA CTACTCGATG AGCCAGGCGA TCGGACGGAT TCTCTTCGAC
CAGATCGCGG ACAGCGACGG CGACGCCGTC GTCGCACCGG GGGCGTCCTG TCGCACGCAG
CTCGGTGATC GACGGGGCCA CGAATCTCCG TCACATCCCG TCGAACGGCT CGCAGACGCG
CTCGCTGACT GA
 
Protein sequence
MGTRHDRGGP DPATDDRADY DYEGGSVERP AFVAALEDRI DGTVRFDEYT RQLYATDASA 
YEVTPIGVVY PTSTADVAAV VDYCAERGTP VLPRGGGTSL AGQAVNEAVV VDCSRHMDAI
ESVDPAGHTA RAQVGVTLGA LNDRLADHGL KFAPDPAWGD KSVLGGAIGN NSTGAHSLKY
GKTDAYIEDC EVVLADGTVT TFGEVTLDEL RSRADRDGLE GQIYAAIARL VDDEREAVTA
AFPDLKRNVS GYNLDRLLSE AEDGSVNVAR LLAGSEGTLA IVTEATVSLE PLPETKSLAL
LSYHDLIDAM ADVPAILEHD PAAVEVLDDV LLELAADTEE FGDLVDQLLP ADTGAVLLVE
FYAENDPQGK QRVADLLADR VGNVGTDAVA QSGADSLTDA PREAFHGQEA HEESERKRFW
KLRKSGLPIL LGRTSDAKHI SFIEDTAVPP ENLADYVAEF QELLADNDTF ASFYAHAGPG
CLHIRPLVNT KTVEGVEQMA AIADGATDLV TTYGGSVSGE HGDGRARTQW NRKLYGQDVW
EVFRELKAAF DPDWLLNPGQ VCGYAADEAI PEGVPARARA VDMTDDLRFD PDYEFEMAFE
PAMEWDNENG FQGMVELCHG CGGCRGPQET TGGVMCPTYR AAGEESTATR GRANALRQAM
SGDLPADPTD EAFVDEIMDL CIGCKGCAKD CPSEVDMAKL KTEVEHAHHQ EHGASLRSKL
FAHVETLSAW GSRLAPLSNW LAGAPGSDRL AERLVGIASE RSLPTFKRES FEDWFAQRGP
AVDPEDAQRR ALLVPDTYNN YSNPDVLRAA VRVLEAADVH VAVPDDATSS GRAAHSKGFV
DVARERARTN VDALDGRVAD GWDVVLVEPS DAVMFQSDYR DLLGSDAAPV ADNAYGLCEY
LDRFRLDERV DWTGGEETLT YHGHCHQKAV SRDHHAVGVL RRAGYAVDPL DSGCCGMAGS
FGYEAEHYSM SQAIGRILFD QIADSDGDAV VAPGASCRTQ LGDRRGHESP SHPVERLADA
LAD