Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hmuk_2547 |
Symbol | |
ID | 8412091 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halomicrobium mukohataei DSM 12286 |
Kingdom | Archaea |
Replicon accession | NC_013202 |
Strand | + |
Start bp | 2450933 |
End bp | 2454004 |
Gene Length | 3072 bp |
Protein Length | 1023 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 645020888 |
Product | D-lactate dehydrogenase (cytochrome) |
Protein accession | YP_003178362 |
Protein GI | 257388589 |
COG category | [C] Energy production and conversion |
COG ID | [COG0277] FAD/FMN-containing dehydrogenases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.846231 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 0.179542 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGAACAC GACACGATCG GGGTGGGCCG GACCCGGCTA CCGACGATCG GGCGGACTAC GACTACGAGG GGGGTTCCGT CGAGCGCCCC GCGTTCGTCG CCGCCCTCGA AGACCGCATC GACGGGACGG TCCGGTTCGA CGAGTACACG CGACAGTTGT ACGCGACCGA CGCGAGCGCC TACGAGGTGA CGCCGATCGG CGTCGTCTAT CCGACCTCGA CGGCCGACGT GGCAGCGGTC GTCGACTACT GCGCCGAGCG GGGAACGCCG GTCCTGCCGC GAGGCGGGGG GACGAGCCTC GCGGGACAGG CGGTCAACGA GGCGGTCGTC GTCGACTGCT CGCGCCACAT GGACGCGATC GAGTCGGTCG ATCCGGCGGG ACACACGGCA CGCGCACAGG TCGGGGTCAC CCTCGGGGCG TTGAACGACA GGCTCGCCGA CCACGGCCTG AAGTTCGCCC CCGACCCGGC GTGGGGCGAC AAGAGCGTCC TCGGTGGCGC GATCGGCAAC AACTCCACGG GCGCACACTC CCTGAAGTAC GGCAAGACCG ACGCGTACAT CGAGGACTGC GAGGTCGTCC TCGCGGACGG GACCGTGACG ACGTTCGGCG AGGTCACGCT CGACGAACTG CGATCGCGGG CCGACCGGGA CGGGCTCGAA GGCCAGATAT ACGCGGCGAT CGCCCGGCTC GTCGACGACG AGCGCGAGGC GGTCACCGCG GCGTTTCCCG ACCTCAAGCG AAACGTCTCC GGCTACAATC TCGATCGACT CCTCTCGGAG GCCGAAGACG GGTCGGTCAA CGTCGCGCGG CTGCTCGCGG GCAGCGAGGG GACCCTGGCC ATCGTGACCG AGGCGACGGT CTCGCTCGAA CCCCTCCCGG AAACGAAGTC GCTGGCGCTG CTCTCCTATC ACGACCTCAT CGACGCGATG GCCGACGTGC CGGCGATCCT CGAACACGAT CCCGCGGCGG TCGAAGTCCT CGACGACGTG TTGCTGGAGC TGGCCGCCGA CACCGAGGAG TTCGGCGACC TGGTCGACCA GCTCCTCCCG GCGGACACCG GCGCGGTGCT GCTCGTCGAG TTCTACGCCG AGAACGACCC GCAGGGCAAA CAGCGGGTCG CGGACCTCCT CGCGGATCGG GTCGGGAACG TCGGTACCGA CGCCGTCGCA CAGAGCGGGG CCGACTCTCT GACCGACGCG CCGCGGGAGG CGTTTCACGG CCAGGAGGCC CACGAGGAGA GCGAGCGAAA GCGATTCTGG AAGCTCCGCA AGAGCGGGCT CCCGATCCTC CTCGGGCGGA CCTCGGACGC GAAACACATC AGCTTCATCG AGGACACCGC CGTCCCGCCC GAGAACCTCG CGGACTACGT CGCCGAGTTC CAGGAGTTGC TGGCCGACAA CGACACGTTC GCGAGCTTCT ACGCTCACGC TGGACCGGGC TGTCTCCACA TCCGACCCCT GGTGAACACG AAGACCGTCG AGGGGGTCGA GCAGATGGCA GCTATCGCGG ACGGCGCGAC CGACCTCGTC ACGACCTACG GCGGCAGCGT CTCGGGCGAG CACGGCGACG GGCGTGCGCG CACCCAGTGG AACCGAAAGC TGTACGGACA GGACGTGTGG GAGGTGTTCC GAGAGCTGAA AGCGGCCTTC GATCCGGACT GGCTGTTGAA CCCCGGACAG GTGTGTGGCT ACGCCGCCGA CGAGGCGATT CCAGAGGGCG TCCCCGCGCG GGCCCGCGCC GTCGACATGA CCGACGACCT GCGGTTCGAT CCCGACTACG AGTTCGAGAT GGCGTTCGAG CCGGCCATGG AGTGGGACAA CGAGAACGGG TTCCAGGGGA TGGTCGAGCT CTGTCACGGC TGTGGGGGCT GTCGCGGCCC ACAGGAGACG ACCGGCGGCG TCATGTGTCC GACCTACCGG GCCGCGGGCG AGGAGTCGAC CGCGACGAGG GGGCGAGCCA ACGCCCTCCG CCAGGCGATG AGCGGCGACC TGCCGGCAGA TCCGACGGAC GAGGCGTTCG TCGACGAGAT CATGGACCTC TGTATCGGCT GCAAGGGATG TGCGAAAGAC TGCCCGAGCG AGGTCGACAT GGCCAAGCTC AAGACCGAAG TCGAACACGC ACATCATCAG GAACACGGCG CGAGCCTCCG GTCGAAGCTG TTCGCACACG TCGAGACCCT CAGCGCCTGG GGGAGTCGTC TCGCACCGCT GTCGAACTGG CTGGCCGGGG CACCGGGCAG TGACAGACTG GCCGAGCGCC TGGTCGGGAT CGCCAGCGAG CGATCGCTCC CGACGTTCAA ACGCGAGTCC TTCGAGGACT GGTTCGCCCA GCGCGGCCCC GCCGTCGACC CGGAGGACGC ACAGCGCCGG GCGCTGCTGG TCCCCGACAC GTACAACAAC TACAGCAACC CCGACGTGCT CCGTGCCGCC GTCCGCGTCC TCGAAGCCGC CGACGTACAC GTCGCCGTCC CGGACGACGC GACCAGCAGC GGCCGCGCCG CCCACTCGAA GGGCTTCGTC GACGTGGCTC GCGAACGCGC ACGGACGAAC GTCGACGCGC TGGACGGCCG CGTGGCCGAC GGGTGGGACG TGGTCCTCGT CGAGCCGTCC GACGCCGTGA TGTTCCAGTC GGACTACCGT GACCTCCTGG GCTCGGACGC CGCGCCCGTC GCCGACAACG CGTACGGCCT CTGTGAGTAC CTCGATCGGT TCCGCCTCGA CGAGCGCGTC GACTGGACGG GCGGCGAGGA GACGCTGACC TACCACGGCC ACTGCCACCA GAAGGCCGTC TCGCGGGACC ACCACGCCGT CGGCGTCCTC AGGCGGGCGG GCTACGCCGT CGACCCGCTG GATTCGGGCT GCTGTGGGAT GGCCGGCAGC TTCGGCTACG AGGCCGAACA CTACTCGATG AGCCAGGCGA TCGGACGGAT TCTCTTCGAC CAGATCGCGG ACAGCGACGG CGACGCCGTC GTCGCACCGG GGGCGTCCTG TCGCACGCAG CTCGGTGATC GACGGGGCCA CGAATCTCCG TCACATCCCG TCGAACGGCT CGCAGACGCG CTCGCTGACT GA
|
Protein sequence | MGTRHDRGGP DPATDDRADY DYEGGSVERP AFVAALEDRI DGTVRFDEYT RQLYATDASA YEVTPIGVVY PTSTADVAAV VDYCAERGTP VLPRGGGTSL AGQAVNEAVV VDCSRHMDAI ESVDPAGHTA RAQVGVTLGA LNDRLADHGL KFAPDPAWGD KSVLGGAIGN NSTGAHSLKY GKTDAYIEDC EVVLADGTVT TFGEVTLDEL RSRADRDGLE GQIYAAIARL VDDEREAVTA AFPDLKRNVS GYNLDRLLSE AEDGSVNVAR LLAGSEGTLA IVTEATVSLE PLPETKSLAL LSYHDLIDAM ADVPAILEHD PAAVEVLDDV LLELAADTEE FGDLVDQLLP ADTGAVLLVE FYAENDPQGK QRVADLLADR VGNVGTDAVA QSGADSLTDA PREAFHGQEA HEESERKRFW KLRKSGLPIL LGRTSDAKHI SFIEDTAVPP ENLADYVAEF QELLADNDTF ASFYAHAGPG CLHIRPLVNT KTVEGVEQMA AIADGATDLV TTYGGSVSGE HGDGRARTQW NRKLYGQDVW EVFRELKAAF DPDWLLNPGQ VCGYAADEAI PEGVPARARA VDMTDDLRFD PDYEFEMAFE PAMEWDNENG FQGMVELCHG CGGCRGPQET TGGVMCPTYR AAGEESTATR GRANALRQAM SGDLPADPTD EAFVDEIMDL CIGCKGCAKD CPSEVDMAKL KTEVEHAHHQ EHGASLRSKL FAHVETLSAW GSRLAPLSNW LAGAPGSDRL AERLVGIASE RSLPTFKRES FEDWFAQRGP AVDPEDAQRR ALLVPDTYNN YSNPDVLRAA VRVLEAADVH VAVPDDATSS GRAAHSKGFV DVARERARTN VDALDGRVAD GWDVVLVEPS DAVMFQSDYR DLLGSDAAPV ADNAYGLCEY LDRFRLDERV DWTGGEETLT YHGHCHQKAV SRDHHAVGVL RRAGYAVDPL DSGCCGMAGS FGYEAEHYSM SQAIGRILFD QIADSDGDAV VAPGASCRTQ LGDRRGHESP SHPVERLADA LAD
|
| |