Gene Hmuk_1920 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHmuk_1920 
Symbol 
ID8411447 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalomicrobium mukohataei DSM 12286 
KingdomArchaea 
Replicon accessionNC_013202 
Strand
Start bp1829517 
End bp1830572 
Gene Length1056 bp 
Protein Length351 aa 
Translation table11 
GC content66% 
IMG OID645020250 
ProductNADH dehydrogenase (quinone) 
Protein accessionYP_003177740 
Protein GI257387967 
COG category[C] Energy production and conversion 
COG ID[COG1005] NADH:ubiquinone oxidoreductase subunit 1 (chain H) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value0.179258 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones36 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAATCCG AGACGCCGCT TCCCGACACG CTCGCGAACC TGCTCGGACT GGACCCGTCG 
AACCCGCTGG TCCTGTTCGT GATGGCGGTC GTCGCGTCCG GTGTGATCGC GTCGGGACTG
CTCGCCCTGG TCGCCGTCTC GGGGATCTGG GGCAAGCGGA AGATCACGGC AGCGTTCACC
GACCGGATCG CTGTCAACCG ACACGGGCCG GCGGGCATCC TGATCATCCC GGCAGACGCG
CTCCGGCTCC TCTCGAAGGA ACTGATCGTT CCGGAGGGCG TCGACCGTCC GGCCTGGGAC
CTGGGGCCGC TCATCATGGT CTTCTCGGCG CTGGCCGGCT TCGCCGTCAT CCCCATGGGG
AACGGCATCC AGATCGCCGA CCCGGAGACG GGACTGGCCT ACGTGTTCGC GATGGCGTCT
GTCGCCTCGC TCGGCCTCGT GATGGCCGGC TACTCGTCGA ACAACAAGTA CTCGTTCCTC
GGGGGACTGC GCGCGGTCGC ACAGAACCTC GCCTACGAGA TTCCGCTGAT CCTGACGGGG
ATGTCCGTGG CGCTTTTCGC CGGGACGCTC CGCCTGAGCG AGATCGTCGC GGCCCAGAGC
ACGACGCTGT TCAGCCTCGG CGGCCTCGCG ATCCCGTCGT GGTACGCGTT CGTCAACCCC
TTCGCGTTCG TGCTGTTCAT GGTCGCGAAC CTCGCGGAGG TCGGACGGAA CCCGTTCGAC
ATTCCCGAAG CGCCGACCGA GATCGTCGCC GGGTGGCAGA CCGAGTACTC CTCGGTGTAC
TTCGTGCTCG CGTACCTCTC GGAGTTCATC CACATCTTCC TCGGCGGTGC GATCATCGCG
ACGATCTTCC TGGGCGGTCC GGCCGGCCCG GTGTTGCCGG GCATCGTCTG GTTCCTGATC
AAGATCGTCG GGATCTACCT GTTCACTCAG TGGGCGCGTT CCGCGATCCC ACGGGTCCGG
ATCGACCAGC TCATCGAGAT CGGCTGGAAG GGCCTGCTCG TGCTGGCCTT CGCGAACCTG
CTCCTGACGG CCGGGATCGT CGGGGTGATC GCCTGA
 
Protein sequence
MQSETPLPDT LANLLGLDPS NPLVLFVMAV VASGVIASGL LALVAVSGIW GKRKITAAFT 
DRIAVNRHGP AGILIIPADA LRLLSKELIV PEGVDRPAWD LGPLIMVFSA LAGFAVIPMG
NGIQIADPET GLAYVFAMAS VASLGLVMAG YSSNNKYSFL GGLRAVAQNL AYEIPLILTG
MSVALFAGTL RLSEIVAAQS TTLFSLGGLA IPSWYAFVNP FAFVLFMVAN LAEVGRNPFD
IPEAPTEIVA GWQTEYSSVY FVLAYLSEFI HIFLGGAIIA TIFLGGPAGP VLPGIVWFLI
KIVGIYLFTQ WARSAIPRVR IDQLIEIGWK GLLVLAFANL LLTAGIVGVI A