Gene Hmuk_2065 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHmuk_2065 
Symbol 
ID8411600 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalomicrobium mukohataei DSM 12286 
KingdomArchaea 
Replicon accessionNC_013202 
Strand
Start bp1974935 
End bp1976116 
Gene Length1182 bp 
Protein Length393 aa 
Translation table11 
GC content70% 
IMG OID645020403 
ProductNucleotidyl transferase 
Protein accessionYP_003177885 
Protein GI257388112 
COG category[J] Translation, ribosomal structure and biogenesis
[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1208] Nucleoside-diphosphate-sugar pyrophosphorylase involved in lipopolysaccharide biosynthesis/translation initiation factor 2B, gamma/epsilon subunits (eIF-2Bgamma/eIF-2Bepsilon) 
TIGRFAM ID[TIGR01173] UDP-N-acetylglucosamine diphosphorylase/glucosamine-1-phosphate N-acetyltransferase
[TIGR01208] glucose-1-phosphate thymidylylransferase, long form 


Plasmid Coverage information

Num covering plasmid clones34 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones36 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAAGTAG TCATCCTCGC GGCAGGTGAG GGGACGCGAA TGCGACCGCT TACGACGGAC 
ACGCCGAAAC CGATGTTGCC GGTCGCAGAC CGGCCGCTGG TGGCACACAC CGCCGACGCG
GCGGTCGAGG CGGGGGCGAG CGAGCTGATA CTGGTCGTCG GGTACGAAGC CGACGCGGTC
AGGTCGTACT TCGGCGAGGA GTATCGCGGC GTCCCGGTGG AGTTCGCAGT CCAGGCCGAA
CAACGCGGCA CCGCGGACGC CGTCAGGGCC GCCAGCGAGC ACCTCGACGG TCCCTTCGCC
GTCCTCAACG GCGACAATCT CTACGATCGA TCGTCGATCG CCGCGCTGTT CGACGCCGGC
CCGGCCATCG CGGCGTCTCG CGTCGACGAT CCGACGGCCT ACGGCGTCCT CTCGACGGAC
CGCAGCACCG TGACGGGGAT CGTCGAGAAG CCCGACGACC CGCCGACGGA GCTCGCGAAC
GCCGGTGCGT ACGTGTTTCC GGCCGACGCC CGCGAGTGGC TCGACGTCGA GAAAAGCGAG
CGCGGCGAGT ACGAGATCAC CGACGTGGTC GCGCGAGCGA TCGAGACCGG TACCGTCTCG
GCGGTCGAGG TCGACCGCTG GCTCGACGTG GGGCGACCCT GGGAGTTGCT GGCGGCAAAC
GAGTGGAAGC TCGGGGAGCT CGATCGGCGG ATCGACGGCG AGGTCCGCGG TGACGCGACG
CTCCGCGGAA ACGTCGTCGT CGAGGCCGGC GCGACCGTCG AGCCCGGCGT CGTCGTCGAG
GGGCCGGCAC TGATCCGTGC CGGCGCGGAG GTCGGACCCA ACGCCTACGT CCGCGGAGCG
ACCCTGCTCG CCGAGGACAC CCACGTCGGC CACGGCGTCG AGATCAAAAA CAGCGTGATC
GGTGCCGGCT CGGCCGTCCC CCACGTCACG TACGTCGGCG ACAGCGTCCT CGGCGAGGGC
GTGAACTTCG GAGCCGGCAC GCAGGTGGCG AACCTCCGTC ACGACGGCGA GCCGGTCAGA
CAGACAGTCA AGGGCGACCG CGTCTCGACC GGACGGCGAA AGTACGGCGT CGTCGCCGGA
GACGGCGTCA AGACCGCGGT CAACACGAGC ATCAACCCCG GTGTCACCCT GTCGAAGGGA
GCCACCACGA CGCCGGGCGA GTCGGTCACG CGGGATCGGT GA
 
Protein sequence
MQVVILAAGE GTRMRPLTTD TPKPMLPVAD RPLVAHTADA AVEAGASELI LVVGYEADAV 
RSYFGEEYRG VPVEFAVQAE QRGTADAVRA ASEHLDGPFA VLNGDNLYDR SSIAALFDAG
PAIAASRVDD PTAYGVLSTD RSTVTGIVEK PDDPPTELAN AGAYVFPADA REWLDVEKSE
RGEYEITDVV ARAIETGTVS AVEVDRWLDV GRPWELLAAN EWKLGELDRR IDGEVRGDAT
LRGNVVVEAG ATVEPGVVVE GPALIRAGAE VGPNAYVRGA TLLAEDTHVG HGVEIKNSVI
GAGSAVPHVT YVGDSVLGEG VNFGAGTQVA NLRHDGEPVR QTVKGDRVST GRRKYGVVAG
DGVKTAVNTS INPGVTLSKG ATTTPGESVT RDR