Gene Hmuk_1450 details


Gene Information

Locus tag: Hmuk_1450
Symbol:
ID: 8410971
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halomicrobium mukohataei DSM 12286
Kingdom: Archaea
Replicon accession: NC_013202
Strand:
Start bp: 1373663
End bp: 1375111
Gene Length: 1449 bp
Protein Length: 482 aa
Translation table: 11
GC content: 59%
IMG OID: 645019782
Product: exopolysaccharide biosynthesis polyprenyl glycosylphosphotransferase
Protein accession: YP_003177278
Protein GI: 257387505
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG2148] Sugar transferases involved in lipopolysaccharide synthesis
TIGRFAM ID: [TIGR03025] exopolysaccharide biosynthesis polyprenyl glycosylphosphotransferase
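
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes the coordinates are 1-based and inclusive and that the unspliced CDS ends in a single stop codon that is not counted in the protein length.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming an unspliced CDS whose final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length(1373663, 1375111)
print(length)                  # 1449
print(protein_length(length))  # 482
```

Both results agree with the Gene Length and Protein Length fields listed in this record.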


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value: 0.0543863
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 43
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCGAGTG GCTGGCGGTA TCGGATTGTT TCCGTCGTTG GAGTCTTTGC GTTGATCGTG 
GCCGCAGTCT CGGTTGCCAA TCATCCACTC GTCCACTCTG GCTTTCGGTT GCTACCCGTC
GTCGGACATC TTCCCTTCGA TCCCGCCGAG GGAAGGGAGT TCTTCATCGA AGCTGGGACA
ACTGCTGTCG TCGTCTTGCT GGCACTCGCT CCGCTGTACA AACCCCGTCC TCACCGGATC
CTCGACATCT CGATGTTCGC CCTGAAACGG ACCCTCGTTG GGCTGTTTGC CCTCGCAACG
ATCGGCTACT TCGATTACAC GTACCGCTTA CCGCGAGCGA CACTCCTGAT CGTCGGATCG
TTGCTCGTCG TGGCGATCCC GACGTGGTTC GTGACCATCC GGAGCCGACC GGCACGTGGC
GAGAGCGGTC GGGCGATCAT CGTCGGCGAC GACCCGAGTG AGATCGATCG TATCTTGGCT
GCGATCGACA TCCCGGTCTT CGGCTACGTC TCTCCTCCGT CGCCCTACAT CGTCGACACG
GAGGGATCGG GGCAACCACA GCTCGCTGAC GGTGCTGGCA GTACGGCCGA CTCGCTGACG
TACACGACGG ATCTGGAGTG TCTGGGTGGA CTCTCACGCC TCGACGACAT TCTCGTCGAA
CACGACATCG ATGTCGCCGT GTTCGCCTTC GACGAGACGG ATCGAGAGGA GTTCTTTGGC
GTGTTGGCTA CCTGCCACGA GCACGGCGTC GACGCGAAGA TCCACCGGGA CAAGGCCGAC
AGCGTCCTCG TCGACGACGA CCAGGTCGAG GAGATCGTGG ACATCGACGT CGAGCCGTGG
GACTGGCAGG AGCGGGTAGT GAAGCGAGTT TTCGACGTGG TGTTTGCGGC TGTCGGCCTG
TTGTTCGGAC TCCCACTGAT GGCAGCCGCT GCCGTGGCGA TCAAACTCGA GGACGGTGGC
ACCATATTTT ACAAACAGGA ACGGACCGCG GAGTTCGGAG ATACCTTTCA GGTGTACAAA
TTCCGGAGTA TGATACCGAA CGCAGAAGAG CGAACCGGGG CAAAACTTAG CGAGGAAGAC
CGAGGTGGAC GAGACCCACG GGTAACGCGA GTTGGTCGAA CGCTTCGGAA GACACATCTC
GACGAGATAC CACAGCTGTG GTCGATCCTC GTCGGCGACA TGAGTGTCGT CGGTCCACGA
CCGGAGCGTC CGGAACTCGA TCGCGATATC GAAGAAGGGG TGTCAGACTG GCGTCGCCGA
TGGTTCGTTC GTCCGGGTCT TACGGGTCTC GCGCAGGTCA ATGACGTAAC CGGTCACGAA
CCAGAACAAA AACTCCGACT TGACGTAGAG TACATTCGGC GCCAGTCGTT TTGGTTCGAC
CTGAAAATCG CAATTCGCCA GATCTGGAAG GTCGTTAGTG ATATTGCTGA AACTTTGGCT
CATCGTTGA
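
The gene sequence above is printed in space-separated blocks of 10 bases, so it needs to be cleaned before any computation. The following is a minimal sketch, with hypothetical helper names, of how the whitespace could be stripped and the GC fraction computed. It is not the method the database used to derive the reported GC content.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace (block separators, newlines) and uppercase."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence as printed in this record.
first_line = "ATGGCGAGTG GCTGGCGGTA TCGGATTGTT TCCGTCGTTG GAGTCTTTGC GTTGATCGTG"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(f"{gc_fraction(seq):.0%}")
```

Passing the full multi-line sequence to `clean_sequence` and then to `gc_fraction` would give a value to compare against the GC content field of this record.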
 
Protein sequence
MASGWRYRIV SVVGVFALIV AAVSVANHPL VHSGFRLLPV VGHLPFDPAE GREFFIEAGT 
TAVVVLLALA PLYKPRPHRI LDISMFALKR TLVGLFALAT IGYFDYTYRL PRATLLIVGS
LLVVAIPTWF VTIRSRPARG ESGRAIIVGD DPSEIDRILA AIDIPVFGYV SPPSPYIVDT
EGSGQPQLAD GAGSTADSLT YTTDLECLGG LSRLDDILVE HDIDVAVFAF DETDREEFFG
VLATCHEHGV DAKIHRDKAD SVLVDDDQVE EIVDIDVEPW DWQERVVKRV FDVVFAAVGL
LFGLPLMAAA AVAIKLEDGG TIFYKQERTA EFGDTFQVYK FRSMIPNAEE RTGAKLSEED
RGGRDPRVTR VGRTLRKTHL DEIPQLWSIL VGDMSVVGPR PERPELDRDI EEGVSDWRRR
WFVRPGLTGL AQVNDVTGHE PEQKLRLDVE YIRRQSFWFD LKIAIRQIWK VVSDIAETLA
HR