Gene Information |
Locus tag | Hmuk_1450 |
Symbol | |
ID | 8410971 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halomicrobium mukohataei DSM 12286 |
Kingdom | Archaea |
Replicon accession | NC_013202 |
Strand | + |
Start bp | 1373663 |
End bp | 1375111 |
Gene Length | 1449 bp |
Protein Length | 482 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 645019782 |
Product | exopolysaccharide biosynthesis polyprenyl glycosylphosphotransferase |
Protein accession | YP_003177278 |
Protein GI | 257387505 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG2148] Sugar transferases involved in lipopolysaccharide synthesis |
TIGRFAM ID | [TIGR03025] exopolysaccharide biosynthesis polyprenyl glycosylphosphotransferase |
| |
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.0543863 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 43 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGAGTG GCTGGCGGTA TCGGATTGTT TCCGTCGTTG GAGTCTTTGC GTTGATCGTG GCCGCAGTCT CGGTTGCCAA TCATCCACTC GTCCACTCTG GCTTTCGGTT GCTACCCGTC GTCGGACATC TTCCCTTCGA TCCCGCCGAG GGAAGGGAGT TCTTCATCGA AGCTGGGACA ACTGCTGTCG TCGTCTTGCT GGCACTCGCT CCGCTGTACA AACCCCGTCC TCACCGGATC CTCGACATCT CGATGTTCGC CCTGAAACGG ACCCTCGTTG GGCTGTTTGC CCTCGCAACG ATCGGCTACT TCGATTACAC GTACCGCTTA CCGCGAGCGA CACTCCTGAT CGTCGGATCG TTGCTCGTCG TGGCGATCCC GACGTGGTTC GTGACCATCC GGAGCCGACC GGCACGTGGC GAGAGCGGTC GGGCGATCAT CGTCGGCGAC GACCCGAGTG AGATCGATCG TATCTTGGCT GCGATCGACA TCCCGGTCTT CGGCTACGTC TCTCCTCCGT CGCCCTACAT CGTCGACACG GAGGGATCGG GGCAACCACA GCTCGCTGAC GGTGCTGGCA GTACGGCCGA CTCGCTGACG TACACGACGG ATCTGGAGTG TCTGGGTGGA CTCTCACGCC TCGACGACAT TCTCGTCGAA CACGACATCG ATGTCGCCGT GTTCGCCTTC GACGAGACGG ATCGAGAGGA GTTCTTTGGC GTGTTGGCTA CCTGCCACGA GCACGGCGTC GACGCGAAGA TCCACCGGGA CAAGGCCGAC AGCGTCCTCG TCGACGACGA CCAGGTCGAG GAGATCGTGG ACATCGACGT CGAGCCGTGG GACTGGCAGG AGCGGGTAGT GAAGCGAGTT TTCGACGTGG TGTTTGCGGC TGTCGGCCTG TTGTTCGGAC TCCCACTGAT GGCAGCCGCT GCCGTGGCGA TCAAACTCGA GGACGGTGGC ACCATATTTT ACAAACAGGA ACGGACCGCG GAGTTCGGAG ATACCTTTCA GGTGTACAAA TTCCGGAGTA TGATACCGAA CGCAGAAGAG CGAACCGGGG CAAAACTTAG CGAGGAAGAC CGAGGTGGAC GAGACCCACG GGTAACGCGA GTTGGTCGAA CGCTTCGGAA GACACATCTC GACGAGATAC CACAGCTGTG GTCGATCCTC GTCGGCGACA TGAGTGTCGT CGGTCCACGA CCGGAGCGTC CGGAACTCGA TCGCGATATC GAAGAAGGGG TGTCAGACTG GCGTCGCCGA TGGTTCGTTC GTCCGGGTCT TACGGGTCTC GCGCAGGTCA ATGACGTAAC CGGTCACGAA CCAGAACAAA AACTCCGACT TGACGTAGAG TACATTCGGC GCCAGTCGTT TTGGTTCGAC CTGAAAATCG CAATTCGCCA GATCTGGAAG GTCGTTAGTG ATATTGCTGA AACTTTGGCT CATCGTTGA
Protein sequence | MASGWRYRIV SVVGVFALIV AAVSVANHPL VHSGFRLLPV VGHLPFDPAE GREFFIEAGT TAVVVLLALA PLYKPRPHRI LDISMFALKR TLVGLFALAT IGYFDYTYRL PRATLLIVGS LLVVAIPTWF VTIRSRPARG ESGRAIIVGD DPSEIDRILA AIDIPVFGYV SPPSPYIVDT EGSGQPQLAD GAGSTADSLT YTTDLECLGG LSRLDDILVE HDIDVAVFAF DETDREEFFG VLATCHEHGV DAKIHRDKAD SVLVDDDQVE EIVDIDVEPW DWQERVVKRV FDVVFAAVGL LFGLPLMAAA AVAIKLEDGG TIFYKQERTA EFGDTFQVYK FRSMIPNAEE RTGAKLSEED RGGRDPRVTR VGRTLRKTHL DEIPQLWSIL VGDMSVVGPR PERPELDRDI EEGVSDWRRR WFVRPGLTGL AQVNDVTGHE PEQKLRLDVE YIRRQSFWFD LKIAIRQIWK VVSDIAETLA HR
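The coordinate and length fields in the Gene Information table can be cross-checked with simple arithmetic. The sketch below is illustrative only and is not part of the database record. The function names `gene_length_bp` and `protein_length_aa` are hypothetical, and the code assumes 1-based inclusive coordinates and a CDS that ends in a single stop codon (consistent with the TGA at the end of the gene sequence above).

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS.

    Assumes the CDS length is a multiple of 3 and, by default,
    that the final codon is a stop codon that is not translated.
    """
    codons, remainder = divmod(cds_length_bp, 3)
    if remainder != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1 if has_stop_codon else codons


# Values taken from the record for Hmuk_1450.
length = gene_length_bp(1373663, 1375111)
print(length)                     # gene length in bp
print(protein_length_aa(length))  # protein length in aa
```

Under these assumptions the computed values agree with the Gene Length (1449 bp) and Protein Length (482 aa) fields of the record.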