Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hmuk_2219 |
Symbol | |
ID | 8411758 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halomicrobium mukohataei DSM 12286 |
Kingdom | Archaea |
Replicon accession | NC_013202 |
Strand | + |
Start bp | 2132581 |
End bp | 2134314 |
Gene Length | 1734 bp |
Protein Length | 577 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 645020561 |
Product | protein of unknown function DUF181 |
Protein accession | YP_003178039 |
Protein GI | 257388266 |
COG category | [S] Function unknown |
COG ID | [COG1944] Uncharacterized conserved protein |
TIGRFAM ID | [TIGR00702] uncharacterized domain [TIGR03604] bacteriocin biosynthesis docking scaffold, SagD family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 31 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 0.540724 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGTACGG TAGTGGACGT TGTGGGGAGT GGACCGGCGA CGGAGAGCCT TCTCGCGACA CTCGACGACC TCGACGTGAG TATCGACCAC CGGGAAGCGC CCGCCGTCGA CAGCGCCGAC TTCGCGGTCG TCGTCGACGA GACCGGGGCC GCGACGTTCG AGACCGCCAG CGAGCAGGCC CGCGACGGTG GGACGCCGTG GCTCGCGGTC GAACTCGGCG GCGTCGGCGG CCGCCCGGTC GTCCCGGCAG CGGTCACTGG ATTCGACCCC GACCGCGAGT GCTACGACTG TCTGTGCGGC CGTATCGAAG CCAACACGAC GGCGGCCGCC GAGGGTGGGG AGGACCCCGG ACCAGCGACG CTGCGCGTCG CTGGGGCCGT CGCCGGGGAG GCCGTCGCCG ACTATCTCGC TGCCGGCTCC GCACGGACCG ACGCTGACTC GTCGGTGCTG GGCCACGTGG TCGAGTTGCC CCACCGGAAA CGCCGTTTCT TCCCGCTGCC GGGCTGTTCG TGTGGCGGCG AGCGCGAGAC GACGATCGAT CGCTCTCACG CTACGGTCGA CACCGAGCAG GCGCTCGAAC GCGCCGAGCG GGCGCTGGAC GACCGGGTTG GGATCGTCCA GCAGGTCGGC GAGGCGGAGT CGTTCCCGGC ACCCTACTAC CTCGCACAGC TCACCGACAC GTCCGGATTC AGCGACGTCA CGGCCCCGCG ACAGGCGGCC GGCGTCGCGG CCGACTGGGA CGGGGCGTTC ATGAAAGCGC TCGGCGAGTC CTACGAGCGC TACGCTGCCG GCGTGTACAC GGCCGCGGAG ACGACGACCG CCACGGCGGC GTCGCTGGAC GACGCGGTCG CGCCCGAGGC CTTCGTCGCG CCCGACGACG CCGGCGCGGA CGCGACGACC GAACTCGACT GGATCGGAGC CCGGAACCTC GCGACCGACG AGTCGGCGCT GGTGCCCGCC GAACTCGTCT TTCACCCGCC CGTCGGGTCC CACGTTCGCC CGCCCCTGAC GACCGGGCTG GGACTGGGAT CGTCGGGCTG TGAGGCGCTG TTGGCCGGAC TTTACGAGGT GATCGAACGG GACGCGGCGA TGCTGTCGTG GTACTCGACG TTCGAGCCGC TGGGGCTGAC CGTCGACGAC GACGTGTTCG GGACGCTGTA CGAGCGGGCC GCGTCGGAGG GACTGACGGT GACGCCGCTG CTGCTGACAC AGGACGTGGA CGTGCCCGTC GTCGCCGTCG CCGTTCACCG GGACGAGTGG CCCAGCTTCG CCATCGGGTC GGCGGCCGAT CTCGACCCCG AGCAGGCCGC TCTCGGCGCG CTCGAAGAGG CGCTCCAGAA CTGGATGGAG CTTCGGAGCA TGGGCCCCGA ACAGGCCGCG GAGGCCAGCG GCGCGATCGG GGAGTACGCC GACAAGCCAG ACCGGGCTGT CGACCTGCTG GCGTACGATC AGACCATCCC GGCCGAGGCG GTCGGCCCAG ACGCCGTCGA CGACGGCGAG GCCGAACTGG ACGCCCTCGT CGCAGCGCTC TCGGAAGCCG GACTGACGCC GTACGCGACG CGGACGACGA CGCGGGATCT GGACGAACTC GGCTTCGAGG GCGTCCGCGT GTTGGTGCCC GAGGCCCAAC CCCTGTTCCT CGGCGACGCC TTCTTCGGGG AGCGAGCGGA GACGGTCCCG ACCGAACTCG GCTTCGAGCC GCGACTCGAT CGCCCACACC ACCCGTTCCC GTAG
|
Protein sequence | MGTVVDVVGS GPATESLLAT LDDLDVSIDH REAPAVDSAD FAVVVDETGA ATFETASEQA RDGGTPWLAV ELGGVGGRPV VPAAVTGFDP DRECYDCLCG RIEANTTAAA EGGEDPGPAT LRVAGAVAGE AVADYLAAGS ARTDADSSVL GHVVELPHRK RRFFPLPGCS CGGERETTID RSHATVDTEQ ALERAERALD DRVGIVQQVG EAESFPAPYY LAQLTDTSGF SDVTAPRQAA GVAADWDGAF MKALGESYER YAAGVYTAAE TTTATAASLD DAVAPEAFVA PDDAGADATT ELDWIGARNL ATDESALVPA ELVFHPPVGS HVRPPLTTGL GLGSSGCEAL LAGLYEVIER DAAMLSWYST FEPLGLTVDD DVFGTLYERA ASEGLTVTPL LLTQDVDVPV VAVAVHRDEW PSFAIGSAAD LDPEQAALGA LEEALQNWME LRSMGPEQAA EASGAIGEYA DKPDRAVDLL AYDQTIPAEA VGPDAVDDGE AELDALVAAL SEAGLTPYAT RTTTRDLDEL GFEGVRVLVP EAQPLFLGDA FFGERAETVP TELGFEPRLD RPHHPFP
|
| |