Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hmuk_2209 |
Symbol | |
ID | 8411748 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halomicrobium mukohataei DSM 12286 |
Kingdom | Archaea |
Replicon accession | NC_013202 |
Strand | + |
Start bp | 2122835 |
End bp | 2124016 |
Gene Length | 1182 bp |
Protein Length | 393 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 645020551 |
Product | protein of unknown function UPF0118 |
Protein accession | YP_003178029 |
Protein GI | 257388256 |
COG category | [R] General function prediction only |
COG ID | [COG0628] Predicted permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.0133966 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 0.166807 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTCGAGT TCGAGTTCGA TGTCGACTGG CCACGAACTG CGTGGGTCGC CCTGGGGTTC GTGCTCGCCG CCGCCCTCGT CTTCGTCGCC TATCGCTTCG TCGGGACCTT CGTCTTCGGA CTGTTCGTCT ACTACGCGAC CCGGCGGCTC TACCGCCGGG TCCGGCGACG GATCGAACAG CCGTCGGTCG CCGCGGGCGT CTCGCTGGTC GGGATGGTGC TTCCGGCGCT CCTGTTGCTG GCCTACACGC TCGGGATCGC CCTCCAGCAA CTGAGTCGCT TCGCCGACGA GATCGACGGT GCCGGGTCCG GGAGCCAGAT CGAGGACCTG CTGGCGATGG CCGATCCGTA CCTGAACCTG ACGGTGTTGA GCGATCCGAC GGCGCTGCTG GACAACCTCG GCGGCGTCGG GACGATCACG ACGACGCTCG AATCGGCGCT TGGCTACCTC GGAGTCGTCG GCACGGGACT CTTACACCTG TTCGTGATGC TCGCGCTCGC GTTCTACCTG CTCCGGGACG ACCGCCGGTT CGCGGGCTGG GCCGGCCACC TGACGGCAGA GCGCACCGTC TTCGAGGAGT TCGTCGCTGC CGTCGACGAG AGCTTCGACA AGGTGTTCTA CGGGAACATC CTGAACGCGA TGATCACGGG GCTGGTCGGT GGCATCGTGT ACACGCTGCT CAACCAGATC GCGCCCGCGT CGGTCGCCAT TCCGTACGCG GCACTGGTCG GCGTGCTCGC CGGTGGGGCG AGCCTGATCC CCATCGTCGG CATGAAACTG GTCTACGTTC CGATGGCGGC CTATCTCGCC GTCCTGGCGG GGACCAGCGG CGAAGGGTGG TGGTTCGTCG TCGCCTTCGC CGCGGTCAGT TTCGTCGTCG TCGACGTGAT TCCGGACCTC CTCGTGCGGC CCTACGTCTC GGGCGGGGAA CTCCACACCG GTGCGCTGAT GTTCGCCTAC ATCCTCGGGC CGCTCATCTG GGGATGGTAC GGCATCTTCC TCGGTCCGAT CGTGCTGGTG TTGGTCACGC ACTTCGCACG GATCGTCCTG CCGGAGCTGG TGACCGACGA ACCGATTCGC GCCAGAGAAG TCGATCCGGC CGCGTTGACC GGCGACACCG ACGGCGCGGA CGCCGTCGGC GGCGAGAGAG ACGGCGACGA GGGCACACCA AGCGGGGCCT GA
|
Protein sequence | MVEFEFDVDW PRTAWVALGF VLAAALVFVA YRFVGTFVFG LFVYYATRRL YRRVRRRIEQ PSVAAGVSLV GMVLPALLLL AYTLGIALQQ LSRFADEIDG AGSGSQIEDL LAMADPYLNL TVLSDPTALL DNLGGVGTIT TTLESALGYL GVVGTGLLHL FVMLALAFYL LRDDRRFAGW AGHLTAERTV FEEFVAAVDE SFDKVFYGNI LNAMITGLVG GIVYTLLNQI APASVAIPYA ALVGVLAGGA SLIPIVGMKL VYVPMAAYLA VLAGTSGEGW WFVVAFAAVS FVVVDVIPDL LVRPYVSGGE LHTGALMFAY ILGPLIWGWY GIFLGPIVLV LVTHFARIVL PELVTDEPIR AREVDPAALT GDTDGADAVG GERDGDEGTP SGA
|
| |