Gene Information |
Locus tag | Hmuk_0885 |
Symbol | |
ID | 8410400 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halomicrobium mukohataei DSM 12286 |
Kingdom | Archaea |
Replicon accession | NC_013202 |
Strand | - |
Start bp | 854514 |
End bp | 855926 |
Gene Length | 1413 bp |
Protein Length | 470 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 645019220 |
Product | protein of unknown function DUF21 |
Protein accession | YP_003176722 |
Protein GI | 257386949 |
COG category | [R] General function prediction only |
COG ID | [COG1253] Hemolysins and related proteins containing CBS domains |
TIGRFAM ID | |
|
|
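The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the stop codon; the function names are hypothetical and not part of the source database.

```python
# Consistency check for the coordinate and length fields of this record.
# Assumptions (not stated in the record itself): coordinates are 1-based
# and inclusive, and the CDS length includes the terminal stop codon.

def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature spanning start_bp..end_bp, both ends included."""
    return end_bp - start_bp + 1


def expected_protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS: codons minus the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(854514, 855926)
protein = expected_protein_length_aa(length)
print(length)   # 1413
print(protein)  # 470
```

Both values agree with the Gene Length (1413 bp) and Protein Length (470 aa) fields listed in the record.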
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 0.182877 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 0.402091 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTCTTG ACCCCCCTGT CGGAGTCGAT TCACTGGTAG CCTTGCCACT ACAGGTCGAC ATCTACGGTC CCGTCCAACT GCCCGAGAAC GTGTTTCTCG GGATCGGTGC GGGGATCATT CTCCTCTTGA TCGGGCTGTC GGCGTTTTTC TCCTCGTCGG AGATCGCCAT GTTCTCGCTG CCCAGCCACC GCGTCGAGGC CCTCGTCGAG GACGCCGTCC CCGGGTCGAA GACCCTCAAA GGGCTCAAAG AGGACCCACA CCGGCTGCTG GTGACGATTC TGGTGGGGAA CAACCTCGTC AACATCGCGA TGTCCTCGAT CGCGACGGGG CTGTTGACGT ACTACCTCGG GAGCGGCCAG CAGGGCCTCG CGGTCGCGCT CTCGACGTTC GGGATCACCG CGATCGTCCT GCTGTTCGGC GAGAGCGCGC CCAAGAGCTA CGCCGTCGAG AACACGGAGT CGTGGGCCCT GCGGATCGCC AAGCCGCTGA AGTTCTCCGA GAAGGTCCTG TTGCCCCTGA TCGTCCTGTT CGACTATCTC ACTCGAATCG TCAACAAGGT CACCGGCGGC CGCTCGGCCA TCGAGACCTC GTACGTCACT CGCGAGGAGA TCCAGGACAT CATCGAGACC GGCGAGCGCG AGGGCGTCCT CGACGAGGAG GAACGCGAGA TGCTCCAGCG GACGCTGCGG TTCAACAACA CGATCGCCAA GGAAGTGATG ACGCCGCGAC TGGACATGGA GGCCATCTCG AAGGACGCCT CCGTCGAGGA GGCGATCCAG CAGTGCGTCC AGAGCGGCCA CGCCCGCGTG CCGGTGTACG AGGGGAGTCT CGACAACGTC ATCGGCATCG CACACCTGCG GGACCTGGTC CGGGATCGGG ACTACAGCGA CGCAGAGACC GCACTCGCGG ACCTCATCGA ACCGACGCTG CACGTTCCCG AGTCGAAAAA CGTCGACGAC CTCCTGACGG AGATGCGCCG CGAACGGCTC CACATGGTCA TCGTCATCGA CGAGTTCGGC ACCACCGAGG GACTGGTGAC GATGGAGGAC CTCACCGAGG AGATCGTCGG CGAGATCCTC GAAGGCGAGG AAGAAGAGCC GATCGAGTTC GTCGGCGACG GCGAGGTCGT CGTCAAAGGC GAGGTCAACA TCGAGGAGGT CAACGAGGCG ATGGACCTCG AACTGCCCGA GGGCGAGGAG TTCGAGACCA TCGCCGGCTT CATCTTCAAC CGCGCGGGCC GCCTCGTCGA GGAGGGCGAA CGCATCGAGT TCGACAGCGT CGAGATCGTC GTCGAGCGGG TCGAGAACAC CCGCATCATG AAGGCGCGAC TGTCGCGTAT CGAGCCCGAA GACGGCGAGA ACGGCGAGAC CGAGAGCGAT GGCGACACGT CCACACAAAC GCCGACCGAG TGA
|
Protein sequence | MALDPPVGVD SLVALPLQVD IYGPVQLPEN VFLGIGAGII LLLIGLSAFF SSSEIAMFSL PSHRVEALVE DAVPGSKTLK GLKEDPHRLL VTILVGNNLV NIAMSSIATG LLTYYLGSGQ QGLAVALSTF GITAIVLLFG ESAPKSYAVE NTESWALRIA KPLKFSEKVL LPLIVLFDYL TRIVNKVTGG RSAIETSYVT REEIQDIIET GEREGVLDEE EREMLQRTLR FNNTIAKEVM TPRLDMEAIS KDASVEEAIQ QCVQSGHARV PVYEGSLDNV IGIAHLRDLV RDRDYSDAET ALADLIEPTL HVPESKNVDD LLTEMRRERL HMVIVIDEFG TTEGLVTMED LTEEIVGEIL EGEEEEPIEF VGDGEVVVKG EVNIEEVNEA MDLELPEGEE FETIAGFIFN RAGRLVEEGE RIEFDSVEIV VERVENTRIM KARLSRIEPE DGENGETESD GDTSTQTPTE
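The sequences above are printed in space-separated blocks of ten characters. The sketch below shows one way to strip that formatting, compute a GC fraction, and translate codons, using only the first 30 bases of the gene sequence as input. It is an illustration under stated assumptions: the helper names are hypothetical, the listed sequence is assumed to be in coding orientation (it begins with ATG and ends with TGA), and the codon table is the standard amino acid assignment, which translation table 11 shares for sense codons.

```python
# Sketch: clean a block-formatted sequence, compute GC fraction, translate.
# Helper names are hypothetical; only the Python standard library is used.
from itertools import product

BASES = "TCAG"
# Standard amino acid assignments in TCAG codon order ('*' marks a stop).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def clean(seq: str) -> str:
    """Remove whitespace between blocks and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons in frame 1, stopping at the first stop."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence from this record.
prefix = "ATGGCTCTTG ACCCCCCTGT CGGAGTCGAT"
print(translate(prefix))              # MALDPPVGVD
print(round(gc_fraction(prefix), 2))  # 0.6
```

The translated prefix matches the first block of the protein sequence in the record. Passing the full gene sequence to the same helpers would allow the GC content and protein length fields to be checked in the same way.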