Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hmuk_0253 |
Symbol | |
ID | 8409751 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halomicrobium mukohataei DSM 12286 |
Kingdom | Archaea |
Replicon accession | NC_013202 |
Strand | - |
Start bp | 249870 |
End bp | 250748 |
Gene Length | 879 bp |
Protein Length | 292 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 645018578 |
Product | Protein of unknown function DUF439 |
Protein accession | YP_003176097 |
Protein GI | 257386324 |
COG category | [S] Function unknown |
COG ID | [COG2469] Uncharacterized conserved protein |
TIGRFAM ID | |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 32 |
Plasmid unclonability p-value | 0.923464 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.0098053 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGTCGGAAT CCGTCATCGC TGATTTCGTC GGCAACTTCA ACGCGGAGGT GATGGCGACG ACCGAGCCGG TCAAGGGGCG GGTGTTGCTG AGCCAGAAGC GACTGGTGTT GGCGGCCAAC GAGAACGACA AATTCACCAT TCCGCTGTCG TCGATCTTCG ACGTCGCGGT CGGACACGTG CCGCCGGACC TGGGGGACTT CTTCGACTCG ACGGTCACCG TCGCTTTCGA ACGAAACGAC AGTCGACACG TCGCCGTCGT CGAGGGCGAC GACGACAAGA TCTCGAAGTT CTCGACGGTG TTGTTCAAGG CGATTCTCAA CGGTACCGAG ACGACCGTCA AAGAGCGGGC ACGCGTCGGT GGGCGCGTCA CCGACGATGC GTTCTCGACG GCGAAGCTGT TCGTCTCGGA CGGGTCGGTC GAGTTCCGGT CGTCGGACGG CGTTTTCGTC GTCGATCTGG CCACCGTCAC CGACTTCCAG CGCACGACCC GCGAGATCGC CGGTTCCGAC CGCCCGACGC TGACCTTCCG GCACATGATG GACGGCACTG CCGTCACGAC CATGGCCGCG ATGGCGTCGC CGCGCAAGAT GTCGATCCTG GGTCGCTACC TCCGCATCGA GTACAGCGAT CTCATGGAGG ATCTGCAAGA CATCGAACTC ACCGACGACA AGACCGAGAT CCTCGTCGCC ATCTACTCGA CGGGGGACAT GGACGGGATG CCACTGGCAG ACATCGTCGG CACCGACAGC TCCGCGGTGA CGATGCTGTT GCGAGACCTC GAAGAAGACG GGCTCGTCCA GGACTCGTCC GACGGTCCGA AGCTCACCCC CAAAGGCGAG GTCGTCGCGA GCCGACACCT CGAAGACGTG AACTCCTGA
|
Protein sequence | MSESVIADFV GNFNAEVMAT TEPVKGRVLL SQKRLVLAAN ENDKFTIPLS SIFDVAVGHV PPDLGDFFDS TVTVAFERND SRHVAVVEGD DDKISKFSTV LFKAILNGTE TTVKERARVG GRVTDDAFST AKLFVSDGSV EFRSSDGVFV VDLATVTDFQ RTTREIAGSD RPTLTFRHMM DGTAVTTMAA MASPRKMSIL GRYLRIEYSD LMEDLQDIEL TDDKTEILVA IYSTGDMDGM PLADIVGTDS SAVTMLLRDL EEDGLVQDSS DGPKLTPKGE VVASRHLEDV NS
|
| |