Gene Information |
Locus tag | Hmuk_0622 |
Symbol | |
ID | 8410128 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halomicrobium mukohataei DSM 12286 |
Kingdom | Archaea |
Replicon accession | NC_013202 |
Strand | - |
Start bp | 577973 |
End bp | 581146 |
Gene Length | 3174 bp |
Protein Length | 1057 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 645018950 |
Product | PKD domain containing protein |
Protein accession | YP_003176461 |
Protein GI | 257386688 |
COG category | [R] General function prediction only |
COG ID | [COG3291] FOG: PKD repeat |
TIGRFAM ID | |
| |
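The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions and that the CDS ends with a single stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information table above.
START_BP = 577973
END_BP = 581146

def gene_length(start, end):
    """Length of an inclusive, 1-based coordinate interval (assumed convention)."""
    return end - start + 1

def expected_protein_length(length_bp):
    """Number of codons in the CDS minus one terminating stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1

length_bp = gene_length(START_BP, END_BP)
print(length_bp)                           # 3174, matches Gene Length
print(expected_protein_length(length_bp))  # 1057, matches Protein Length
```

Under these assumptions both derived values agree with the reported Gene Length (3174 bp) and Protein Length (1057 aa).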
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 0.40107 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGACACA CACGACGCGA TCTGTTACGG AAAGCATCGG CACTGTCGGC GCTCGCACTC GGTGCGAGCT CGACCGCAGC AGCACATTCG GGCCACTGCG ACGCGCCCGA CTGGGATTCG AGCGCAACCT ACAGCGGCGG CGACCAGGTC GTCTACGACG GCGCGCTGTG GACCGCCGAG TGGTGGACCC AGCGAGAACC CGACGCCGAC CTCGGGGCCT GGACCCGCGA GCACGACTGT GACGAGGAGA ACGAGGAGCC GACGGCGTCG TTCACGATGG ACGTACGCTC GCCCGCACCG GGCGAGACGG TCAGCTTCGA CGCGTCGGCG TCCTCGGACG CGGACGGCTC GATCGCGTCC TACGAGTGGG ACTTCGACGA CGGCGACTCC GCGACGGGCG AGACCGTCAC CCACAGCTTC GACGCCGAGG GCGAGTACAC GGTCACGCTG ATCGTCCGAG ACGACGAGGA GGCACCGGCC ACGGCGACCA AGACCGTCGC CGTCGGAGAG GGGTCGAACG TGGCACCCGA CGCCTCGTTC ACGGTCGATC CGGCCGAGCC CGATACCGGG GAGACGGTCA GCTTCGACGC GGCCGGCTCC TCGGACTCGG ACGGCTCGCT CGCGAGCTTC GAGTGGGACT TTGGCGACGG AGCCACGGCC ACCGGCGAGT CCGTCACCCA CACCTACGAG ACGGCGGACG ACTACACCGT CTCGCTGACC GTCACCGACG ACGACGGTGC CAGCGACTCG AACGACACTA CCGTCACGGT CTCGGGTACC GGCTCGGACG GCGGCAGCTG TGACGGCGTC GACGCCTGGG ACGCGGACGC GACCTACACC GGCGGCGATC AGGTCACCTA CGACGGCGAT CTGTGGGAAG CCAGCTGGTG GACCTCGGGT GACGAACCCG GCTCCAGCGA GTACGGTCCC TGGGAGAACG AGGGTGCCTG TGGCGACACC AACGAGGCAC CGACCGCGTC GTTCACGACG GACGCGAGCG CGCCCGCACC CGGCGAGGAC ATCACCTTCG ACGCCTCGGA GTCCTCGGAC GCGGACGGCT CGATCGCGTC CTACGAGTGG GACTTCGGCG ACGGCTCGGC GGCCACCGGC GAGGTGGCGA CCCACGGCTA CGCCGATGCT GGCGATTACA CCGTGACGCT GACGGTCTCC GACGACGCGG GCGCGACCGA CAGCACGTCG ACGACGGTGT CGATCAGCGA CGCCGACAAC CCGACCCAGA TCACGATCGA CAGCAGCATC GACGGCTGGC TCGGTGCCGA ACCGGCAGCC ATCGAGGGCG AGCGGAACCC GACGATCTCG CTGACCTCCG GCGAGTCCTA CGCGTTCACC TACAACAACG TCGACGGGAT CCCCCACGAT CTCGTGATCC GCAACGCCGC TGGCGAGGAG CTGCTGGCGA CCGAGCGGGT CGGCCAGGAA GGCCAGAGCC GCACCCTGGA GTTCGAGGCC ACCGAGGAGA TGGTCGAGTA CATCTGTACG GTCCATCCGG AGTCGATGGT CGGCGACCTC GCGATCGACG GGAACGCCCC GCCGATGGCG GGCCTGACGA TCGATACGGC CAATCCGGAC CCCGGCCAGG AAGTCAGCTT CGACGCCTCT GAGTCCTCGG ACGCGGACGG CTCGATCGCG TCCTACGAGT GGGACTTCGG CGACGGTTCG ACGGCCACCG GCGAGACGGC CTCTCACACC TACGCCGAGG CGGGCGAGTA CACCGTCGAG CTGACGGTCA CCGACGACGG CGGCGCGTCG GCCGTGACCA GCACCGACAT CACGGTCGGC 
TCGGTCAACG AGGGGCCGAC GGCGTCGTTC TCCGTCGATC CCGCTTCGCC GGCACCCGGT GAACAGGCCT CTTTCGACGC CTCGGAGTCC TCGGATCCGG ACGGCTCGAT CGCGTCCTAC GAGTGGGCGT TCGGTGACGA CTCGACGGCC ACCGGCGAGA CGGCCTCCCA CACCTACGCC GGTTCGGGGG CCTACAGCGT CGAACTGACG GTCACCGACG ACGGCGGCGC GACCGCCACC ACGACGACGA CGGTCCAGGT CGGCGACGGC AGCGACATCA CCGCCAGCAC GACGCTCGAA GAGTTCTACC CCGCGTACGA CGAGGACTTC AACCCTACCT ACACCGAAGG CGGCGTCCAG GGACTGCTCA AAAACGAGCT GAACGGCGAC GCCAGCGAGT ACGGGGCCGA CGTCGACGCG ATCCAGAACA ACGCCGGGGA CGGCTCGATG CAACTCGGAG CGCTGGGCGA CCGCGGGCTC GAACTGGTCA AGCAGTTCGA CGCGGCGGGC GTGCCCCGAG AGAACAACGC CCGGATCATG CCCTGGGTCG TCGGGTTGCC CGAGGAGACC GAACCGATCC CGTTCAACGA CGGCGGCGGA CGGGACGGCG GCCTGACGGC CGACGCGGGG CCGGTGGCGG CGAACAACGA CCCCTCGGTG CTGGTCCAGG ACGCCTGGCC AAGCGGCGAG CAGTTCGACG ACGACTACCT CCAGCCCGAC CGGGTCGAGT GGAGCAGCGG CGTCAGCGAC AGCCAGTACA ACAACACCGA CAACCCGATC ATCGAGGCCA CGACGGAGAA GGTCCACCCG GTCACGGGCG AGAGCCTCGG CGACGGTTTC ACCGCAAACG CCCCGGTGGA AGCGACCGCC GAACTCCACG GCGACGGCTG GCTGTTCGAC ACGTCGCTGA TCTTCGAGAA CCGCACCGAG GTGCCGCTGC TGATCGACGG CGCGATCATG TGGTACGTCG CCCCGTCGAA AGAGGCCACT GGGCTGGACC AGTTCTCGTA CGACAACGCA CAGCGCAAGA ACTTCTCGGT GGGCCACCCA CAGCGCGACG TGATCGAGGT GGCACTGCCC GACGGGGCCA AGCCGGGACC CTTCGAGGGG ACCGACGCGC CGCTGTCGGC GTACGCCATT CGTGTCACCC ACCACGACAA CCCGTACATG TACCGGGTCC TCTACCCCAA CCAGAAGTTC TCGATGACCT ACGGGAACGT CACGGGTCCC AGCCAGTACG ACTGGCCTGT CGACGATCTG GTCGACGTGA TGCTCGACAC GCTGCACCTC GAATTCCGCA CGCAGATGGA CAGCCTCGAA CAGAACACCG AGCTGGTCGA CGCACTCGAC ATGCGGAACC GCTACGGGAA CTAG
|
Protein sequence | MRHTRRDLLR KASALSALAL GASSTAAAHS GHCDAPDWDS SATYSGGDQV VYDGALWTAE WWTQREPDAD LGAWTREHDC DEENEEPTAS FTMDVRSPAP GETVSFDASA SSDADGSIAS YEWDFDDGDS ATGETVTHSF DAEGEYTVTL IVRDDEEAPA TATKTVAVGE GSNVAPDASF TVDPAEPDTG ETVSFDAAGS SDSDGSLASF EWDFGDGATA TGESVTHTYE TADDYTVSLT VTDDDGASDS NDTTVTVSGT GSDGGSCDGV DAWDADATYT GGDQVTYDGD LWEASWWTSG DEPGSSEYGP WENEGACGDT NEAPTASFTT DASAPAPGED ITFDASESSD ADGSIASYEW DFGDGSAATG EVATHGYADA GDYTVTLTVS DDAGATDSTS TTVSISDADN PTQITIDSSI DGWLGAEPAA IEGERNPTIS LTSGESYAFT YNNVDGIPHD LVIRNAAGEE LLATERVGQE GQSRTLEFEA TEEMVEYICT VHPESMVGDL AIDGNAPPMA GLTIDTANPD PGQEVSFDAS ESSDADGSIA SYEWDFGDGS TATGETASHT YAEAGEYTVE LTVTDDGGAS AVTSTDITVG SVNEGPTASF SVDPASPAPG EQASFDASES SDPDGSIASY EWAFGDDSTA TGETASHTYA GSGAYSVELT VTDDGGATAT TTTTVQVGDG SDITASTTLE EFYPAYDEDF NPTYTEGGVQ GLLKNELNGD ASEYGADVDA IQNNAGDGSM QLGALGDRGL ELVKQFDAAG VPRENNARIM PWVVGLPEET EPIPFNDGGG RDGGLTADAG PVAANNDPSV LVQDAWPSGE QFDDDYLQPD RVEWSSGVSD SQYNNTDNPI IEATTEKVHP VTGESLGDGF TANAPVEATA ELHGDGWLFD TSLIFENRTE VPLLIDGAIM WYVAPSKEAT GLDQFSYDNA QRKNFSVGHP QRDVIEVALP DGAKPGPFEG TDAPLSAYAI RVTHHDNPYM YRVLYPNQKF SMTYGNVTGP SQYDWPVDDL VDVMLDTLHL EFRTQMDSLE QNTELVDALD MRNRYGN
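The sequences above are printed in space-separated blocks of ten characters. A minimal sketch of how such a string could be cleaned, checked for GC content, and translated is given below. It assumes that the gene sequence is already shown in coding orientation (it begins with ATG even though the gene is on the minus strand), and it relies on translation table 11 using the same codon-to-amino-acid assignments as the standard code. The helper names `clean`, `gc_content`, and `translate` are hypothetical.

```python
from itertools import product

# Codon table in TCAG order; table 11 shares these assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)}

def clean(seq):
    """Remove the block-separating whitespace and normalise to upper case."""
    return "".join(seq.split()).upper()

def gc_content(seq):
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)

def translate(seq):
    """Translate a cleaned coding sequence, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)

# First three blocks of the gene sequence shown above.
first_blocks = clean("ATGAGACACA CACGACGCGA TCTGTTACGG")
print(translate(first_blocks))  # MRHTRRDLLR, the first block of the protein sequence
```

Running `gc_content(clean(...))` on the full gene sequence would give a value to compare against the reported GC content field.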
|
| |