Gene Hmuk_0622 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHmuk_0622 
Symbol 
ID8410128 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalomicrobium mukohataei DSM 12286 
KingdomArchaea 
Replicon accessionNC_013202 
Strand
Start bp577973 
End bp581146 
Gene Length3174 bp 
Protein Length1057 aa 
Translation table11 
GC content68% 
IMG OID645018950 
ProductPKD domain containing protein 
Protein accessionYP_003176461 
Protein GI257386688 
COG category[R] General function prediction only 
COG ID[COG3291] FOG: PKD repeat 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value0.40107 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGACACA CACGACGCGA TCTGTTACGG AAAGCATCGG CACTGTCGGC GCTCGCACTC 
GGTGCGAGCT CGACCGCAGC AGCACATTCG GGCCACTGCG ACGCGCCCGA CTGGGATTCG
AGCGCAACCT ACAGCGGCGG CGACCAGGTC GTCTACGACG GCGCGCTGTG GACCGCCGAG
TGGTGGACCC AGCGAGAACC CGACGCCGAC CTCGGGGCCT GGACCCGCGA GCACGACTGT
GACGAGGAGA ACGAGGAGCC GACGGCGTCG TTCACGATGG ACGTACGCTC GCCCGCACCG
GGCGAGACGG TCAGCTTCGA CGCGTCGGCG TCCTCGGACG CGGACGGCTC GATCGCGTCC
TACGAGTGGG ACTTCGACGA CGGCGACTCC GCGACGGGCG AGACCGTCAC CCACAGCTTC
GACGCCGAGG GCGAGTACAC GGTCACGCTG ATCGTCCGAG ACGACGAGGA GGCACCGGCC
ACGGCGACCA AGACCGTCGC CGTCGGAGAG GGGTCGAACG TGGCACCCGA CGCCTCGTTC
ACGGTCGATC CGGCCGAGCC CGATACCGGG GAGACGGTCA GCTTCGACGC GGCCGGCTCC
TCGGACTCGG ACGGCTCGCT CGCGAGCTTC GAGTGGGACT TTGGCGACGG AGCCACGGCC
ACCGGCGAGT CCGTCACCCA CACCTACGAG ACGGCGGACG ACTACACCGT CTCGCTGACC
GTCACCGACG ACGACGGTGC CAGCGACTCG AACGACACTA CCGTCACGGT CTCGGGTACC
GGCTCGGACG GCGGCAGCTG TGACGGCGTC GACGCCTGGG ACGCGGACGC GACCTACACC
GGCGGCGATC AGGTCACCTA CGACGGCGAT CTGTGGGAAG CCAGCTGGTG GACCTCGGGT
GACGAACCCG GCTCCAGCGA GTACGGTCCC TGGGAGAACG AGGGTGCCTG TGGCGACACC
AACGAGGCAC CGACCGCGTC GTTCACGACG GACGCGAGCG CGCCCGCACC CGGCGAGGAC
ATCACCTTCG ACGCCTCGGA GTCCTCGGAC GCGGACGGCT CGATCGCGTC CTACGAGTGG
GACTTCGGCG ACGGCTCGGC GGCCACCGGC GAGGTGGCGA CCCACGGCTA CGCCGATGCT
GGCGATTACA CCGTGACGCT GACGGTCTCC GACGACGCGG GCGCGACCGA CAGCACGTCG
ACGACGGTGT CGATCAGCGA CGCCGACAAC CCGACCCAGA TCACGATCGA CAGCAGCATC
GACGGCTGGC TCGGTGCCGA ACCGGCAGCC ATCGAGGGCG AGCGGAACCC GACGATCTCG
CTGACCTCCG GCGAGTCCTA CGCGTTCACC TACAACAACG TCGACGGGAT CCCCCACGAT
CTCGTGATCC GCAACGCCGC TGGCGAGGAG CTGCTGGCGA CCGAGCGGGT CGGCCAGGAA
GGCCAGAGCC GCACCCTGGA GTTCGAGGCC ACCGAGGAGA TGGTCGAGTA CATCTGTACG
GTCCATCCGG AGTCGATGGT CGGCGACCTC GCGATCGACG GGAACGCCCC GCCGATGGCG
GGCCTGACGA TCGATACGGC CAATCCGGAC CCCGGCCAGG AAGTCAGCTT CGACGCCTCT
GAGTCCTCGG ACGCGGACGG CTCGATCGCG TCCTACGAGT GGGACTTCGG CGACGGTTCG
ACGGCCACCG GCGAGACGGC CTCTCACACC TACGCCGAGG CGGGCGAGTA CACCGTCGAG
CTGACGGTCA CCGACGACGG CGGCGCGTCG GCCGTGACCA GCACCGACAT CACGGTCGGC
TCGGTCAACG AGGGGCCGAC GGCGTCGTTC TCCGTCGATC CCGCTTCGCC GGCACCCGGT
GAACAGGCCT CTTTCGACGC CTCGGAGTCC TCGGATCCGG ACGGCTCGAT CGCGTCCTAC
GAGTGGGCGT TCGGTGACGA CTCGACGGCC ACCGGCGAGA CGGCCTCCCA CACCTACGCC
GGTTCGGGGG CCTACAGCGT CGAACTGACG GTCACCGACG ACGGCGGCGC GACCGCCACC
ACGACGACGA CGGTCCAGGT CGGCGACGGC AGCGACATCA CCGCCAGCAC GACGCTCGAA
GAGTTCTACC CCGCGTACGA CGAGGACTTC AACCCTACCT ACACCGAAGG CGGCGTCCAG
GGACTGCTCA AAAACGAGCT GAACGGCGAC GCCAGCGAGT ACGGGGCCGA CGTCGACGCG
ATCCAGAACA ACGCCGGGGA CGGCTCGATG CAACTCGGAG CGCTGGGCGA CCGCGGGCTC
GAACTGGTCA AGCAGTTCGA CGCGGCGGGC GTGCCCCGAG AGAACAACGC CCGGATCATG
CCCTGGGTCG TCGGGTTGCC CGAGGAGACC GAACCGATCC CGTTCAACGA CGGCGGCGGA
CGGGACGGCG GCCTGACGGC CGACGCGGGG CCGGTGGCGG CGAACAACGA CCCCTCGGTG
CTGGTCCAGG ACGCCTGGCC AAGCGGCGAG CAGTTCGACG ACGACTACCT CCAGCCCGAC
CGGGTCGAGT GGAGCAGCGG CGTCAGCGAC AGCCAGTACA ACAACACCGA CAACCCGATC
ATCGAGGCCA CGACGGAGAA GGTCCACCCG GTCACGGGCG AGAGCCTCGG CGACGGTTTC
ACCGCAAACG CCCCGGTGGA AGCGACCGCC GAACTCCACG GCGACGGCTG GCTGTTCGAC
ACGTCGCTGA TCTTCGAGAA CCGCACCGAG GTGCCGCTGC TGATCGACGG CGCGATCATG
TGGTACGTCG CCCCGTCGAA AGAGGCCACT GGGCTGGACC AGTTCTCGTA CGACAACGCA
CAGCGCAAGA ACTTCTCGGT GGGCCACCCA CAGCGCGACG TGATCGAGGT GGCACTGCCC
GACGGGGCCA AGCCGGGACC CTTCGAGGGG ACCGACGCGC CGCTGTCGGC GTACGCCATT
CGTGTCACCC ACCACGACAA CCCGTACATG TACCGGGTCC TCTACCCCAA CCAGAAGTTC
TCGATGACCT ACGGGAACGT CACGGGTCCC AGCCAGTACG ACTGGCCTGT CGACGATCTG
GTCGACGTGA TGCTCGACAC GCTGCACCTC GAATTCCGCA CGCAGATGGA CAGCCTCGAA
CAGAACACCG AGCTGGTCGA CGCACTCGAC ATGCGGAACC GCTACGGGAA CTAG
 
Protein sequence
MRHTRRDLLR KASALSALAL GASSTAAAHS GHCDAPDWDS SATYSGGDQV VYDGALWTAE 
WWTQREPDAD LGAWTREHDC DEENEEPTAS FTMDVRSPAP GETVSFDASA SSDADGSIAS
YEWDFDDGDS ATGETVTHSF DAEGEYTVTL IVRDDEEAPA TATKTVAVGE GSNVAPDASF
TVDPAEPDTG ETVSFDAAGS SDSDGSLASF EWDFGDGATA TGESVTHTYE TADDYTVSLT
VTDDDGASDS NDTTVTVSGT GSDGGSCDGV DAWDADATYT GGDQVTYDGD LWEASWWTSG
DEPGSSEYGP WENEGACGDT NEAPTASFTT DASAPAPGED ITFDASESSD ADGSIASYEW
DFGDGSAATG EVATHGYADA GDYTVTLTVS DDAGATDSTS TTVSISDADN PTQITIDSSI
DGWLGAEPAA IEGERNPTIS LTSGESYAFT YNNVDGIPHD LVIRNAAGEE LLATERVGQE
GQSRTLEFEA TEEMVEYICT VHPESMVGDL AIDGNAPPMA GLTIDTANPD PGQEVSFDAS
ESSDADGSIA SYEWDFGDGS TATGETASHT YAEAGEYTVE LTVTDDGGAS AVTSTDITVG
SVNEGPTASF SVDPASPAPG EQASFDASES SDPDGSIASY EWAFGDDSTA TGETASHTYA
GSGAYSVELT VTDDGGATAT TTTTVQVGDG SDITASTTLE EFYPAYDEDF NPTYTEGGVQ
GLLKNELNGD ASEYGADVDA IQNNAGDGSM QLGALGDRGL ELVKQFDAAG VPRENNARIM
PWVVGLPEET EPIPFNDGGG RDGGLTADAG PVAANNDPSV LVQDAWPSGE QFDDDYLQPD
RVEWSSGVSD SQYNNTDNPI IEATTEKVHP VTGESLGDGF TANAPVEATA ELHGDGWLFD
TSLIFENRTE VPLLIDGAIM WYVAPSKEAT GLDQFSYDNA QRKNFSVGHP QRDVIEVALP
DGAKPGPFEG TDAPLSAYAI RVTHHDNPYM YRVLYPNQKF SMTYGNVTGP SQYDWPVDDL
VDVMLDTLHL EFRTQMDSLE QNTELVDALD MRNRYGN