Gene Hmuk_2106 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHmuk_2106 
Symbol 
ID8411644 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalomicrobium mukohataei DSM 12286 
KingdomArchaea 
Replicon accessionNC_013202 
Strand
Start bp2010412 
End bp2012196 
Gene Length1785 bp 
Protein Length594 aa 
Translation table11 
GC content73% 
IMG OID645020447 
Producthypothetical protein 
Protein accessionYP_003177926 
Protein GI257388153 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.057671 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTCCCTA CGCACCGCAC CGCCGCCCGG ACGAACGCCC CGACCCGACA GCTCCTGATC 
GTGGCGACGT GTGTCGTCGG CCTCCTCGTC GCCGCAGCCG CCATGCCAGC CACGGCACCA
CAGCCGACGC CCCCGGAGTT CCTGAGCGGG GAGTCGGACT GCCAGATCCT CTTCTCCGAG
GATCCGGTCG CCGGACACGA ACTCACCACG ACGGTCCTGT ACGACGAGGA GCCGGTGTCG
GACTCACCGG TGTGGTTCAA CGGCGAACGC GTCGGCCGGA CCGACGAGGA CGGACGGGTC
GTCGGCACCG TCCCCTACGA GCAGACCCTG CAGGTCCGGG TTGAACTGCC CGGCGGGGGC
AGCTGTGAGG CCAGCATCGA CACCGGCGAC TCGGACGCGC CGCTGAAGAC CGTCGACCGA
TCCGGCCTCG GTGTCGCCGC CCTCGACGGC GTGGCCCAGC AACAGGCTCA GAACGGGTCG
GGCGCGTACC CGGTCCGTGG CCGGATCGAT CTCTCGGTCG ACGAACAGCC CTACCCCGGC
GAGACGACGA CGCTGCGGGC GACGATCCAG GGCAACCCCG TGGCCGACGC GACGGTGTCG
GTCGACGGAC GGACGATCGG ACGGACCGAC GCAGACGGCA CGATCGAGAT TCGCGCGCCG
ATCGAGGGCG ACCGGACGCT CACCGTCCAC GTCGAACGGG GGGCCTTCGA ACGCGAGACC
GAGATCGTCG TCCTCCGCCT CGACGCGACG ATACGGACCG ACGCACTGCT CGCGCTGCCG
GGGCAGAACG CGACGGTCGT CGCCCGGCTC GGCGACCGCC CGGCGGTCAA CGCCACGGCG
CTGGTCGGCG GCGAGCGACT GGGTCGAACG GACGCGGACG GCCACGTCGA GATGACGCTT
CCGGCCGACC CGACGGCACG GCTCACCGTC GCGACGGCCG ATCAGGTGGC CACCACGCCG
GTCCTGTTGG CGTTCGCGCC GACGATCCTG CTGACCGTCC TCGCCGTCCT CGCGCTCGTC
GGCGTGCCGG CGGCCGGCTA CTCGATCGCC GGTCGTCGCG GTGTGGCGAT CGGCTCCGGC
GTCGCCGTCA GCGTCCTGGC GCTCGCGTAC GTGTTTCTCC GCTTTGGCCG CACGATCGCA
CTGCTGGGGT CGCTGGCGCT GCTGGCGGTC GTCGGACTCG TCGCCTTCCT CCGGAGCGAC
TACAGCGCGG TCGAGGCGGC CCGAGCGACC GCCGGCTGGT TCCGCCGTCT CGGCCGCCGA
CTGGCCTCCG ACGGCCTGTG GCTGTCCGGG CGACTCGAAG CTGCCGTCGG GAGCCTCGAA
CGCCGTCTCC GGCGTCTGTG GGACCGACTG ATGGGTCCAG ACGCGACGCC GCTCGCGGAC
GCCGGCCGCT GGCTCACGTC GCTTCCCGCC CGCCTGCTCG CGCTCGTCCG CGCTCTCGCC
CGCGGGCCGA GCTGGCTAGG TGCCGACGAG AGCGACCGCG ACGACCTCGC GGGCGAATCC
GGAGACGCGG ACGACGAGAC GACGCTCTCG CGGCGCGCAC AGTTCCGCCG CGTGTGGCGT
GCGTTCGCCG GTCGGGTCGC CCCCGAGACG TGGCCCCGGC GCACGGCCGG CGAGGTCTCC
CGTCGGGCGA TCGATCGCGG CCTCTCGCCC GAGCCGGTCC GCGAACTGAC CGACACGTTC
CGGGCCGTCG AGTACGGCGA CGAGTCGCTC ACCGACGGAC AGGTAGCGCG GGCTCGCGCG
GCCCTGGAGG AGATCCGCGA CGACTCCGAG GGGGGGTCGG CGTGA
 
Protein sequence
MVPTHRTAAR TNAPTRQLLI VATCVVGLLV AAAAMPATAP QPTPPEFLSG ESDCQILFSE 
DPVAGHELTT TVLYDEEPVS DSPVWFNGER VGRTDEDGRV VGTVPYEQTL QVRVELPGGG
SCEASIDTGD SDAPLKTVDR SGLGVAALDG VAQQQAQNGS GAYPVRGRID LSVDEQPYPG
ETTTLRATIQ GNPVADATVS VDGRTIGRTD ADGTIEIRAP IEGDRTLTVH VERGAFERET
EIVVLRLDAT IRTDALLALP GQNATVVARL GDRPAVNATA LVGGERLGRT DADGHVEMTL
PADPTARLTV ATADQVATTP VLLAFAPTIL LTVLAVLALV GVPAAGYSIA GRRGVAIGSG
VAVSVLALAY VFLRFGRTIA LLGSLALLAV VGLVAFLRSD YSAVEAARAT AGWFRRLGRR
LASDGLWLSG RLEAAVGSLE RRLRRLWDRL MGPDATPLAD AGRWLTSLPA RLLALVRALA
RGPSWLGADE SDRDDLAGES GDADDETTLS RRAQFRRVWR AFAGRVAPET WPRRTAGEVS
RRAIDRGLSP EPVRELTDTF RAVEYGDESL TDGQVARARA ALEEIRDDSE GGSA