Gene Hmuk_0944 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHmuk_0944 
Symbol 
ID8410460 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalomicrobium mukohataei DSM 12286 
KingdomArchaea 
Replicon accessionNC_013202 
Strand
Start bp906044 
End bp907345 
Gene Length1302 bp 
Protein Length433 aa 
Translation table11 
GC content65% 
IMG OID645019279 
Productextracellular solute-binding protein family 1 
Protein accessionYP_003176780 
Protein GI257387007 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones32 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value0.322748 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTCGAAG CCATCGGTGC CGGTGGCCTC ATCGCCATCT CCGGCTGTAT GGGCGACGGT 
GGCGATGGCG GCGATGGCGG TGACGGTGGC AGCGACGGCG GTGACGGTGG CAGCGACGGC
AGCGACGGTG GCGATCAGCA GACCATCCAG TTCCTGACGA TGGGGGTCGG CGACAACATC
GCCGAGTTCT TCGAGAAGAA CAACGCGGCC TTCGAAGAGG AGTTCGGCGT CACGCTTGAC
TTCACGAGCG TCACCTGGGA CAACGCCCAG CAGACGGTCA ACAACCGCGT CGACGGCGGC
GAGGCACCTG ACGTAAGTCG CTGGCCGGCC CGCTGGATCC CCCAGCTCGT CGGCAAGGAA
GCGCTCGTCC CCATCACCGA CATGATGGAA GGCGAGTTCG GCGACCAGTT CTACCAGGGC
ATGGCCGACG GCTGTATGTA CCAGGGCGAG TACTACGCTG CCCCCTGGGC CGCATCCAAC
AAGTGCTTCT ACTACAACAA GGACGTGTTC GAGGCGGCGG GCCTCGATCC GGAGGACCCC
CAGCTCGACA CCTGGGACGA CATGCTCTCG GCGGCCCAGA CCATCACCGA GGAGACCGAC
ACCCCCGCAC TGGGACTGGC CGGTGCCGAC GCCATCGAGA CCGGCTCGCA GTACTACCAC
TACCACTGGT CACACGGCGC GGACCTGATC GACGACGAGG GTCAGCCGGT CGTCAACTCC
GATGGGGCCG TCGAGGCGCT GAGCTTCTAC TCGGACCTGC ACCTCGAACA CGGCGTCACT
CAGTCCTCGC CGCTGTCCTC GACGCGCCAG GACATCCGTC AGCTGTTCGA GTCCGGCTCG
CTGGGTATGG TCATCGCCCA CGTCTACACG GGCATCAACA TCGACGACAG CGACGCCGAC
TTCGACTACG GGATCGCACA GGTGCCGGAG GGGCCCGCTG GCCGCTACAG CCTGAACACG
ATCGACGGCG TTTCGATCTT CGCCCAGACC GAGGTCGAGG ACCTCGCGCG GGACCTGCTA
CGGTTCTACT TCGACGAGGA CCGCCACTTC GAGTACGCGA GCAGCAAGGG ATTCATGCCG
ACGGTCGAGG CGGTCGGCGA GCGCGACTAC TTCCAGGACT CGGAGAACTG GGCACCGTTC
ATCGAGGCCG GTCAGTACGC CCGCGCTCGG CCGAAACTGT CGAACTTCAA CGAGTTCAAC
AACCGCATGG TCCAGGCGAT CCAGGAAGCG CTGGGCGACC AGAAGTCCCC CCAGCAGGCC
CTGGACGACG CACAGGCGGA CCTCGAAGAG ATGATGCAAT AA
 
Protein sequence
MLEAIGAGGL IAISGCMGDG GDGGDGGDGG SDGGDGGSDG SDGGDQQTIQ FLTMGVGDNI 
AEFFEKNNAA FEEEFGVTLD FTSVTWDNAQ QTVNNRVDGG EAPDVSRWPA RWIPQLVGKE
ALVPITDMME GEFGDQFYQG MADGCMYQGE YYAAPWAASN KCFYYNKDVF EAAGLDPEDP
QLDTWDDMLS AAQTITEETD TPALGLAGAD AIETGSQYYH YHWSHGADLI DDEGQPVVNS
DGAVEALSFY SDLHLEHGVT QSSPLSSTRQ DIRQLFESGS LGMVIAHVYT GINIDDSDAD
FDYGIAQVPE GPAGRYSLNT IDGVSIFAQT EVEDLARDLL RFYFDEDRHF EYASSKGFMP
TVEAVGERDY FQDSENWAPF IEAGQYARAR PKLSNFNEFN NRMVQAIQEA LGDQKSPQQA
LDDAQADLEE MMQ