Gene Hmuk_0206 details

Gene Information

Locus tag: Hmuk_0206
Symbol:
ID: 8409704
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halomicrobium mukohataei DSM 12286
Kingdom: Archaea
Replicon accession: NC_013202
Strand:
Start bp: 202032
End bp: 203333
Gene Length: 1302 bp
Protein Length: 433 aa
Translation table: 11
GC content: 60%
IMG OID: 645018531
Product: extracellular solute-binding protein family 1
Protein accession: YP_003176050
Protein GI: 257386277
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2182] Maltose-binding periplasmic proteins/domains
TIGRFAM ID:
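
The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes 1-based inclusive coordinates and that the coding sequence ends in a single stop codon that is not counted in the protein length; the function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


length = gene_length(202032, 203333)
print(length, protein_length(length))  # prints "1302 433"
```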


Plasmid Coverage information

Num covering plasmid clones: 28
Plasmid unclonability p-value: 0.568957
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 37
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTACGACT ATTTCATCAA TCGTATATCC ATGGGATTCG ATCGGAGAAC ACTGTTGAAG 
CACATTGGTG CAACGGGAAC GATCGCAGCA GTCGGCGGCT GTGTTGGCGT CGAGGAACAG
GGAACAGATG CTGATGCCGG CGGGGGCGAG AGCAGCAACG GTACCGACAG TACAACAGAG
GAGCCCGCCG GATCGGCAAC GGCCTGGTAC GGCCTCTCCG ACACGGAGCT CGAACTCCGC
GAAGACATCA TCGCGGCATT CAACGAGGAG TCCAGCCACA CGATCAAGGG AGGGAATATC
GCCGAGATGC AAGACCGGAC GACGAGCGCG ATCCCTGCCG GACAGGGACC GGAAACGTTC
CAGTGGGCCC ACGACTGGGT CGGTGATTAC TACGAACGCG GGTTCGTCGT CGACCAGAGT
GACGAGCTGT CCGTCGACCT CGACCAGTTC ACCAGTGCAG CGGCTGGTGC CGTCCAGACC
GACGATGCGA TCGTCGGGCT CCCCTTTTCG GCGGAGACGG TGACGCTAAT CTACAACGCA
GACATCGTGG ACAAACCACC GGAGACGTTC GAGGAAATGG CAGCAAGTAT GGAAGCGTAC
CACGATTCGG CCAACGGGAA GTACGGGCTA GCCATGCCGT TCAACCCCTA CTTTATCAGC
GGGATCGCAC AGGCGTTTGG CGGCCGCTAC TTCGATCCCG AAAGCGACCC AGTGGTTGGT
CTCGATTCTG AGGAGACGGT CCGTGGATTC GAGTTTATGC TCGACAATCT CGTCCCATAT
ATGCCGAACG ACCCAGGCTT CGAACCCCAG CAGGCAACGT TCGCAGAGGG CAACGCGGCC
TTCGCAGTCA ACGGTCCGTG GTATCTTGCC ACACTCAACG ACAGCGACAT CAACTACGAG
GTGACGACCT TCCCGTCGAT GGACGGCGGT GAGTTCACTC CACTGAGCGG GATCAAGATG
TGGTACTTCT CGAAGGCAAT GGAAGAGGGA GATGTCGACG CGACGGCAGG ACGCGAGTTC
ATCGAGTGGT TCGTGACCAA CGAGGACCAC CTACTCACCA GAGCCGAAGA ACAGGGCCAC
ATTCCAGTCC TCTCGTCGCT CGCCGGCAGC GACGATCTCC CCGGCCCAGT CCGGGCCTAC
TCGGAGGCCG TCGATCAGGG TATCCCGATG CCGACGGATC CTCGTATGAG CGACGTGTTC
GCAGCGCTGG AGGAACCAGT CGTCCAGATT TTCAACGGAA GTCAGAGCCC AGCACAAGCA
CTTGCCGGGG CCGCCGACGA GGCTCGAAGT AACTGGGAGT AA
 
Protein sequence
MYDYFINRIS MGFDRRTLLK HIGATGTIAA VGGCVGVEEQ GTDADAGGGE SSNGTDSTTE 
EPAGSATAWY GLSDTELELR EDIIAAFNEE SSHTIKGGNI AEMQDRTTSA IPAGQGPETF
QWAHDWVGDY YERGFVVDQS DELSVDLDQF TSAAAGAVQT DDAIVGLPFS AETVTLIYNA
DIVDKPPETF EEMAASMEAY HDSANGKYGL AMPFNPYFIS GIAQAFGGRY FDPESDPVVG
LDSEETVRGF EFMLDNLVPY MPNDPGFEPQ QATFAEGNAA FAVNGPWYLA TLNDSDINYE
VTTFPSMDGG EFTPLSGIKM WYFSKAMEEG DVDATAGREF IEWFVTNEDH LLTRAEEQGH
IPVLSSLAGS DDLPGPVRAY SEAVDQGIPM PTDPRMSDVF AALEEPVVQI FNGSQSPAQA
LAGAADEARS NWE
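
The sequences above are printed in space-separated blocks of ten characters. The following is a minimal sketch of how such a listing might be parsed, its GC fraction computed, and the gene sequence translated. The helper names (`clean_sequence`, `gc_fraction`, `translate`) are hypothetical. The codon table is the standard one in NCBI ordering; translation table 11 uses the same amino-acid assignments and differs only in permitted start codons, which this sketch does not handle.

```python
from itertools import product

# Standard codon table in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(raw: str) -> str:
    """Remove the block spacing and line breaks, and upper-case the result."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
fragment = clean_sequence("ATGTACGACT ATTTCATCAA TCGTATATCC")
print(translate(fragment))  # prints "MYDYFINRIS"
```

Passing the full gene sequence through `clean_sequence` and `translate` should give a string that can be compared against the listed protein sequence after it has been cleaned the same way.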