Gene Hmuk_0332 details

Gene Information

Locus tag: Hmuk_0332
Symbol:
ID: 8409830
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halomicrobium mukohataei DSM 12286
Kingdom: Archaea
Replicon accession: NC_013202
Strand:
Start bp: 326048
End bp: 327454
Gene Length: 1407 bp
Protein Length: 468 aa
Translation table: 11
GC content: 64%
IMG OID: 645018657
Product: extracellular solute-binding protein family 1
Protein accession: YP_003176176
Protein GI: 257386403
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1653] ABC-type sugar transport system, periplasmic component
TIGRFAM ID: [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence
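
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes 1-based, inclusive coordinates and a CDS that ends in a single stop codon; the function names are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


# Values taken from the record: Start bp 326048, End bp 327454.
length_bp = gene_length(326048, 327454)   # 1407
length_aa = protein_length(length_bp)     # 468
```

Both results match the Gene Length (1407 bp) and Protein Length (468 aa) fields listed in the record.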


Plasmid Coverage information

Num covering plasmid clones: 35
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 28
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGAAAC CAACTCGCAG AGACTACGTT AGAGGAGTCG GTGCAGCAAC AATTGTCGGA 
CTCGCAGGCT GTTCCGGAGA CGGCGGCGAC GGCGGTAGCG ACGGCGGTGA CGGCGGCAGC
GACGGCGCTT CGACCGGCGA CAGCGGCGGA AGCAGTGACG TGACGTTCGA CTTCTGGCAC
ATCCACGGTG ACGAGCTCGG CGAGACGCTG AGCCAGTTCG CTCAGGAGTT CTCCGAGGAG
ACAGACGGCG TGACGGTCAA CGCGGTCAAC AAGGACGGCT ACCGCCAGAA CCTCAACCAG
TCCCTGCAGG CCTCGCGAGC CGGTGACCCA CCGGGCGTCG CCCAGATCTT CGAGATCGGG
ACACAGCTGT GTCTCGACAG CGGAGCCTTC ACGCCCATCG AGCAGGTCAT CCCCGACGGT
GCGGTCGACT TCGACGATCT CCTCCCGTCG GTGTCGAGCT ACTACCGCAT CGACGACGAG
CTCAATTCGA TGCCGTTCAA CTCCTCGAAC ACGATCATGC TGTACAACAA GACGGCGTTC
GAGGAGGCGG GACTGGATCC GGAGGATCCG CCACGGAGCC TCTCCGGCGT GCGCAGCGCC
GCAGAGACGA TCGTCGACCA GACCGACATG GAGGCCGGCA TCTCCTGGCC GAACCACTCC
TGGATGCAGA TCGAACAGCA GTTCGCGAAG CAAGACCAGG TGCTGCTCAA CAAGGAGAAC
GGTCGCGCCG GTCGTGCCGA CAAGACCTAC TACAACAGCG AGGCCGGTCG GAACGTCTAC
GAGTGGTGGA AGGGCATGGC CGACGACGAC CTGTACCTGA ACCCGGGTAT CGAGGCGTGG
TCCGAGGCAC GACAGGCGTT CCTCACGGGC CAGGTCCCGA TGCTGTGGGA CTCCACGTCG
AACATGGTCT CGATGAAGGC CGGTGCCGAG GAGAACGGTT TCGAACTCGG GTCGGCGTAC
CTGCCCGCAC CGGACGGTGC CAACAACGGC GTCGTCATCG GCGGCGGCTC GCTGTGGGTG
CCCGACGCCC TCGCAGACGA GAAAAAAGAG GCCGCCGGCA AGTTCATCGC CTACATGGTC
CAGCCCGAAC AGCAGGCCCG CTGGCACCGC AACAGCGGGT ACTTCCCGGT GAGTCAGGGT
GCCGTCGACC AGCTCGAAAG CGACGGCTGG TTCGAGGAGA ACCCCGACTT CCGTACGGCC
TTCGATCAGT TGCAGGACAC CAAGGACTCG CCCGCCACGC GTGGCGCGGT CATGGGCGTC
TTCCCGGAAG CGCGGAGTAT CAACACGGAG ATCTCGGTCA GTATCCTCAA CGGTGACGTC
GGCGTCGAAG AGGGCCTCTC GGAGATGGAC ACCCAGGTCC AGGAGACCCT GTCGAGCTAC
TCCGGCAACT ACAGCGGAGA GGAGTAA
 
Protein sequence
MKKPTRRDYV RGVGAATIVG LAGCSGDGGD GGSDGGDGGS DGASTGDSGG SSDVTFDFWH 
IHGDELGETL SQFAQEFSEE TDGVTVNAVN KDGYRQNLNQ SLQASRAGDP PGVAQIFEIG
TQLCLDSGAF TPIEQVIPDG AVDFDDLLPS VSSYYRIDDE LNSMPFNSSN TIMLYNKTAF
EEAGLDPEDP PRSLSGVRSA AETIVDQTDM EAGISWPNHS WMQIEQQFAK QDQVLLNKEN
GRAGRADKTY YNSEAGRNVY EWWKGMADDD LYLNPGIEAW SEARQAFLTG QVPMLWDSTS
NMVSMKAGAE ENGFELGSAY LPAPDGANNG VVIGGGSLWV PDALADEKKE AAGKFIAYMV
QPEQQARWHR NSGYFPVSQG AVDQLESDGW FEENPDFRTA FDQLQDTKDS PATRGAVMGV
FPEARSINTE ISVSILNGDV GVEEGLSEMD TQVQETLSSY SGNYSGEE
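
The sequences above are laid out in space-separated blocks of 10 characters, so they need to be joined before any analysis. The following is a minimal sketch using only the Python standard library; `clean_blocks`, `gc_fraction` and `translate` are hypothetical helper names. It assumes an unambiguous uppercase A/C/G/T alphabet and relies on the fact that translation table 11 assigns the same amino acids to codons as the standard table (the two differ only in permitted start codons).

```python
BASES = "TCAG"
# Standard codon-to-amino-acid assignments, codons ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_blocks(text: str) -> str:
    """Join a block-formatted sequence by removing all whitespace."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons in frame 1, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three blocks of the gene sequence, copied from the record.
first_blocks = clean_blocks("ATGAAGAAAC CAACTCGCAG AGACTACGTT")
first_residues = translate(first_blocks)  # "MKKPTRRDYV"
```

The ten residues obtained from the first 30 bases agree with the first block of the protein sequence (MKKPTRRDYV). Applying `gc_fraction` to the full cleaned gene sequence would give a value to compare against the GC content field.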