Gene Hmuk_2574 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHmuk_2574 
Symbol 
ID8412120 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalomicrobium mukohataei DSM 12286 
KingdomArchaea 
Replicon accessionNC_013202 
Strand
Start bp2477639 
End bp2478826 
Gene Length1188 bp 
Protein Length395 aa 
Translation table11 
GC content65% 
IMG OID645020915 
Productextracellular solute-binding protein family 1 
Protein accessionYP_003178387 
Protein GI257388614 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones38 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value0.577743 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCAGCG GACTGCAGCA GTCACGTGCT GGTTCTTTCG AGCTCGTCCA TCACTACGTC 
GGGGGCGACG GAGAGGCCGC ACTGAGAGCG CTGATGGAGG GATTCGCCGA CAGCCACAGC
TCGATCTCGG TTCAGGAGAC CCACTACGAC AACATGCGGC TCCAGGTCAA GAGCCGCATT
CTGGGTGAAG ATCCGCCGGA CGTCTGGACC GGGTGGCCCG GCGGCGAGAT GGCCGGCTAC
GCGGAAGTCG ACGCGGTCAA AGACATCACC GACCTCTGGG AGGCCTCGGG GATGGCCGCG
AACTTCCGGT CGGTCGCGGC GGACGTGTCA CAGGTCGACG GTCGGTACCA CGCGGTCCCG
ATCACCATCC ACCGGGCAAA CGACATGTAT CTCCACACCG AGACCGTCGA GGCGCTCGGG
ATCGACCCCG CGCGAGCGTC GGACCCGACC GAACTCGTCG AGATGCTGGA AGCGGTCGAC
GACGACCACG ACGGACCGTC GTTCTTGCTC CCGATGGCCG ACCCGTTCAC CGTCCTGCAG
CTGTGGGAGA TCACGCTCCT GGGACTGTCC GATCACGCCA CGTTCGAGGC GATGACCGCG
GGCAACGCGT CGCGTCACCG CGACGTGATC GTGAGTGCGC TCGAACACAT CCAGCGTTTC
TCGGCGCTGT CCAGCGAGGA CTCGCTGTAT CACGGGATGA CCGACGCGAA CGAGCAGTTC
ATCGACGGAG CCGCCCCGGT CTACCCGCAG GGCGACTGGG CGGCCGGCGT GTTCGACGAG
ACGCCCGAGT TCGACTTCCA GTCCGAGTGG GACCGCGTCG CGTTCCCCGG TACGGAGGAC
ATGTTCGCCG TCGTCATGGA TTCGTTCATC CCGTCGTCGA AATCGGACAG CGACGCGCTC
GAGACGTTCC TCGAATACGT TGGCTCGTGT GACGCACAGG AGCGGTTCAG CCGGAAGAAG
GGGTCGCTGC CGGCACGGAA AGACGCCTCG ATCGACGGGT TCACCGAGTT CGGGAGATCA
CAGTACCGGC AGCTCGATCG CTCGGCCGAA CAGCCCCAGA TAATCACGCA CGGCCTGAGC
GTCTCCTCGG CACAGCTGGT CGACCTGAAG TCCGCGATCG CGGGCTTCAT CGACGAGTGG
GACGCACAGG CGACGGCCGA CGAGATGATC GGCGTCTTCG ATCGCTGA
 
Protein sequence
MSSGLQQSRA GSFELVHHYV GGDGEAALRA LMEGFADSHS SISVQETHYD NMRLQVKSRI 
LGEDPPDVWT GWPGGEMAGY AEVDAVKDIT DLWEASGMAA NFRSVAADVS QVDGRYHAVP
ITIHRANDMY LHTETVEALG IDPARASDPT ELVEMLEAVD DDHDGPSFLL PMADPFTVLQ
LWEITLLGLS DHATFEAMTA GNASRHRDVI VSALEHIQRF SALSSEDSLY HGMTDANEQF
IDGAAPVYPQ GDWAAGVFDE TPEFDFQSEW DRVAFPGTED MFAVVMDSFI PSSKSDSDAL
ETFLEYVGSC DAQERFSRKK GSLPARKDAS IDGFTEFGRS QYRQLDRSAE QPQIITHGLS
VSSAQLVDLK SAIAGFIDEW DAQATADEMI GVFDR