Gene Hmuk_2264 details


Gene Information

Locus tag: Hmuk_2264
Symbol:
ID: 8411805
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halomicrobium mukohataei DSM 12286
Kingdom: Archaea
Replicon accession: NC_013202
Strand:
Start bp: 2185164
End bp: 2186708
Gene Length: 1545 bp
Protein Length: 514 aa
Translation table: 11
GC content: 67%
IMG OID: 645020607
Product: extracellular solute-binding protein family 1
Protein accession: YP_003178083
Protein GI: 257388310
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1653] ABC-type sugar transport system, periplasmic component
TIGRFAM ID:
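
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record. It assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon; neither convention is stated on this page. The function names `gene_length` and `protein_length` are hypothetical.

```python
# Values copied from the Gene Information table.
start_bp = 2185164
end_bp = 2186708


def gene_length(start, end):
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len):
    """Encoded amino acids, assuming the last codon is a stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length = gene_length(start_bp, end_bp)
print(length)                  # 1545
print(protein_length(length))  # 514
```

Under these assumptions both results agree with the Gene Length and Protein Length fields.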


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value: 0.337461
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 31
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCAAATA ATCATTCGGA CGACGACGGA CAGCGAAGCA GTGTCTCTCG ACGACGGTTC 
GTCGCAGCCA GCGGTGCATC GGGCGTCGCG GCGACACTCG CCGGCTGCGC AGATCTGGTC
GGTGGCGGGG ACGGCGGCGA CGGTGGCAGC GACGGCGGCG ACGGTACCGG AGCGACGACG
GGCGAGGGCG GGAGCGACGG CGCGACGACG ATCCAGTGGG GCATCGGGCC GACCGCCGTC
CAGACGGCCG GCGAGGAGAT GAAGACGGCA CTCCACGAAG CGGGTGGCCT CAGAGACGAC
ATCGAGATCG AGTGGGTCCC CAGCGCCTCG GACACGGGCG AGGTCCGCTC GAACTACAAC
CGCATCCTCA ACGCGGACCA GAGCGACCCC GATATCTTCC AGATGGACAA CGGCTGGGTG
AACATCTTCA TCCAGCGCGG GCTGATCCAG AACCTCTCGG AGACGCTCCC GGAGGATCTC
CTCTCGGACA TCAACGAGAA CTACTTCAGC GGGTTCACCG ACACGGCCCG AGACCCGTCG
TCGGGTGACC TCTACGGCGT GCCGCTGTTC CCCGACTTCC CGACGATGCA GTACCGCAAG
GACCTCGTCG AACAAGCGGG CTACGACCCG GAAAGCGAGA ACTGGGCGAC CGAGCCGATG
ACGTGGGAGG AGTGGTCCCA CGTCGTCGCC GACGTGAAGG ACAACGCCGA CGCCGAGTAC
GGCTTTACGA CCCAGTGGGA CATCTACGAG GGCACCGCCT GCTGTACGTT CAACGAGGTC
ATGTCATCGT GGGGCGGCGC GTACTTCGGC GGCCGCGAGA ACCTCTTCGG TCCGGTCGGC
GAGCGTCCGA TCACGGTCGA CGAGCCCGAG GTCCACAACG CGCTCAACAT GATGCGGAAG
TTCGTCCACG ACGAGGAGTT CGACGGGACC TTCGCGGACT ACGCCGGCAA CATCGCGCCG
ACGGACATCC TCGGCTGGAT CGAGGAGCCC TCTCGCTCGC CCTTCGCCGA GGGCGACGCA
GTCTTCCACC GCAACTGGCC GTACTCGCTC GCGCTGACCG GCCGGAACCC CGAGGAGACC
GACGATCCCG CACTCGGCGA GGACCTGGGT GCGATGCCGA TCCCCTACGC CGTCTCCGAG
AGCGAGGCCG CCCAGCCCGG AACCGGTGGC ACGACCTCGG CGCTCGGTGG CTGGCACCTC
ACGTTCAACC CCAACAGCGA CAACCTCGAC GTCATCGACG AGGTCGTCTC GGCCGTCATG
GAGCCGGACT TCGCCCTCGA ACTGTTCCGA CTGCAGGGGT GGCTCCCGCC GCGTCCCGAG
CTGTTCAACT CCGACGAGGC CCGGAACGTC GCACCGGTCG GGCGCTACAT GGACACGCTC
CAGGTGGCCG GTGAGAACGC GATGGCACGG CCGGTCACGC CGGTCTGGAG CCAGCAGTCC
AGCGACATCG CCCAGTCGGC CAACAGGGTC GTCGGCCAGG AAACCTCGGC CGAGGACGCG
ATGGCTTCGC TGACCTCCAG CCTCGAAGCC ACCGAACAGA ACTGA
 
Protein sequence
MSNNHSDDDG QRSSVSRRRF VAASGASGVA ATLAGCADLV GGGDGGDGGS DGGDGTGATT 
GEGGSDGATT IQWGIGPTAV QTAGEEMKTA LHEAGGLRDD IEIEWVPSAS DTGEVRSNYN
RILNADQSDP DIFQMDNGWV NIFIQRGLIQ NLSETLPEDL LSDINENYFS GFTDTARDPS
SGDLYGVPLF PDFPTMQYRK DLVEQAGYDP ESENWATEPM TWEEWSHVVA DVKDNADAEY
GFTTQWDIYE GTACCTFNEV MSSWGGAYFG GRENLFGPVG ERPITVDEPE VHNALNMMRK
FVHDEEFDGT FADYAGNIAP TDILGWIEEP SRSPFAEGDA VFHRNWPYSL ALTGRNPEET
DDPALGEDLG AMPIPYAVSE SEAAQPGTGG TTSALGGWHL TFNPNSDNLD VIDEVVSAVM
EPDFALELFR LQGWLPPRPE LFNSDEARNV APVGRYMDTL QVAGENAMAR PVTPVWSQQS
SDIAQSANRV VGQETSAEDA MASLTSSLEA TEQN
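
Both sequences are printed in space-separated blocks of ten characters, so they need to be joined before any length or composition check. The sketch below shows one way to do that and to compute GC content. It assumes GC content means the percentage of G and C bases across the gene sequence; the page does not say how its 67% figure was derived. The helper names `clean_sequence` and `gc_percent` are hypothetical.

```python
def clean_sequence(text):
    """Join a block-formatted sequence into one uppercase string."""
    return "".join(text.split()).upper()


def gc_percent(dna):
    """Percentage of G and C bases in a DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    gc = sum(1 for base in dna if base in "GC")
    return 100.0 * gc / len(dna)


# First line of the gene sequence listed above, used only as a short demo.
first_line = "ATGTCAAATA ATCATTCGGA CGACGACGGA CAGCGAAGCA GTGTCTCTCG ACGACGGTTC"
seq = clean_sequence(first_line)
print(len(seq))  # 60
```

Passing the full gene sequence through `clean_sequence` and then `gc_percent` gives a value to compare against the GC content field.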