Gene Hmuk_1933 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHmuk_1933 
Symbol 
ID8411461 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalomicrobium mukohataei DSM 12286 
KingdomArchaea 
Replicon accessionNC_013202 
Strand
Start bp1841021 
End bp1842187 
Gene Length1167 bp 
Protein Length388 aa 
Translation table11 
GC content67% 
IMG OID645020264 
ProductABC transporter, periplasmic binding protein, thiB subfamily 
Protein accessionYP_003177753 
Protein GI257387980 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG4143] ABC-type thiamine transport system, periplasmic component 
TIGRFAM ID[TIGR01254] ABC transporter periplasmic binding protein, thiB subfamily 


Plasmid Coverage information

Num covering plasmid clones36 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones33 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGACGAC GGAACTTCCT CACGCGAACC GGAGCGGGCC TCGCAAGTCT GACTGCGCTG 
TCGGGCTGTA CCGGTGACGG CGGCGACGGG ACCGAGTCGG CGAGTACCGA GAGCGCGACC
GACACCGAGG TCACGGAGAG CGCGACCGAT ACCGAGAGCG TGACCGACGA GGGAACGACG
ACCGGAACCG TCGAAGAACT GAGCGGGACG CTGTCGGTCG CGACCTACTC GTCGTTCGTC
GGCGAAGACA CGGCCGGCAA CTGGCTCAAA TCCGAGTTCG AGTCCGAGCA TCCGGACGTG
ACCGTCGAGT TCGAAACGCC CGAGAACGGG CTCAACCAGT ACATCCAGCG CAAGTCCGAG
GGCGCGCCGA TCGACGCCGA TCTGTTCGTC GGGCTCAACA CGGGGGAACT CGTCCGGGCC
GACGAGCAAC TCGACGAGGC GCTGTTCGCG ACTGCCAGTG ACCGCATCGA AGGGGCGGAC
ACGGTCAAGC CGGAGCTCCA GTTCGATCCC GACGGACGAG TCGTGCCCTA CGACACCGGG
TACATCAGCC TCGTCTACGA CGAGGGCGAG GTCGACGCGC CGGGCACCTT CGACGCCCTG
CTCGAACCCG CCTACGAGGA CGCGCTGATC GCCCAGAACG CCCAGCAGTC CGACCCCGGT
CGCGCGTTCC TGCTGTGGAC GATCTACAAC AAAGACCCGG ACGGCTATCT GGACTACTGG
GAGGGGCTGG TCGACAACGG CGTCACGATC CTCTCGGACT GGGAGCCGGC GTACAACGCC
TACTCGGATG AGGAGGCCCC GATGGTCGTC TCGTACTCGA CCGACCAGGT GTTCTACCAC
GGCGAGGGCG TCGACATGTC GCGCCACCAG ATCGGCTTCC TGAACGATCA GGGCTACGCC
AACCCCGAGG GGATGGCCCA GTTCGCCGAC AGCGACGACG CCGAACTGGC CCGGGCGTTC
GCCTCGTTCG CACTGACAGC TCCGGCCCAG CGCGAGATCG CCACGCGAAA CGTCCAGTTC
CCGGCCGTCG AGGGCGTCGA CCCCGGCGGC GACTTCGGCG AGTACGCGCT GGAGCCCCCC
GAGCCGGTCA CCTTCACCTA CGACGAACTG TCGGGCAACG TGAGCGGCTG GATCGACGAG
TGGGCCCGAC AGATCGCGAG CAACTAG
 
Protein sequence
MRRRNFLTRT GAGLASLTAL SGCTGDGGDG TESASTESAT DTEVTESATD TESVTDEGTT 
TGTVEELSGT LSVATYSSFV GEDTAGNWLK SEFESEHPDV TVEFETPENG LNQYIQRKSE
GAPIDADLFV GLNTGELVRA DEQLDEALFA TASDRIEGAD TVKPELQFDP DGRVVPYDTG
YISLVYDEGE VDAPGTFDAL LEPAYEDALI AQNAQQSDPG RAFLLWTIYN KDPDGYLDYW
EGLVDNGVTI LSDWEPAYNA YSDEEAPMVV SYSTDQVFYH GEGVDMSRHQ IGFLNDQGYA
NPEGMAQFAD SDDAELARAF ASFALTAPAQ REIATRNVQF PAVEGVDPGG DFGEYALEPP
EPVTFTYDEL SGNVSGWIDE WARQIASN