Gene Hmuk_1189 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHmuk_1189 
Symbol 
ID8410709 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalomicrobium mukohataei DSM 12286 
KingdomArchaea 
Replicon accessionNC_013202 
Strand
Start bp1129939 
End bp1131804 
Gene Length1866 bp 
Protein Length621 aa 
Translation table11 
GC content64% 
IMG OID645019525 
Productextracellular solute-binding protein family 5 
Protein accessionYP_003177022 
Protein GI257387249 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence
[TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value0.105584 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones35 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAGACA AGTCCAACGA CTACAAAGAC GTTCTTAGCC GCCGCCGGTT CGTCGCGCTG 
ACAGGCGCGG CAGGTGCTGC TGCACTTGCC GGCTGTGACG GCTCGGAAGG TACGGATGAC
AGTACGCCGG CCGACGAGGG TGGGGACGAC GACACGTCGA CCGAAGACGA CGACATGTCG
ACCGAAGACG ACTCGATGGA AGTCGCCGAC GTGCGACACC GCAGCGGGAC GAGCCTCTCG
CCCGCGGACA TCCAGTTCAA CCCGTGGGGC CAGAACACCG CACAGATCTC GAACGAGCTC
ATCTTCGATC CGTTCGCGGA GTTCAACTAC GCTACTGGCG AGTACGTTCC CGCGATCATC
GAGGAGTGGG AGTACACGGG CGATACCTTC GAGATGGTCA TTCAGGAGGG TGCGACCTGG
CACGACGGCG AGCCCGTCAC GGCCCAGGAC CTGGCGACCT ATCTCCGACT CGACCGCGAG
TCGGGCAGTT CGATCTGGGA CTGGGGCAGC GACGTCGAGG AGATCGACGA CCGGACCGTC
GCCATCACTA TCGAGGGCGA CATCAACCCG TCGCTGATCG AGTTCTCCGT GATGGAGAAT
CGGCTCACCA CCAAGCACTC CCGCTACGGC GACGTTCTCT CGGAGACGCA GAACGCGGAC
GACAACACGC CGCTGACGGA GTTCGTCGAC GACGAGCCGA TCGGCAACGG GATCTTCCAG
TACGGCGAGG CCGACGAGCA GGTAATTCTC ACCGAGCGCC ACGCCGATCA CCCCAACGCC
GACAACGTCA ACTTCAAAGA GTACGCCTTC CAGTACTTCG ACGGCAACAC GGCGATCCAC
CAGGCGCTGC TCTCGGGGAA CATCGAGAGC ATGTTCGCCA TCTACACGCC GGGCAACGTG
GTCACCGACC TGCCTGACTC GATGAACGAG TACCGCACGC CACGGAACGG CGGCGTCGGG
ATTATCCCCA ATCACAATCA CGACCACCTC GGTCGACGCG AGGTCCGTCA GGCGATCGCC
TACGCCATGA ACCGAACGCA GGTCGCGGCG AACTCCGACC CGCGTACCAA GGTTGCCCCG
CGCATCCCCA CGGCACTGCC CAACGCCCAG CTCGAGAACT GGCTCGGCGA CTCCATGGAG
GACTTCGAGA CGTACGGTCG TGAGTCCAGT GAGGTAGAGA AGGCAGCGAG TGTGCTCAAG
GAGGCCGGCT ACAGCCGCAA CGGCGACGAC GTGTGGGAGG ACGAGGACGG CAACACCCTC
TCCTTCGAAC TCATCGCGCC CGGTGGCTGG TCCGACTGGG TCACTGCGAT GGAGTCCGTG
GCCGATCAGC TCAACGCCGC CGGCATGGAC GTGGAGTTCT CGACGGTGCC GTTCGGTGAC
CTCGGCGGAT CCGACGGCCG CTGGGCACAG GGGAACTTCG ACGCGACCGC CGAGTACTGG
ACTGCGGCGT TCGCGCGTGC CGCTCACCCG TACCACAACC TGCGCCACCA GATGGTCAAC
CCCAAGGCGA CGCTGCGCGA AAACGGCTAC GCCTATCCGG GTGCTGTCGA GGACCGCGGC
GGTTCCGAAG CCGACATCAC CGTTCCGGCA CTCGACGGCT CGGGCGAGCT GACGGTCAAC
CCGGTCGAGG ACGTCGGTAC GCTCGGTTCG ACCAGTGACA GCGACACTGA GGCCGAGCTC
GCGCTCGAAC TCCTCTGGGT CTCCAACCAG GATCTCCCGA TGATCCCGAT CCAGGAGGGG
CTGAACCAGA CGTTCATCTC CTCGAAGCGA TTCGATATTC CGGCCGAAGA CGCCGAAGTC
GCCCAAGTGC AGTACGCGAA CACCTGGCTC CCGCGCCAGG GCGAGATGAC CTACAACGGC
AACTAA
 
Protein sequence
MADKSNDYKD VLSRRRFVAL TGAAGAAALA GCDGSEGTDD STPADEGGDD DTSTEDDDMS 
TEDDSMEVAD VRHRSGTSLS PADIQFNPWG QNTAQISNEL IFDPFAEFNY ATGEYVPAII
EEWEYTGDTF EMVIQEGATW HDGEPVTAQD LATYLRLDRE SGSSIWDWGS DVEEIDDRTV
AITIEGDINP SLIEFSVMEN RLTTKHSRYG DVLSETQNAD DNTPLTEFVD DEPIGNGIFQ
YGEADEQVIL TERHADHPNA DNVNFKEYAF QYFDGNTAIH QALLSGNIES MFAIYTPGNV
VTDLPDSMNE YRTPRNGGVG IIPNHNHDHL GRREVRQAIA YAMNRTQVAA NSDPRTKVAP
RIPTALPNAQ LENWLGDSME DFETYGRESS EVEKAASVLK EAGYSRNGDD VWEDEDGNTL
SFELIAPGGW SDWVTAMESV ADQLNAAGMD VEFSTVPFGD LGGSDGRWAQ GNFDATAEYW
TAAFARAAHP YHNLRHQMVN PKATLRENGY AYPGAVEDRG GSEADITVPA LDGSGELTVN
PVEDVGTLGS TSDSDTEAEL ALELLWVSNQ DLPMIPIQEG LNQTFISSKR FDIPAEDAEV
AQVQYANTWL PRQGEMTYNG N