Gene Hmuk_0217 details

Gene Information

Locus tag: Hmuk_0217
Symbol:
ID: 8409715
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halomicrobium mukohataei DSM 12286
Kingdom: Archaea
Replicon accession: NC_013202
Strand:
Start bp: 215308
End bp: 217074
Gene Length: 1767 bp
Protein Length: 588 aa
Translation table: 11
GC content: 65%
IMG OID: 645018542
Product: extracellular solute-binding protein family 5
Protein accession: YP_003176061
Protein GI: 257386288
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0747] ABC-type dipeptide transport system, periplasmic component
TIGRFAM ID: [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence
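
The coordinate and length fields in this record are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length; both are assumptions that happen to reproduce the values listed above. The function names are hypothetical helpers, not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS, assuming the final codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length = gene_length_bp(215308, 217074)
print(length)                     # 1767
print(protein_length_aa(length))  # 588
```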


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value: 0.745298
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 37
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCAGTCTG ACGATCCACT CGGACGACGC GACTTCCTCA GGGCGGCCGG TGCTGGTGCC 
GTCACGGTAA CGCTCGCCGG CTGTGCCGGC GACGACGGTG ACGACGAGAC GCCGACAGAG
GGCGGCGACG AACAGATGGA GACCGACGAG GCCGCGACCA CCGCCGAAGG TGACGACGGC
GGCGACGACA CTGCCGGCGA CCAGACGCTC GTCTACGCCC GTGGGGACCA CCCGGAGAAC
TACGACCCCC AGCAGACCAC GAGCGGCGAG GTCGCGAAGG TGACGAACCA GATCTTCGAC
ACGCTGATCC AGTTCGCCGC CGGCAGCGGG GGCGAACTGG AAGCCGGACT CGCGACCGAC
TACTCGCTGG AGGGGACGAC GGCGACGCTC ACGCTTCGGG AGGACGTGAC CTTCCACAGC
GGTGAGCCGT TCACCGCCGA GGACTTCGAG GCGACCTTCC GGCGCTTTAC CGATCCCGAA
TACGACTACT ACCTCGGCGA CGCCAACCGA TCCGGATACG GCCCCTTCAC GCTCGGCAAC
TGGATCGAAT CGGTCGACGC CAGCCAGGAC GGCGAACTGA CGATCGAACT GAGCCAGCGC
TACGCGCCCT TCCTGCGCAA CCTCGCGATG TTCGCGGCGG CGGTGCTCTC GAAGGCCCAG
ATCGAGAGCT TCGACGCGAG CCCGGACGCG CAGGTCGGGC TCGGCACCGA ACCGATCGGG
ACCGGCCCCT TCGCGTTCGA CCAGCTGGAC AACCCCAACG ACCGGATCCG CCTGACGGCC
AACGAGTCGT TCTGGGGTCC CGGTCCGAAC GTCGGCTCTG TCGTCTTCAA GACCATCTCC
GAGAACAGCA CGCGCGTTCA GGACGTGATC AACGGCGCGT CACACGTCAC CGACAACCTC
GACTCCGACG GCTTCCAGCG GGCCGACAGC AGCGACACGG CGACGCTGCT GCGCAAGAAC
GGAATCAACG TCGGCTACAT GGCGATGAAC ATGGAACGGA TGGAGCCGTT CCGGGATCGC
CGAGTCCGGC GTGCGGTCTC GCTCGCGGTC AACACCGAGG CCATCGTCAA CCAGATCTAC
CAGGGCTTTG CCACCCAGGC CTCCCAGCCG CTGCCGCCGG ACGTGCTGGG ACACAACGAC
GGTCTCGATC CCTACCCGAC GGACAAAGAC GAGGCCCGGT CGCTGCTGGA GGAAGCGGGC
TACGGCGACG GCTTCGAGTT CGAGCTAGCG ACGTTCTCGA ACCCGCGCGG TTACAACCCC
AGTCCGGTCC AGACGGCCAA CCAGGTCCGT TCCGATCTGC AGGACATCGG TCTCTCTGTC
GAGATCAACC AGTTCTCGGA CTTCGGCCCC TATCTCGATT ACACCGACCA GGGCCGCCAC
GACGCCTGCT TCCTCGGGTG GTACACAGAC AACGCCGATC CCGACAACTT CCTCTACGTC
CTGCTCGACC CACAGGTCCC GCTCGACGAC GTTCCGGACG GACAGGACTG GATCAGCTTC
GACACTGACG GGTACAACAC GCTGAACGTC TCGGCGTGGG CCAACACCGA GTACATGGAA
CTGGTCCGGG AGGCTCAGTC GACCTACGAC ACGAACGAGC GCGATACGAT GTACCAGGAG
GCCAACAAGC TCGCCCACGA CGAGGCTCCG TGGGTGTTCG TCGACTACGC CGAGACGCTT
CGAGCGATCA ACGAGGCCGT CGTCGAGGAC ACCTACACGG TGAGCTCCGT CGGCGGACCG
TACCTCAACA CCGTCGAACT GCAGTAA
 
Protein sequence
MQSDDPLGRR DFLRAAGAGA VTVTLAGCAG DDGDDETPTE GGDEQMETDE AATTAEGDDG 
GDDTAGDQTL VYARGDHPEN YDPQQTTSGE VAKVTNQIFD TLIQFAAGSG GELEAGLATD
YSLEGTTATL TLREDVTFHS GEPFTAEDFE ATFRRFTDPE YDYYLGDANR SGYGPFTLGN
WIESVDASQD GELTIELSQR YAPFLRNLAM FAAAVLSKAQ IESFDASPDA QVGLGTEPIG
TGPFAFDQLD NPNDRIRLTA NESFWGPGPN VGSVVFKTIS ENSTRVQDVI NGASHVTDNL
DSDGFQRADS SDTATLLRKN GINVGYMAMN MERMEPFRDR RVRRAVSLAV NTEAIVNQIY
QGFATQASQP LPPDVLGHND GLDPYPTDKD EARSLLEEAG YGDGFEFELA TFSNPRGYNP
SPVQTANQVR SDLQDIGLSV EINQFSDFGP YLDYTDQGRH DACFLGWYTD NADPDNFLYV
LLDPQVPLDD VPDGQDWISF DTDGYNTLNV SAWANTEYME LVREAQSTYD TNERDTMYQE
ANKLAHDEAP WVFVDYAETL RAINEAVVED TYTVSSVGGP YLNTVELQ
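
The gene and protein sequences above are printed in space-separated blocks of ten characters. As a minimal sketch of how the two relate, the following strips the block formatting, computes a GC fraction, and translates a coding sequence. It assumes the standard codon assignments, which translation table 11 shares with table 1 (the tables differ in which codons may act as start codons; this gene starts with ATG, so that difference does not matter here). `clean`, `gc_fraction`, and `translate` are hypothetical helper names, and only the first line of the gene sequence is used as input.

```python
# Standard codon assignments, enumerated in T, C, A, G order for each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq_text: str) -> str:
    """Remove the spaces and line breaks used for block formatting."""
    return "".join(seq_text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(cds: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(cds) - 2, 3):
        residue = CODON_TABLE[cds[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First line of the gene sequence in this record (60 bases, 20 codons).
first_line = "ATGCAGTCTG ACGATCCACT CGGACGACGC GACTTCCTCA GGGCGGCCGG TGCTGGTGCC"
cds_start = clean(first_line)
print(translate(cds_start))  # MQSDDPLGRRDFLRAAGAGA
print(f"{gc_fraction(cds_start):.2f}")
```

The translated fragment matches the first 20 residues of the protein sequence listed above. Passing the full cleaned gene sequence to the same functions would be the way to compare against the Protein Length and GC content fields.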