Gene Hmuk_0541 details

Gene Information

Locus tag: Hmuk_0541
Symbol:
ID: 8410043
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halomicrobium mukohataei DSM 12286
Kingdom: Archaea
Replicon accession: NC_013202
Strand:
Start bp: 513139
End bp: 515037
Gene Length: 1899 bp
Protein Length: 632 aa
Translation table: 11
GC content: 63%
IMG OID: 645018867
Product: extracellular solute-binding protein family 5
Protein accession: YP_003176382
Protein GI: 257386609
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0747] ABC-type dipeptide transport system, periplasmic component
TIGRFAM ID:
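
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length counts the stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 513139
end_bp = 515037
gene_length_bp = 1899
protein_length_aa = 632

# Assumption: coordinates are inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# A CDS is a whole number of codons; the last codon is the stop codon,
# which is not translated into an amino acid.
codons, remainder = divmod(span, 3)
expected_protein_length = codons - 1

print(span == gene_length_bp)
print(remainder == 0)
print(expected_protein_length == protein_length_aa)
```

Under these assumptions the three fields agree: the span equals the stated gene length, it divides evenly into codons, and the codon count minus the stop codon equals the stated protein length.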


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value: 0.843572
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 35
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGTCACTTG CGGGCTGTTC GGATCAGCTC GGCGGTGGTG GTGACGGTGG CGACGGCGGC 
GACGGTGGTG TCGACCCCGT TCAGGATCGT GTCACGGTCG ATCCCGCCGA CATCGAAGAG
GGCGGGACGT TCAGAGCCGC GATCGGTGAG GGACCGGACT CGTTCGACTT CGCGTACAGT
AGCTCCGCTT CGGCTTCGAT CCTCCACAAC CTCATCTTCG AGGGGATGGT GACGACCGAC
GCGAGCGGCG AGATCTATCC GTGGCTCGCC GAGTCGTACG AACAGGTCGA CGTACAGGAC
GTATCGCCGG CCGACTACGC GGACTACATG ACCTCCGTCC CCTACACCGA GACCGAAGAC
GGCGCGATGG TCATCGACAC GGACGCACAG ATCGTCCTGG AACACCCGGA CAACGATCCC
GCGTCCGGCG ACGACGCCCG CGTTCTGACC GTCGAAGAGG CCGGCGACGC CGTCGCGGAT
GGCACCTACG GGATGCACTT CCGGTTCGAC CTCCACGAGG GTGTCACGTT CCACAACGGC
AACGAGATGA CCGCCGACAA CGTCGTCGAG TCCTACGAGC GCATCCGGAA CTCGACGCTG
TCGGGCCAGT ACTACGACTC GATGCTGGAC ATCCAGGCCG ACGGCGACTA CACCGTCCAC
CTCTACATTC AGGAACCCGA CGCGGCCGCG GTGCTGGAAC TCGGCGACGC GCCGATCTAC
CCCTCCGAGT CGGCGACGCT CCCGCCCGAG GCGATGGACC CCCGACAGGG GAACACGCCG
ATGGGGACCG GAATGTTCGA ACTGGACGAG TTCCAGGAAG GCGAGTACGT CGTGTTCACC
GCCTTCGACG ACTACTGGTT CGACACCGAG ATGAAAGACT GGTTCGAGGG CTCCTCGGAG
TTCCCGAACG GCCCGGTCGT CGACGAGGTC GACGTATCGT TCGTCTCGGA GGACGCTTCA
CGGTCCGCGG CCCTCCAGGA AGGCGAGATC GACATGAGCT ACGGGCTGAC TGCGAGCACG
CTCAACGACT ACCAGAACTC CGAAGACTTC CGGACGGCCC CGACCGACGG TGCCGGCTAC
ACGTTCCTCC AGCACCCCGT CACGGTCGAA CCCTTCACCG ACAAGCGGGT TCGACAGGCG
ATCAACCACC TCATCCCGCG TGAGAATATC GCCCAGAACA TCTTCTCCGG GTGGGAAAAT
CCGGCTTGGA CGCCGCTGCC GCCGGTCGCC GCCGGGGCCG GGACCGACGA CTACGAGCAG
CTCGTCGAGG ACGGCCGCGA GTACAACGAA TACGACCAAG AGCGAGCGGC GGAGCTCGTC
GAAGAGGCAA TCGAGGACAA TGGCTGGGAG ACCCCGATCG AGGTCCAGCT GGAGACGAAC
TCCGACAACG ACGACCGCGT CCGTACCGTC GAGCTGATCC AGGAAGCGCT CAATCGGTCG
GAGTACTTCG AGGCCTCTCT GGAGACCTAC GAGTTCCTCG ACTTCATCGG CCAGCTCCTC
AGCGAGGAGT ACTACGACGA CGGCAAGTTC GCTTTCATCG GGCTCTCGGG CGGCTTCAAC
CCACACGGCT ACGCGAAGTC CGTCCACTCA CAGGACAACT TCGCTCAGTG TTGTAACTTC
CAGAACATCA ACGACGACGA ACTGAGCCAG CTGTTGCGCG ACGCACGATA CGGCGTCGAC
GTGGCCCAGG ATCCCGAACT CAGACAGGAG CGGTACAACG CGGTCTGGGA ACGCGTCCTC
GAACTCAGCG CCAACTCCTA CGGTACGCAC AGCACGCTCG TCGGTGTCGT CGACGACACC
GTCGTCAACG GGTTCAACAC GTATCCGAGC ACGCAGGACA TCATCGGATA CGGCCTGTTC
GCTCCACAGG ACGAACAGAT TACGTACCTC AGCAGATAA
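
The GC content field can be recomputed from a sequence printed in this layout (blocks of ten bases separated by spaces). This is a minimal sketch; `gc_percent` is a hypothetical helper name, and the example input is only the first row of the gene sequence above, so its value is not expected to match the whole-gene figure.

```python
def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases, ignoring whitespace."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# First row of the gene sequence above (60 bases), used only as sample input.
first_row = "ATGTCACTTG CGGGCTGTTC GGATCAGCTC GGCGGTGGTG GTGACGGTGG CGACGGCGGC"
print(f"{gc_percent(first_row):.1f}% GC in the first 60 bases")
```

Passing the full gene sequence, with or without the spaces and line breaks, gives the value to compare with the GC content reported in the gene information.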
 
Protein sequence
MSLAGCSDQL GGGGDGGDGG DGGVDPVQDR VTVDPADIEE GGTFRAAIGE GPDSFDFAYS 
SSASASILHN LIFEGMVTTD ASGEIYPWLA ESYEQVDVQD VSPADYADYM TSVPYTETED
GAMVIDTDAQ IVLEHPDNDP ASGDDARVLT VEEAGDAVAD GTYGMHFRFD LHEGVTFHNG
NEMTADNVVE SYERIRNSTL SGQYYDSMLD IQADGDYTVH LYIQEPDAAA VLELGDAPIY
PSESATLPPE AMDPRQGNTP MGTGMFELDE FQEGEYVVFT AFDDYWFDTE MKDWFEGSSE
FPNGPVVDEV DVSFVSEDAS RSAALQEGEI DMSYGLTAST LNDYQNSEDF RTAPTDGAGY
TFLQHPVTVE PFTDKRVRQA INHLIPRENI AQNIFSGWEN PAWTPLPPVA AGAGTDDYEQ
LVEDGREYNE YDQERAAELV EEAIEDNGWE TPIEVQLETN SDNDDRVRTV ELIQEALNRS
EYFEASLETY EFLDFIGQLL SEEYYDDGKF AFIGLSGGFN PHGYAKSVHS QDNFAQCCNF
QNINDDELSQ LLRDARYGVD VAQDPELRQE RYNAVWERVL ELSANSYGTH STLVGVVDDT
VVNGFNTYPS TQDIIGYGLF APQDEQITYL SR
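
The protein sequence can be reproduced from the gene sequence by codon translation. The sketch below assumes that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two tables differ in which start codons are permitted), and it translates only the first row and the last nine bases of the gene as a spot check. `translate` and `CODON_TABLE` are illustrative names.

```python
# Codon table in the conventional T, C, A, G ordering; '*' marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(seq: str) -> str:
    """Translate DNA in frame 1, ignoring whitespace, stopping at a stop codon."""
    dna = "".join(seq.split()).upper()
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First row and final nine bases of the gene sequence above.
gene_start = "ATGTCACTTG CGGGCTGTTC GGATCAGCTC GGCGGTGGTG GTGACGGTGG CGACGGCGGC"
gene_end = "AGCAGATAA"

print(translate(gene_start))
print(translate(gene_end))
```

The two results correspond to the first twenty residues and the last two residues of the protein sequence listed above; translating the complete gene sequence the same way allows a full comparison.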