Gene Hmuk_0218 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHmuk_0218 
Symbol 
ID8409716 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalomicrobium mukohataei DSM 12286 
KingdomArchaea 
Replicon accessionNC_013202 
Strand
Start bp217175 
End bp219892 
Gene Length2718 bp 
Protein Length905 aa 
Translation table11 
GC content68% 
IMG OID645018543 
Productoligopeptide/dipeptide ABC transporter, ATPase subunit 
Protein accessionYP_003176062 
Protein GI257386289 
COG category[E] Amino acid transport and metabolism 
COG ID[COG4608] ABC-type oligopeptide transport system, ATPase component 
TIGRFAM ID[TIGR01727] oligopeptide/dipeptide ABC transporter, ATP-binding protein, C-terminal domain
[TIGR02323] phosphonate C-P lyase system protein PhnK 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.0707728 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones37 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCGACC TCCTCACGCT CTCGAATCTC CAGACCCACT TCGAGACCGA ACGGGGCACG 
GTACACGCCG TCGACGGGAT CGATCTGTCC GTCCGTGCGG GCGAAACGCT CGGACTGGTC
GGTGAGTCCG GGTCGGGCAA GTCCGTGACG GCGCTGTCGG CCATGCGGAT CGTCGACGAG
CCCGGCTTTC TCGCCGGTGG CGACGTGCAC TTCGGTGCGG TCGAGACAGT CGACCGTCTC
GCACGACAGT ATCCCCGCGG CGTCGCGACG GCCGACCACG ACGGCTATCT CCACCTCGAC
GCGGTCGATC TCGATCGGTC GCGACTCCCC GACGGCGTCG ATCGATCGCT CGACGACGAC
GAACTGGCCC GGCGGTTCGT CCGGAGCGAG CCGGCGACAG CACTCGGGCC GGTCGGTTCG
ACCGGAGAGC ACGAGACCGA CCGCGGAACG ACGCCCTCCG GGACTGGATC TGACTCCGCC
CAGCTGCGGA CGGCTCCGGT CACGATCCGG GACGGCTACG TCGATCTCCG GGCCGCCCCC
GAGCGGGTGC TCCGCGACGT GCGGGGCGGG GACATGGGCA TGATCTTTCA GGATCCGATG
ACCTCGCTCA ATCCGGCGCT GACGGTCGGC CAGCAGATCG CGGAGAGCCT GTTGCTACAC
CGGTACGGCC GACAGCGATC GGATTCCTGG GTCAACGCGC TCCGCGAGAT CCTCCCCGTC
GTCGGTGGTA ACGCCGTGAC GGGACGGGTC CGCGAGGACG TGCTCGAACT GCTCGATGCC
GTCGGGATTC CGGAGCCGAC GACTCGAATC GACGAGTACC CCTACGAGTT CTCCGGCGGG
ATGCGCCAGC GGGTGCTGAT CGCGATCGCG CTGGCCTGTC GTCCGAAGCT CCTGATCGCC
GACGAGCCGA CGACGGCACT GGACGTGACG ATCCAGGCCC AGATTCTGGA TCTGATCGAC
GACCTCCAGG CGGAGCTCGG GATGGCGGTG CTCTTTATCA CCCACGACCT CGGCGTCGTC
GCCGAGACCT GCGACCGCGT GGCGGTGATG TACGCCGGAG AGATCGTCGA GGAAGGGCCA
GTCGGGGAGA TCTTCCACAA TCCGTCACAC CCCTACACCT ACACGCTGCT GGAGTCGATC
CCCCGTGCGG ACACAGAGCG ACTCACTCCC ATCGGTGGGT CGGTGCCGAG CCTGATCGAC
ATGCCCGAGG GGTGTCACTT CGCGCCGCGG TGTCCGTGGG CCACCGACGA CTGCCGCCGG
GGCGAGATCC CGTATCTCCA GCACGGGCCC GAGGGCGTCG ACCACCGCTC GAAGTGCATC
TTCGAGTCGT TCGACGCCGA CGCTTACGGC GGCGACGCGG ACGGCGTGGC CGCGAGCGAG
ACGACGCGGA CGGACCGCAC GCTCGTCGAG ATCGACGGCC TGAAAAAGCA CTTCTCGCGG
GCGGAGGATC TGTTCGACAA GTATCTCGGG CGCGTCCCCG ACGCCGTGCG GGCCGTCGAC
GGCGTCTCGC TGGACGTCTA CGAGGGGGAG ACCCTGGGGT TGGTCGGCGA ATCGGGCTGT
GGGAAGTCGA CGACTGGCCG GACGATACTC CGGCTGCTGG AGCCGACCGA CGGCACCGTG
CTGTTCGCCG GGGACGATCT GGCCGGCCTC GACTCGGACG CGCTGCGCGG GAAGCGCCGA
GACATGCAGA TGATCTTTCA GGACCCGCTG TCGAGTCTCG ACCCCCGGAT GACTGTCCGA
CAGACGATCA CGGAGCCGCT CCAGATCCAC GACTTGCCCG ACGCGGACGA CGAGCGGTCG
AAGCGACAAC AGCGCCGCGA GCGCGTCGAG GAACTGGTCG CGGCCGTCGG ACTCGACGTG
GCACAGCTCG ACCGGTATCC CCACGAGCTG TCGGGCGGCC AGCGCCAGCG CGTCGGCATC
GCACGCGCGC TCGCGGTCGA TCCCGACTTC ATCGTCTGTG ACGAGCCCGT CTCGGCGCTG
GACGTGTCCG TCCAGGCCCA GATCATCAAT CTCCTGGAGG ATCTGCAAGG CGAGTTCGGC
CTCACCTATC TCTTTATCGC CCACGATCTC TCCGTGGTTC GTCACATCTG TGACCGGATC
GCCGTGATGT ACCTCGGCGA GATCGTCGAG GTCGCCGACA CGCCGGCCCT GTTCGACGAC
CCCAAACACC CGTACACGAA GTCGCTGCTG TCTGCGATCC CGGTGGCCGA TCCCGACGCC
GACTCGGATC GCGTCATTCT GAAGGGGGAC GTGCCCAGCC CGATCGACCC GCCCAGTGGC
TGCCGGTTCC ACACGCGCTG TCCGTCGGTC ATCCCGCCCG AGGACATCGA GATCGAGCAG
GCGACGTTCC GCGAGGTGAT GGACTACCGC CAGCGCGTCG AAAACGAACG GATCGACCTC
GACGCGGCGT GGGACGCGGC CGCGGGCGAG ACGAGCGCGA CCGTCGCGGA CGGGGGGCGG
CCCCACGAGC GAGCGTCTCG CTCGGCGTTC AAATCCGCGC TGTTCGAGGA TCTGTTCGAG
CACCCGCCGA CGGGCCGGAA CCGCGAGGTC GTCGCCGAGT CGTTCGAGCA CCTCGCGACA
GAGGACTGGG ACGGGGCCGA GACGGTCCTG CGTGACCGCT TCGAGAGCGT CTGCGAGCGG
TCCCATCCCG AACTCGGCGA CCGCGCCCAT CCGGCGGCCT GCCACCTGGT CGAGAACGGC
GACATCGACC GATCGTGA
 
Protein sequence
MRDLLTLSNL QTHFETERGT VHAVDGIDLS VRAGETLGLV GESGSGKSVT ALSAMRIVDE 
PGFLAGGDVH FGAVETVDRL ARQYPRGVAT ADHDGYLHLD AVDLDRSRLP DGVDRSLDDD
ELARRFVRSE PATALGPVGS TGEHETDRGT TPSGTGSDSA QLRTAPVTIR DGYVDLRAAP
ERVLRDVRGG DMGMIFQDPM TSLNPALTVG QQIAESLLLH RYGRQRSDSW VNALREILPV
VGGNAVTGRV REDVLELLDA VGIPEPTTRI DEYPYEFSGG MRQRVLIAIA LACRPKLLIA
DEPTTALDVT IQAQILDLID DLQAELGMAV LFITHDLGVV AETCDRVAVM YAGEIVEEGP
VGEIFHNPSH PYTYTLLESI PRADTERLTP IGGSVPSLID MPEGCHFAPR CPWATDDCRR
GEIPYLQHGP EGVDHRSKCI FESFDADAYG GDADGVAASE TTRTDRTLVE IDGLKKHFSR
AEDLFDKYLG RVPDAVRAVD GVSLDVYEGE TLGLVGESGC GKSTTGRTIL RLLEPTDGTV
LFAGDDLAGL DSDALRGKRR DMQMIFQDPL SSLDPRMTVR QTITEPLQIH DLPDADDERS
KRQQRRERVE ELVAAVGLDV AQLDRYPHEL SGGQRQRVGI ARALAVDPDF IVCDEPVSAL
DVSVQAQIIN LLEDLQGEFG LTYLFIAHDL SVVRHICDRI AVMYLGEIVE VADTPALFDD
PKHPYTKSLL SAIPVADPDA DSDRVILKGD VPSPIDPPSG CRFHTRCPSV IPPEDIEIEQ
ATFREVMDYR QRVENERIDL DAAWDAAAGE TSATVADGGR PHERASRSAF KSALFEDLFE
HPPTGRNREV VAESFEHLAT EDWDGAETVL RDRFESVCER SHPELGDRAH PAACHLVENG
DIDRS