Gene Hmuk_0046 details


Gene Information

Locus tag: Hmuk_0046
Symbol:
ID: 8409543
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halomicrobium mukohataei DSM 12286
Kingdom: Archaea
Replicon accession: NC_013202
Strand:
Start bp: 41791
End bp: 42930
Gene Length: 1140 bp
Protein Length: 379 aa
Translation table: 11
GC content: 67%
IMG OID: 645018384
Product: oligopeptide/dipeptide ABC transporter, ATPase subunit
Protein accession: YP_003175904
Protein GI: 257386131
COG category: [E] Amino acid transport and metabolism
COG ID: [COG4608] ABC-type oligopeptide transport system, ATPase component
TIGRFAM ID: [TIGR01727] oligopeptide/dipeptide ABC transporter, ATP-binding protein, C-terminal domain
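
The start, end, gene length, and protein length listed above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length. The function name `cds_lengths` is hypothetical and not part of any database API.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and a terminal stop codon
    that does not contribute a residue to the protein.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1  # drop the stop codon
    return gene_len, protein_len


# Coordinates taken from the record above.
print(cds_lengths(41791, 42930))  # (1140, 379)
```

Under these assumptions the result agrees with the listed gene length of 1140 bp and protein length of 379 aa.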


Plasmid Coverage information

Num covering plasmid clones: 28
Plasmid unclonability p-value: 0.47667
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 29
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAGCGACC GCGCCACGAC GACGGGCACT ACCCGGGACG ACGATGTCGT GATGTCGCTG 
GAGAACGTCT CGGTCGACTT CGAGAAAGAA CAGGGCGTCC TCGAGTCGCT GTTCGACGAG
CCAGAGACCG TCCAGGCGGT CAGTGACGTG TCGATCGACA TCTCCGAGAA CGACGTGCTC
GCACTCGTCG GGGAGTCTGG CTGCGGGAAG ACGACGCTTG GCAAGACGAT CATCGGCGTC
CAGCGCCCCA CCGAGGGGAC CGTCTCCTAC CGGGGACAGG ACGTGTGGGA CGCCAAGGAC
GGCCGCGGCG ACGTGACCGT CCCCTTCGAC GACATCCGTC GGTCGCTTCA GATGATCCAC
CAGGACCCCG GCGCGGCGCT CAACCCCAAC CGGAAGGTCC TGACGACGCT GGAAGCGCCT
CTGAAGAAGT GGGACCCCGA GATGTCCACC GAGGACCGCC GGGCGCGGAT CTTCGCGCTG
CTGGACCGGG TGGGCATGGA GCCGCCCGAA GACTACGCGC ATCGGTTCCC CCACCAGCTC
TCTGGGGGCG AACAGCAGCG GATCGCGCTG GTCCGGGCCC TGCTGATGAA TCCGGACGTG
ATCCTCGCCG ACGAGGCCGT CTCGGCGCTG GACGTGTCGC TGCGCGTCGA GACGATGAAC
CTCCTCCTGG AGCTGCAAGA GCAGTTCAAC ACCTCGTTCG TGTTCATCAG TCACAACCTC
TCGAACGCCC GCTATCTGGC ACAGGAAGCG GGCGGACGCA TCGGCATCAT GTACCTCGGG
GAGATCGTCG AGATCGGCCC GCCCGACGAG GTCCTGAACG ACCCCCAGCA CCCCTACACG
AAGGTGCTGC GCTGGGCGAC CGCCGATCTG GATCCGACCG CCCAGGAGAT GACCGATCCG
CCGGTCCGCT CGATCGACAT CCCGGACCCG GTGAATCCGC CGTCGGGCTG TCGGTTCCAC
ACCCGCTGTC CGGAGGCTCG GGAGGTCTGT ACCACCACGG CTCCGGAACT TGGCGAGGAG
GCGGCGACGG CGAGCGAACG CTGTGCCGCC TGCCACCGCA CCGATCCCGA CCACGAGTAC
TGGGAGAGCG AACCCCTCGA CGGCGTCGAA GCCGCCGAGT CGCCGACACT GAACGACTGA
 
Protein sequence
MSDRATTTGT TRDDDVVMSL ENVSVDFEKE QGVLESLFDE PETVQAVSDV SIDISENDVL 
ALVGESGCGK TTLGKTIIGV QRPTEGTVSY RGQDVWDAKD GRGDVTVPFD DIRRSLQMIH
QDPGAALNPN RKVLTTLEAP LKKWDPEMST EDRRARIFAL LDRVGMEPPE DYAHRFPHQL
SGGEQQRIAL VRALLMNPDV ILADEAVSAL DVSLRVETMN LLLELQEQFN TSFVFISHNL
SNARYLAQEA GGRIGIMYLG EIVEIGPPDE VLNDPQHPYT KVLRWATADL DPTAQEMTDP
PVRSIDIPDP VNPPSGCRFH TRCPEAREVC TTTAPELGEE AATASERCAA CHRTDPDHEY
WESEPLDGVE AAESPTLND
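
The protein sequence can be related to the gene sequence by translating it with translation table 11, as listed in the gene information. The following is a minimal sketch, assuming the usual reading of table 11: the standard codon assignments, with alternative start codons such as GTG read as methionine at the first position. The names `CODON_TABLE`, `START_CODONS`, and `translate` are illustrative only, and the sketch handles plain A/C/G/T input without ambiguity codes.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G);
# translation table 11 uses the same assignments for elongation.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Initiation codons assumed for table 11; each is read as M at position 1.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate(dna: str) -> str:
    """Translate a CDS, ignoring whitespace and stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3  # ignore a trailing partial codon
    protein = []
    for i in range(0, usable, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")  # initiator codon is read as methionine
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("GTGAGCGACC GCGCCACGAC GACGGGCACT"))  # MSDRATTTGT
```

The output for the first ten codons matches the first ten residues of the listed protein sequence, including the GTG start codon read as M.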