Gene Hmuk_0047 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHmuk_0047 
Symbol 
ID8409544 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalomicrobium mukohataei DSM 12286 
KingdomArchaea 
Replicon accessionNC_013202 
Strand
Start bp42927 
End bp43982 
Gene Length1056 bp 
Protein Length351 aa 
Translation table11 
GC content67% 
IMG OID645018385 
Productoligopeptide/dipeptide ABC transporter, ATPase subunit 
Protein accessionYP_003175905 
Protein GI257386132 
COG category[E] Amino acid transport and metabolism
[P] Inorganic ion transport and metabolism 
COG ID[COG0444] ABC-type dipeptide/oligopeptide/nickel transport system, ATPase component 
TIGRFAM ID[TIGR01727] oligopeptide/dipeptide ABC transporter, ATP-binding protein, C-terminal domain 


Plasmid Coverage information

Num covering plasmid clones31 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAAGAAC AACGCACGCA GGGTGGCCCC GACAGCGCGG ACCCGATCAT CGAGGCCCGC 
AACGTCTGTG TCACCTACGA CCTGGAGCAC CGAGACGCGA TGGTACTGGA CGACGTGTCG
ATCGACCTCC GCCGGGGCGA GATTCTCGGT GTCGTCGGCG AGTCCGGCAG CGGCAAGTCG
ATGCTGGCAA ACGCCATGAT GGACGCCGTC GAGGAACCCG GTATCACGAC CGGTGAAGTG
ACCTACTACC CCGAAGACGG CGGCGAACCG GTCGACGTTC TCGACCTCTC GACCGAGGAC
CTCAAGGAGT TCCGCTGGGA GGAAGTGTCG ATGGTGTTCC AGGGCGCGCT GTCGTCGTTC
AACCCCACCA TGTCGATCCG CGGGCACTTC GAGGAGACGC TGGCGGCCCA CGACTACGAC
GTGGAGGAGG GGATGGAACG GGCCAGACAG CTCCTCGGTG ACCTGTACCT CGATCCCGAC
CGCGTTCTGG ACTCGTACGC CCACGAACTC TCTGGCGGGA TGAGCCAGCG GGCGCTGATC
GCACTCAGTC TGGTCCTCGA ACCGCAGGTC CTCTTGATGG ACGAGCCGAC GGCCGCGCTC
GACCTGCTGA TGCAGCGCTC GATCCTCTCC TTGCTGGCCG ACATCAAGGC CAAGTACGAC
CTGACGATCC TCTTTATCAC CCACGACCTC CCGCTGGTCG CGGGGCTGGC CGACCGACTG
GCGATCCTCT ATGCCTTCGA GCTCGCGGAG GTCGGCACCG CGACGCAGAT CGCCCACGAC
TCCAAACACC CCTACACGCG GGCGCTGTTG CAGGCCGTGC CGAACCTCGA CGCGCCGACC
GACTCGATGC GACCGATCGA GGGGACCGCG CCGAACCCGG CCCACGTCCC CGACGGCTGT
CACTACGCGC CGCGGTGCCC GCTCGCGACC AGGGAGTGTC ACGAGGAGAC GCCGCCGTGG
GTCGACGTCG AGGACGACCA CCGCTCGGCC TGCTTCCACT ACGACGAGGC CGAGGACGCC
GTTCCGTTCG ACCTCTCGGA GGTGTCCGAC GCGTGA
 
Protein sequence
MQEQRTQGGP DSADPIIEAR NVCVTYDLEH RDAMVLDDVS IDLRRGEILG VVGESGSGKS 
MLANAMMDAV EEPGITTGEV TYYPEDGGEP VDVLDLSTED LKEFRWEEVS MVFQGALSSF
NPTMSIRGHF EETLAAHDYD VEEGMERARQ LLGDLYLDPD RVLDSYAHEL SGGMSQRALI
ALSLVLEPQV LLMDEPTAAL DLLMQRSILS LLADIKAKYD LTILFITHDL PLVAGLADRL
AILYAFELAE VGTATQIAHD SKHPYTRALL QAVPNLDAPT DSMRPIEGTA PNPAHVPDGC
HYAPRCPLAT RECHEETPPW VDVEDDHRSA CFHYDEAEDA VPFDLSEVSD A