Gene Hmuk_1186 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHmuk_1186 
Symbol 
ID8410706 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalomicrobium mukohataei DSM 12286 
KingdomArchaea 
Replicon accessionNC_013202 
Strand
Start bp1125001 
End bp1126731 
Gene Length1731 bp 
Protein Length576 aa 
Translation table11 
GC content68% 
IMG OID645019522 
Producthypothetical protein 
Protein accessionYP_003177019 
Protein GI257387246 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1506] Dipeptidyl aminopeptidases/acylaminoacyl-peptidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value0.710855 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones37 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACACGAA AGACGACAGT CCTGTTCATC GCACTCCTGC TCGTGTTCGG TGGCGGTACG 
GTCGCGTACC TGGTCCAGAC CGACGGCGGA AGCGTCGAGA CCAGAGACGT GCGCTTTGCC
GCTCCGGACG GACAGATGAT CCACGGGACC CTGTACGTCC CGCCCGGTGC TAGTGCCGAC
GATCCCAGCC CGGGGGTCGT CGCGACCCAC GGCTACATCA ACACCCGCGA GACACAGTCC
CCGTTCGCCA TCGAGTTCGC GCGTCGCGGG TTCGTCGTCC TGGCGATCGA TCAGGCCGGT
CACGGCCACT CCGATCCCCC AGCGTTCGGG AACGGTTTCG GCGGCCCACC GGCCCTGTCG
TATATGCGGT CGCTGGCGTT CGTCGACGAC GACAACGTCG GGCTCGAAGG GCACTCGATG
GGCGGTTGGG CCAGCGTCGC AGCGGCCCAG AGCGACCGGG ACGGCTACGA GTCGGTCGCG
CTGGTCGGCT CTTCGACGGG GAACTCGGGG GTCGCCGACG GCAACGCGAC CTTCCCGCGC
AACACGGCCG TCGTCTTCAG CGAGTACGAC GAGTTCTCGA TGCTGATGTG GGGTGTGCCC
CACGCGTCGA ACGTCGAGCG AAGCGAGAAG CTCCAGACGC TGTTCGGCAC TGACGAACCC
GTCGAGGAGG GACGGACCTA CGGGAGTCTC GAAGACGGAA CCGCTCGCCG TCTGTACACT
CCGTCGACGA CCCACCCCGG CGACCACCTC TCGACGACCG CCGTCGGCGA CGCGGTCGAG
TGGATGCAGG TGACGCTGGA CGGCGAGGAC GAACTACCGC CGGGCAACCA GATCTGGTGG
CTCAAAGAGC TGGGAACGCT GGCCTCGCTG CTCGGCGGGG TCCTCTTCCT GTTCCCCTTC
GGAGCGATGG TCCTCTCGGA CCGGCGCTTC TCGGATCTCC GCCGGACGGT ACCCGAGCCG
GCCGGCGTCG AGGGGCGAAG CTGGTACGGC GCGGTCGCGG TGGCCGCACT CGTCCCGGTG
GTGACCTACT TCCCGCTGAA CCTCGTCGGC CAGGGGTTCG TCCCGATCAG CTGGCTGTTC
CCCCAGCAGA TCACCAACGG CGTGATGGTG TGGGCCCTGG GGAACGGCCT GATCGTCGCC
GGGCTGGTCG GCGCGTGGCA CTATCGGTCG GACCGTGACT CGCTCGCGCG CTACGGGTTC
GACACGGACG GCTGGCGAAC GGTCGCCCGT TCGGCCGTCG CGGCGGCGAC GATCGTCGGT
GGCTTCCTCG GCACGCTCGG GGCCGTGGGC TACCTCTTTG GCACCGACTT CCGCTTCTGG
GTGTTCGCGA TGAAGCTCCC GACGCTCACG CAGTTCCGGA TCGCGCTGGT GTACCTGGTG
CCGTTGACGC TGTTTTTCCT CGCACTGGAG CTGCTACTGC ACGGCCAGCT CCGAACCGGC
GACCGCTCCT TCCGGGAGGC GCTGGCCCAC AACTGGATCG CCGTCGTCGG CGGGTTCGTC
CTGCTCCTCG CGGTACAGTA CGGCGTCCTG CTGGCCGGTA GTCCTTCACC CTTCGGCCAG
CCGCTCCTGA CGATCGTCGC CCTCCAGTTC GTCGCCCTGC TCTCGATCGT CACCGCCGTC
TCCACGTACT TCTTCCGGCG GACCGGTCGC GTCTGGGTCG GCGCGTTCGT CAACAGTCTG
CTCGTGGCGC TGGTCCTCGT CGCGGGGACC GCGACGCACG CGCCGCTGTA G
 
Protein sequence
MTRKTTVLFI ALLLVFGGGT VAYLVQTDGG SVETRDVRFA APDGQMIHGT LYVPPGASAD 
DPSPGVVATH GYINTRETQS PFAIEFARRG FVVLAIDQAG HGHSDPPAFG NGFGGPPALS
YMRSLAFVDD DNVGLEGHSM GGWASVAAAQ SDRDGYESVA LVGSSTGNSG VADGNATFPR
NTAVVFSEYD EFSMLMWGVP HASNVERSEK LQTLFGTDEP VEEGRTYGSL EDGTARRLYT
PSTTHPGDHL STTAVGDAVE WMQVTLDGED ELPPGNQIWW LKELGTLASL LGGVLFLFPF
GAMVLSDRRF SDLRRTVPEP AGVEGRSWYG AVAVAALVPV VTYFPLNLVG QGFVPISWLF
PQQITNGVMV WALGNGLIVA GLVGAWHYRS DRDSLARYGF DTDGWRTVAR SAVAAATIVG
GFLGTLGAVG YLFGTDFRFW VFAMKLPTLT QFRIALVYLV PLTLFFLALE LLLHGQLRTG
DRSFREALAH NWIAVVGGFV LLLAVQYGVL LAGSPSPFGQ PLLTIVALQF VALLSIVTAV
STYFFRRTGR VWVGAFVNSL LVALVLVAGT ATHAPL