Gene Hmuk_0148 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHmuk_0148 
Symbol 
ID8409645 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalomicrobium mukohataei DSM 12286 
KingdomArchaea 
Replicon accessionNC_013202 
Strand
Start bp149473 
End bp151326 
Gene Length1854 bp 
Protein Length617 aa 
Translation table11 
GC content68% 
IMG OID645018473 
Productpeptidase S9 prolyl oligopeptidase active site domain protein 
Protein accessionYP_003175993 
Protein GI257386220 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1506] Dipeptidyl aminopeptidases/acylaminoacyl-peptidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones33 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value0.554216 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACGAGG ACCTCCTCGA AGCGCTGGCG AGCCTGCCGA CCTTCCACCA CGCGACCGTC 
TCCCCCGACG GGAACGAAGT CGCCGTCTAC TACGACGAGA CGGGCCGCAA CGAGTTGCAC
GTCGTCGACG TAGCGACCGG CGAGCGAACG CGGTGGAGCG ACGGCGAAGT GCCCCGGAAC
GCCCGCTGGC ACGTCGAGTG GGGAGCCGAC GGCGACCGCG TCTTCTTCCA CCTCGACGAC
GACGGCAACG AACAAAACGA CGTGTACGCG ATCGCTCGCG ACGGCTCGGT CGAGCCAGTT
GTCCAGTTGG ACGGGCAGAC GGTCCTGCAA GACGTGGGCG AAGACGGCGA AACCCTGCTG
GTCGGATCGA CGGCGAGCGG TCAGATGCAG CTGTACCGCC ACGACTGCGC GACGGGCGAG
ACGACCCAGC TGACCGAGTA CGATCGGGCC GTCGGCGCGG GCGTGCTGTC GCCCGATTGC
GAGCGCATCG CCTACGCGAC CAACGAGACG GACACCTTCG AGAACACGGA CACCTACGTG
GCCGACGCCG ACGGATCGAA CCAGCGGAAC CTGGAGATCG GCGAGACGGG CGCTGAAGTC
GGGCCGGTCG ACTGGGGTCC AGACGGCGAC CGGCTGCTGG TGACGGACAA CACGGCGGAT
CTGGGCCGGT GTGGCGTCTA CGATCTGTCG ACCGACGAGG TGACCTGGTA CGGGGGCGAG
TTCGACGAGG ACGCCGAGTC CTTCCTGCCC GGTGGCGACC GGTTCCTCGC GGTTCGGACA
CGAGAGTGTG CGAAAGAAGT CGTCGTCTAC GACGCCCGGA CCGGCGCGGC GACGACGCTC
GATCTACCAT CGGGCGTCGC GTCACTCGGG CGTGCGGGCA GCGCCGTCGT CGACGAGGAA
CGCATCGTCT GCTCGCACAC GACGCCGGAT CGGCGACCCG AACTGCTGTG TTACGACCTC
GCGACCGACG AGACCGAGCA GCTGATCGCG GCCGACTACG GCGACCTCGA TCCCGATCGG
TTCGTGGACG CCGAGTACTT CACCTTCGAC TCCGCGGGCG TCGCCGAGTT CGACGACCGC
GCCGGCCAGG GAGTCGAGAG CGACCCAGCG TCCCGCGAGA TCGAGGCGCT GCTGTACGAC
AGCGGCGAGC GTCCGTCGCC AGCGATCGTG AACCCCCACG GCGGTCCGCG GGGACAGGAC
ACTCGCGGGT TCGACCTCTA CACGCAGTTC CTCTGTTCGC AGGGGTACAG CGTCCTGAAG
GTGAACTACC GGGGCTCGAC CGGCCGCGGC CGCGAGTTCG CACAGGCGAT CTACGACGAC
TGGGGCGGCA ACGAGCAGGC CGACATCGCG ACGGGGACGA CGATCCTCGC CGAGAAGGAG
TGGGTCGACG ACGACCGTAT CGCCGTCTTC GGTGGCTCTT ACGGCGGCTA CTCGGCGTAC
TGTCAGATGA CGATGTACCC CGAGCTGTAC GACGCGGGGG TCGCCTGGAT CGGCGTCACC
GACCTGCGGG ACCTCTACGA GAACACGATG CCCCACTACC GGACCGAACT GCTGGAGAAG
AACGTGGGCT CGCCCGAGGA GAATCCGGCG CTGTACGACG AACGCAGCCC CATCACTCAC
GCCGAGAACC TCGCCGCTCC GCTGTTGGTG CTCCACGGCG TCAACGACCG GCGCGTCCCG
GTCTCGCAGG CCCGACTGTT CCGTGATCGC CTCGACGAAC TCGGCTACGA GGCGGGCGAA
GACGGCGACT ACGAGTACGT CGAACTCGGC GAGGAGGGCC ACGCATCGAC CGACGTGGAC
CAGAAGATCC GCACGTTCCG CACGCTCGCG GACTTCCTCG AACGCCGGCT CTGA
 
Protein sequence
MDEDLLEALA SLPTFHHATV SPDGNEVAVY YDETGRNELH VVDVATGERT RWSDGEVPRN 
ARWHVEWGAD GDRVFFHLDD DGNEQNDVYA IARDGSVEPV VQLDGQTVLQ DVGEDGETLL
VGSTASGQMQ LYRHDCATGE TTQLTEYDRA VGAGVLSPDC ERIAYATNET DTFENTDTYV
ADADGSNQRN LEIGETGAEV GPVDWGPDGD RLLVTDNTAD LGRCGVYDLS TDEVTWYGGE
FDEDAESFLP GGDRFLAVRT RECAKEVVVY DARTGAATTL DLPSGVASLG RAGSAVVDEE
RIVCSHTTPD RRPELLCYDL ATDETEQLIA ADYGDLDPDR FVDAEYFTFD SAGVAEFDDR
AGQGVESDPA SREIEALLYD SGERPSPAIV NPHGGPRGQD TRGFDLYTQF LCSQGYSVLK
VNYRGSTGRG REFAQAIYDD WGGNEQADIA TGTTILAEKE WVDDDRIAVF GGSYGGYSAY
CQMTMYPELY DAGVAWIGVT DLRDLYENTM PHYRTELLEK NVGSPEENPA LYDERSPITH
AENLAAPLLV LHGVNDRRVP VSQARLFRDR LDELGYEAGE DGDYEYVELG EEGHASTDVD
QKIRTFRTLA DFLERRL