Gene Hmuk_3141 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHmuk_3141 
Symbol 
ID8412694 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalomicrobium mukohataei DSM 12286 
KingdomArchaea 
Replicon accessionNC_013202 
Strand
Start bp3023832 
End bp3025622 
Gene Length1791 bp 
Protein Length596 aa 
Translation table11 
GC content68% 
IMG OID645021488 
Productpeptidase S9 prolyl oligopeptidase active site domain protein 
Protein accessionYP_003178953 
Protein GI257389180 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1506] Dipeptidyl aminopeptidases/acylaminoacyl-peptidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value0.91649 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value0.671974 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTACGATC TCGAACGCTA TCTCAACGTC AGATCTGCGA CCGGGGCGAC GCTCGGGCCC 
GACGGACAGC TCGCCTTCCG GATGAACGCG ACCGGCACGT TCCAGCTCTG GAACGTCGAC
GAACCGGGCG GCTGGCCCCA GCAGCGCACC TTCTACGAGG ACTCCGTGTC CTTCGCCTCG
TACTCGCCCG AACGGCCGGA GCTGATCTTC GGCAAGGACG AGGGGGGCGA CGAACGGATG
CAGCTGTTCC GCCTCGACGC CGACGGCACG ATCACGTCGC TCACCGACAG TCCGGACGCC
AAACACCGCT GGGGCGGCTG GAGCCACGAC GGCGAGCGAT TCGCCTTCGC CTCGAATCGC
CGCGACGAGA GCGTCTTCGA CATCTACGTG CAGGGCCGGG ACGACGACGA GGCCACACTG
GTCCACGAGG GCGACGGCTG GCTCACCGTC GGCGGCTGGT CGCCCGACGA CACGAAACTC
CTCGTCGGCG AGGCCCACTC GAACGTCGAC CAGGATCTCT TCGTCCTCGA CCTGGAATCG
GGCGACTTCG AGCACCTCAC GCCCCACCAC GGCGACGTGC GCTACGGGAG TGCCCAGTGG
GCACCCGACG GCGACGGCGT CTATCTCGTC ACCGACAGCG ATGCGGACCT GCGCTGGCTG
GCCCGCCTCG ATCTCGACGA GACGCTCCGC GAGGTCGTCT CGGACGACGA CTGGAACGTC
GACGGCGTGT CCGTCGACGT CGACACCGGA CGGCTCGCCT ACTCCCGGAA CGTCGACGGC
TACACTGACC TGACCGTCGG ACGGTTGACC GACGAGACCA CCATCGAGAC GTTCCCGACG
CCGGAGCTGC CCGGCGGGAT CGCTGGCGGC GTCTCGTGGG GACCCGACGC CGAGCGCTTC
GCCGTGACGG TCACCGGCCG GACGGTCAAC ACGAACGTCT TCGTCGTCGA GACCGAGACG
GGCGAGGCCA CCCGCTGGAC GCACGCCTCG ACGGCCGGCA TTCCCGACTC GACGTTCGTC
CCGCCGGAGC TGGTCCACTT CGAGAGCTTC GACGGGCGCG AGATTCCGGC GTTCTTCTCG
CTGCCGCCGG CCGAGGTCAG AGCCGACGAC GGCGTCCCGG TGATCGTCGA CATCCACGGC
GGTCCCGAGA GCCAGCGCCG GCCCTCCTTC TCCGGCCTCC AGCAGTACTT CCTCTCGCGT
GGCTACGCGC TGTTCGAACC CAACGTCAGG GGATCGAGCG GCTACGGCAC CGACTACATG
CAACTGGACG ACGTGGAAAA CCGGCTGGAC AGCGTCCGCG ACATCCGCGC TGGCGTCGAC
TGGCTCCACG AGCATCCGGC GGTCGATCCC GACCGACTGG TGGCCAAGGG CGGTTCCTAC
GGCGGGTTCA TGGTGCTCGC CGCGATGACG GAGTACCCCG ACCTCTGGGC CGCGGGCGTC
GACGTGGTCG GCATCGCGAA CTTCGTCACC TTCCTCGAAA ACACCGGCGA CTGGCGAAGA
TCGCTCCGCG AAGCCGAGTA CGGCTCACTG GAGGACGACC GCGGGTTCCT CGAATCCGTC
TCGCCGATCC ACAGCGCCGA CCAGATCGCC GCGCCACTGT TCGTCATCCA CGGCGAGAAC
GACCCGCGGG TCCCGGTCGG CGAGGCCGAA CAGATCGCCG ACGCCGTCCG CGAGCAGGAC
GTTCCGGTCG AACTGCTGGT CTTCGACGAC GAGGGCCACG GGATCGCCAA GCGAGAGAAC
CGGATCGAGG CCTACACGCG GGTCGTCGAG TTCCTCGACG AGCACGTCTG A
 
Protein sequence
MYDLERYLNV RSATGATLGP DGQLAFRMNA TGTFQLWNVD EPGGWPQQRT FYEDSVSFAS 
YSPERPELIF GKDEGGDERM QLFRLDADGT ITSLTDSPDA KHRWGGWSHD GERFAFASNR
RDESVFDIYV QGRDDDEATL VHEGDGWLTV GGWSPDDTKL LVGEAHSNVD QDLFVLDLES
GDFEHLTPHH GDVRYGSAQW APDGDGVYLV TDSDADLRWL ARLDLDETLR EVVSDDDWNV
DGVSVDVDTG RLAYSRNVDG YTDLTVGRLT DETTIETFPT PELPGGIAGG VSWGPDAERF
AVTVTGRTVN TNVFVVETET GEATRWTHAS TAGIPDSTFV PPELVHFESF DGREIPAFFS
LPPAEVRADD GVPVIVDIHG GPESQRRPSF SGLQQYFLSR GYALFEPNVR GSSGYGTDYM
QLDDVENRLD SVRDIRAGVD WLHEHPAVDP DRLVAKGGSY GGFMVLAAMT EYPDLWAAGV
DVVGIANFVT FLENTGDWRR SLREAEYGSL EDDRGFLESV SPIHSADQIA APLFVIHGEN
DPRVPVGEAE QIADAVREQD VPVELLVFDD EGHGIAKREN RIEAYTRVVE FLDEHV