Gene Hmuk_0037 details


Gene Information

Locus tag: Hmuk_0037
Symbol:
ID: 8409534
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halomicrobium mukohataei DSM 12286
Kingdom: Archaea
Replicon accession: NC_013202
Strand:
Start bp: 32804
End bp: 34399
Gene Length: 1596 bp
Protein Length: 531 aa
Translation table: 11
GC content: 72%
IMG OID: 645018375
Product: 5-oxoprolinase (ATP-hydrolyzing)
Protein accession: YP_003175895
Protein GI: 257386122
COG category: [E] Amino acid transport and metabolism; [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG0146] N-methylhydantoinase B/acetone carboxylase, alpha subunit
TIGRFAM ID:
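
The start, end, gene length, and protein length fields above can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database tool.

```python
def cds_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Values copied from the Gene Information fields of this record.
length_bp = cds_length(32804, 34399)
print(length_bp, expected_protein_length(length_bp))
```

Under these assumptions the two computed values should agree with the Gene Length and Protein Length fields.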


Plasmid Coverage information

Num covering plasmid clones: 29
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 30
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGACCGACG ACGCCGTCGA CGCCGTCGAC GCGGTCGAAC TGGAGATCCT CCGGAACCAA 
CTGGAGAGCG TCGCAGAGGA GATGGGCGAG GTGCTGATTC GGGGTGCCTA CTCGCCCAAC
ATCACCGAGC GTCGGGACTG CTCGACCGCG CTGTTCGACG CCGACGGGCG GCTGGTCGCA
CAGGCCGAAC ACATCCCGGT CCACCTCGGG GCGATGCCCC GTGCCGTCGA CGCGATCCGG
GACCGGGATC CACGGCCCGG CGACGTGTTC GCGCTCAACG ACCCCTTCGC CGGTGGCACC
CACCTCCCGG ACGTGACCTT CGTCTCGCCG ATCGGGGCGG GAGCTGGTGG TCGGTCCCGG
TCCGGCGACC GACCTACCGG AGACGACGAT ATCGTGGCGT ACGCCGTCTC GCGGGCACAT
CACGCCGACG TGGGCGGGAT GGCTCCCGGC AGCATGCCCG CCGGGGCACG CGAGATCCAG
CAGGAGGGGC TGCGCGTGCC GCCGGTCCGG ATCGTCGCCG ACGGAACGGT GGTCGACGAC
GTGTTAGAGC TGGTGCTGGC GAACGTCCGC AACCCGGCCC AGCGCCGGGC CGACGTACAG
GCCCAACTGG CCGCCAACGA GCGCGGTGCC CGCCGGATCG GAGAGCTGCT CGACGACCAC
GGTGAGGGGC TGCTCGCGGC CTTCGACGCC GTCGTCGACT ACTCGCGAGC GCGGATGGAG
CGAGAGCTGT CGGCCCTCGA ACCGGGCACC TACGAGGCCG CCGGGACGAT CGAGGGCGAC
GGCGTGACCG ACACGGCGGT ACCGATCGAG GCGACCGTGA CCGTCGCGGA CGGCGCGGTG
ACGGTCGACT TCGAGGGGAC CGCCCCGCAG GTCGCGGGCA ACGTGAACGC GCCGCTCGCC
GTCGCCGAGA GCGCCGTCTA CTACGTCATA CGATGTATCA CGGACCCGGA GATCCCACCG
AACCAGGGCT GTTACGATCC CGTCACGGTC GAAGCTCCCG AGGGATCGCT GCTGAATCCG
ACGCCGCCCG CAGCGGTCGT CGGCGGCAAC GTCGAGACGA GCCAGCGGAT CACCGAGGTC
GTCTTCGACG CGCTCGCCGA GGCCGCACCC GACCGCGTCC CGGCCGAGAG CCAGGGGACG
ATGAACAATC TCGTCGTCGG GGGACCGGAC TTCACCTACT ACGAGACCAT CGGCGGCGGC
GCGGGCGCGA CGCCGAACCG GGACGGTGCC TCCGGCGTCC AGGTGGGGAT GACGAACACG
CGAAACACGC CGGTCGAGGC GCTGGAGGCG GCGTACCCGC TGCGGGTCGC CGAGTACTCG
CTGCGCTCCG ACACGGGAGG CTCCGGCCGC CATCGCGGCG GTGACGGACT CGTCCGTGAG
ATCGTCGTCG AGACCGACGC CACCGTCTCG CTCCTGACCG ATCGCCGCCA GACGCCGCCA
GCGGGACGAG CCGGCGGCAC AGACGGGACC GTCGGCGAGA ACTTCGTCGA CGGCGAGGCC
GTTGCCTCGA AGCACACCCG CGCGGTCGAG GCCGGCACCA CCGTCAGAGT CGAGACGCCG
GGCGGTGGCG GCTACGGCGA TCCCGACGGC GCGTGA
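
The GC content field of a record like this is the percentage of G and C bases in the gene sequence. A minimal sketch of that calculation follows, assuming the sequence is pasted as plain text in which the groups of ten bases are separated by whitespace. The function name `gc_percent` is hypothetical, and only the first row of the sequence is included here to keep the example short.

```python
def gc_percent(dna: str) -> float:
    """Percentage of G and C bases, ignoring whitespace and case."""
    cleaned = "".join(dna.split()).upper()
    if not cleaned:
        raise ValueError("empty sequence")
    gc = cleaned.count("G") + cleaned.count("C")
    return 100.0 * gc / len(cleaned)


# First row of the gene sequence above (60 bases).
FIRST_ROW = "GTGACCGACG ACGCCGTCGA CGCCGTCGAC GCGGTCGAAC TGGAGATCCT CCGGAACCAA"
print(round(gc_percent(FIRST_ROW), 1))
```

Passing the full gene sequence instead of the first row gives a value that can be compared with the reported GC content.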
 
Protein sequence
MTDDAVDAVD AVELEILRNQ LESVAEEMGE VLIRGAYSPN ITERRDCSTA LFDADGRLVA 
QAEHIPVHLG AMPRAVDAIR DRDPRPGDVF ALNDPFAGGT HLPDVTFVSP IGAGAGGRSR
SGDRPTGDDD IVAYAVSRAH HADVGGMAPG SMPAGAREIQ QEGLRVPPVR IVADGTVVDD
VLELVLANVR NPAQRRADVQ AQLAANERGA RRIGELLDDH GEGLLAAFDA VVDYSRARME
RELSALEPGT YEAAGTIEGD GVTDTAVPIE ATVTVADGAV TVDFEGTAPQ VAGNVNAPLA
VAESAVYYVI RCITDPEIPP NQGCYDPVTV EAPEGSLLNP TPPAAVVGGN VETSQRITEV
VFDALAEAAP DRVPAESQGT MNNLVVGGPD FTYYETIGGG AGATPNRDGA SGVQVGMTNT
RNTPVEALEA AYPLRVAEYS LRSDTGGSGR HRGGDGLVRE IVVETDATVS LLTDRRQTPP
AGRAGGTDGT VGENFVDGEA VASKHTRAVE AGTTVRVETP GGGGYGDPDG A
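
The relationship between the gene sequence and the protein sequence can be sketched as a codon-by-codon translation. The gene starts with GTG, which normally encodes valine but is read as methionine when it serves as a start codon under translation table 11. The code below is a minimal sketch rather than a full implementation: it assumes table 11 uses the standard amino acid assignments with the alternative start codons listed in `START_CODONS`, and the function name `translate_cds` is hypothetical.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G; third position fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumed initiation codons for translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_cds(dna: str) -> str:
    """Translate a CDS fragment, reading a leading start codon as M and stopping at '*'."""
    cleaned = "".join(dna.split()).upper()
    if len(cleaned) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(cleaned), 3):
        codon = cleaned[i:i + 3]
        if i == 0 and codon in START_CODONS:
            aa = "M"  # initiator codon is translated as methionine
        else:
            aa = CODON_TABLE[codon]
        if aa == "*":
            break  # stop codon ends the protein
        protein.append(aa)
    return "".join(protein)


# First row of the gene sequence in this record.
FIRST_ROW = "GTGACCGACG ACGCCGTCGA CGCCGTCGAC GCGGTCGAAC TGGAGATCCT CCGGAACCAA"
print(translate_cds(FIRST_ROW))
```

The printed residues can be compared with the first two groups of the protein sequence. Note that a fragment which does not begin at the true start of the gene is translated without the methionine substitution only if its first codon is not in `START_CODONS`.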