Gene Hmuk_0212 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHmuk_0212 
Symbol 
ID8409710 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalomicrobium mukohataei DSM 12286 
KingdomArchaea 
Replicon accessionNC_013202 
Strand
Start bp211268 
End bp212539 
Gene Length1272 bp 
Protein Length423 aa 
Translation table11 
GC content70% 
IMG OID645018537 
Productdihydroorotase 
Protein accessionYP_003176056 
Protein GI257386283 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0044] Dihydroorotase and related cyclic amidohydrolases 
TIGRFAM ID[TIGR00857] dihydroorotase, multifunctional complex type 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value0.0857683 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones41 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTCATTC GGAACGCGAC GCTCGCGGAC GGACGGACTC GGGACGTGCG CGTCCGCGGA 
GAGACGATCG ACGCCGTGGA CGAGGATCTC GACCCGGCGG ACGAGGACAC CGTCGACGCG
GCAGACAGAC TGCTCTTGCC CGGAGCGATC GACGCCCACG TCCACTTCCG CCAGCCCGGC
TACGGCCACA AGGAGAGCTG GGCCAGCGGT TCGCGGTCGG CCGCGGCCGG CGGCGTCACG
ACCGTCGTCG ACCAGCCCAA CACCGACCCG CCGACGGTCG ACGGGGCCGC CTTCGATCAG
AAGGCCGAGC TGGCCGGCGA ATCACTCGTC GACTTCGGCA TCAACGGCGG CGTCACGGGC
GAGTGGGAGC CCGCGGAACT ACTTGACCGG CCCCTGTTCG CACTCGGCGA GGTCTTCCTC
GCGGACTCGA CCGGCGACAT GGGGATCGAC GCCGACCTGT TCGAGGACGC ACTGGTCGCG
GCGGCCCAGC GGGACGTGAC CGTCACCGTC CACGCCGAAG ACGCCTCGCT GTTCAATCGG
GCGGCGAGAG ATCGCGACGA CGCCGACGCC TGGAGCGCGT TCCGCACCGC CCGCGCGGAA
GCCGCCGCCG TCGAGCGAGC CTGCGAGGTC GCGGCCGAAC ACGACGCCCG GATCCACATT
GCACACACCT CCACACCCGA GGGGATCGAC ACCGCCAGCG ACGCCGGGAT GACGACCGAG
GTCACGCCCC ATCACCTCCT GCTCTCGCGG TCGGACCTCG ACGAGTTGGG CACGCACGGC
CGGATGAACC CGCCGCTGCG CAGCGAGAAA CGCCGCCGAG AGGTGTACGA CCGCGTCGTC
GACGGCACCG TCGACATGAT CGCGACCGAC CACGCGCCCC ACACCCGCGA AGAGAAGGAC
GCCTCGATCT GGGACGCCCC CTCCGGGGTG CCCGGCGTCG AGACGATGCT CCCGCTCTTG
CTGGCCGAGG CCCGGACCGG CGATCTGACC TACGAACGGG TCCGAGATCT CGTCGCCGCG
AACCCCGCCG ACGTGTTCGA CCTGCCGGAG AAGGGCCGGA TCGCCGAGGG CAACGACGCC
GACCTCGTGC TGGTCGACAC CGACGACGTG CGCGAGATCA CCGGCGACGG GCTCCACTCG
AACTGCGGGT GGACTCCCTT CGAGGGGTTC GAGGGCGTCT TCCCGAAGTG GACGATGGTC
CGTGGCACGG TCGTCTACGA CCGGTCTGAC GACGAATTCA CCGATCAGCA GGGCGAGAAC
GTTCGAGCCT GA
 
Protein sequence
MLIRNATLAD GRTRDVRVRG ETIDAVDEDL DPADEDTVDA ADRLLLPGAI DAHVHFRQPG 
YGHKESWASG SRSAAAGGVT TVVDQPNTDP PTVDGAAFDQ KAELAGESLV DFGINGGVTG
EWEPAELLDR PLFALGEVFL ADSTGDMGID ADLFEDALVA AAQRDVTVTV HAEDASLFNR
AARDRDDADA WSAFRTARAE AAAVERACEV AAEHDARIHI AHTSTPEGID TASDAGMTTE
VTPHHLLLSR SDLDELGTHG RMNPPLRSEK RRREVYDRVV DGTVDMIATD HAPHTREEKD
ASIWDAPSGV PGVETMLPLL LAEARTGDLT YERVRDLVAA NPADVFDLPE KGRIAEGNDA
DLVLVDTDDV REITGDGLHS NCGWTPFEGF EGVFPKWTMV RGTVVYDRSD DEFTDQQGEN
VRA