Gene Hmuk_2578 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHmuk_2578 
Symbol 
ID8412124 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalomicrobium mukohataei DSM 12286 
KingdomArchaea 
Replicon accessionNC_013202 
Strand
Start bp2482369 
End bp2483406 
Gene Length1038 bp 
Protein Length345 aa 
Translation table11 
GC content69% 
IMG OID645020919 
Productamidohydrolase 
Protein accessionYP_003178391 
Protein GI257388618 
COG category[F] Nucleotide transport and metabolism
[R] General function prediction only 
COG ID[COG0402] Cytosine deaminase and related metal-dependent hydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones31 
Plasmid unclonability p-value0.786376 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value0.942138 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGACC AGACCATTCT CGCCGGCACG ATCTTCCGCG GCCGCGAGTT CGAACCGATC 
GAGGGCCGCG TCGTGATCGA GGACGGCGAG ATCGCGGCCG TCGAGGAAGC GACGGTCGAT
CCCTCCTCGT GGATCATCCC GGCGTTCGTC AACGCACACA CCCACATCGG CGACTCCATC
GCCAAGGAGG CCGGCGGCGG CCTCACACTC GAAGAACTCG TCGCGCCGCC CGACGGCCTC
AAACACAGGC TACTCAGACA GGCCAGCCGC GAGGAACTCG TCGACGCGAT GGCCCGAACG
ATATCGTTCA TGGAGCGAGC GGGAACGGCC GCGTTCGTCG AGTTCCGCGA AGGCGGGGTC
GACGGCGTCG CGGCGATCGA GGCGGCACTG GCGGACTCGC CGGTCGACAG CGTCGTCCTC
GGGCGGGAGA CGGTCGCGGC GATGGAGCGA AGCGACGGGT TCGGTGCGAG CGGAGCCGCC
GACGGCGACT TCAGCCACGA ACGGACCGCG ACCGCCGAGG CGGGGAAGCT GTTCGGCATC
CACGCCGGGG AGGTCGACGC CAGCGACATC AACCCCGCGC TGGACCTCGA TCCGGACTTT
CTCGTCCACA TGGTCCACGC CGAGGGGTTG CACCTCGACC GCGTGGCCGA CAGCGAAGTC
CCCGTCGTCG TCTGCCCTCG CTCGAACGTC GTGACGAACG TCGGCGTTCC GCCGATCACC
GACCTCGCCG AACGGACGAC GGTCGCGCTG GGGACGGACA ACGTCATGAC CAACAGCCCG
TCGATGTTCC GCGAGATGGC CTGGACGGCG AAACTCGCCG ACGTTCCGGC CGTCGAGGTC
CTCCGGATGG CGACGGTCAA CGGAGCCGAG ATCGCGGGCC TGAACTGCGG GCTCGTCGCA
GAGGGCCGCG ACGCCGATCT GCTGGTGCTG GACGGCGACT CGGACAATCT CTCGGGTGCG
CGGGACCCGG TCCGCGCGAT CGTCCGGCGT GCCGGCGTCG ACGACGTCGA ACGAGTCCAC
TACGCAGGCA AGGCTTAA
 
Protein sequence
MSDQTILAGT IFRGREFEPI EGRVVIEDGE IAAVEEATVD PSSWIIPAFV NAHTHIGDSI 
AKEAGGGLTL EELVAPPDGL KHRLLRQASR EELVDAMART ISFMERAGTA AFVEFREGGV
DGVAAIEAAL ADSPVDSVVL GRETVAAMER SDGFGASGAA DGDFSHERTA TAEAGKLFGI
HAGEVDASDI NPALDLDPDF LVHMVHAEGL HLDRVADSEV PVVVCPRSNV VTNVGVPPIT
DLAERTTVAL GTDNVMTNSP SMFREMAWTA KLADVPAVEV LRMATVNGAE IAGLNCGLVA
EGRDADLLVL DGDSDNLSGA RDPVRAIVRR AGVDDVERVH YAGKA