Gene Hmuk_2080 details


Gene Information

Locus tag: Hmuk_2080
Symbol:
ID: 8411616
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halomicrobium mukohataei DSM 12286
Kingdom: Archaea
Replicon accession: NC_013202
Strand:
Start bp: 1990400
End bp: 1991710
Gene Length: 1311 bp
Protein Length: 436 aa
Translation table: 11
GC content: 69%
IMG OID: 645020419
Product: N-ethylammeline chlorohydrolase
Protein accession: YP_003177900
Protein GI: 257388127
COG category: [F] Nucleotide transport and metabolism; [R] General function prediction only
COG ID: [COG0402] Cytosine deaminase and related metal-dependent hydrolases
TIGRFAM ID:
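
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the protein length excludes a single terminal stop codon; neither convention is stated in the record itself, and the variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 1990400
end_bp = 1991710
gene_length_bp = 1311
protein_length_aa = 436

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not counted as a residue.
codons = span // 3
residues = codons - 1

print(span, span % 3, residues)  # 1311 0 436
```

Under these assumptions the span matches the reported gene length, is a whole number of codons, and yields the reported protein length.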


Plasmid Coverage information

Num covering plasmid clones: 33
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 32
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGACTCA CCGGGACCGT CGTCGCCGAC GCCGACACCG TCTACGAGGA CGGCGCGGTC 
GTCACGAGCG GTGACGAGAT CGTCGCAGTC GGGGACCGCC AGCGTCTGGT CCGGCAGTAC
CCGGACCACG ACAGCCGATC GTTCGACATC GTCGCGCCGG GGCTGGTCGG CTCGCACGTC
CACTCGGTAC AGAGCCTCGG ACGGGGGATT GCCGACGACG AAGCGCTGCT TGACTGGCTC
TTCGATCACG TCCTCCCGAT GGAGGCTGCC ATGGACGCCG AGCAGATGCG GACCGCGGCG
ACGCTTGGCT ACATGGAGTG TCTGGCCAGC GGCGTCACGA CCGTCGTCGA CCACCTCTCG
GTCGCGCACG CCGACCAGGC TTTCGAGGCG GCCGGCGAGA TCGGGATCCG CGGCCTGCTC
GGGAAGGTGT TGATGGACTA CGACGCCGGG GCGCTACAGG AAGACACCGA CGCGGCACTG
GCCGAGTCCG AGCGCCTCAT CGAGCGCTAT CACGGGGCCT TCGACGACCG GATTCGCTAC
GCAGTCACGC CGCGCTTTGC CGTCTCCTGT ACCGAGCGCT GTCTCAGGGG GGCTCGCGAT
CTCGCCGACG CCTACGACGA CGTGCGCATC CACACGCACG CCAGCGAGAA CCGCGACGAG
ATCCAGACGG TCGAGGACCG GACGGGCATG CGCAACGTCG AGTGGCTCGA CGAGGTCGGC
CTGACGGGGC CAGACGTGAC GCTCGCCCAC TGCGTCTGGA CGGACGAGAC CGAGCGGGCG
ATCCTCGCCG AGACCGACAC CACCGTCGTC CACTGTCCCA GCTCGAACAT GAAACTCGCC
AGCGGGATCG CGCCCGTCGA GGCCTACTTG CAGCGGGGGA TCACCGTCGC GCTGGGCAAC
GACGGCCCGC CCTGCAACAA CACGCTGGAT CCGTTCACCG AGATGCGCCA GGCGGCCCTG
CTGGCGAAGG TCGGCGAACT CGACGCGACG GCGCTGCCGG CGGCGACCGC CTTTCGGATG
GCGACCGAGC ACGGGGGGCA GGCGACTGGC TTCGACGTGG GCGTCCTCGC GCCGGGTCGG
CCGGCCGACG TGATCGGCCT CGCGACGGAC ACCGCCCGGG CGACGCCGGT TCACGACCCG
CTCTCGCACC TCGTCTTCGC CGCCCACGGC GACGACGTTC GGTTCACGAT GGTCGACGGC
GAGGTCGTCT ACGACGACGG CTCGTTCGCG AACGTGGACG GCGCAGCCGT CCGCGCGGAC
GCTCGACGGC ACGCAGATTC GATCTACGCT GAGATCTCGT CCAAGGATTA A
 
Protein sequence
MRLTGTVVAD ADTVYEDGAV VTSGDEIVAV GDRQRLVRQY PDHDSRSFDI VAPGLVGSHV 
HSVQSLGRGI ADDEALLDWL FDHVLPMEAA MDAEQMRTAA TLGYMECLAS GVTTVVDHLS
VAHADQAFEA AGEIGIRGLL GKVLMDYDAG ALQEDTDAAL AESERLIERY HGAFDDRIRY
AVTPRFAVSC TERCLRGARD LADAYDDVRI HTHASENRDE IQTVEDRTGM RNVEWLDEVG
LTGPDVTLAH CVWTDETERA ILAETDTTVV HCPSSNMKLA SGIAPVEAYL QRGITVALGN
DGPPCNNTLD PFTEMRQAAL LAKVGELDAT ALPAATAFRM ATEHGGQATG FDVGVLAPGR
PADVIGLATD TARATPVHDP LSHLVFAAHG DDVRFTMVDG EVVYDDGSFA NVDGAAVRAD
ARRHADSIYA EISSKD
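
The gene and protein sequences above are printed in space-separated blocks of ten residues. As a minimal sketch of how they might be checked against each other, the Python below strips that grouping, computes a GC fraction, and translates the DNA codon by codon. The helper names (`clean_sequence`, `gc_fraction`, `translate`) are hypothetical and not part of any database tooling. The codon table is the standard amino-acid assignment, which translation table 11 shares with table 1; the alternative start codons that table 11 permits are not handled, which does not matter here because the gene begins with ATG.

```python
# Standard codon table in NCBI order (bases T, C, A, G); "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(text):
    """Remove the spaces and line breaks used to group residues, and uppercase."""
    return "".join(text.split()).upper()


def gc_fraction(dna):
    """Return the fraction of G and C bases in a cleaned DNA string."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna):
    """Translate complete codons from the first base, stopping at the first stop."""
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence, copied from the record above.
first_bases = clean_sequence("ATGCGACTCA CCGGGACCGT CGTCGCCGAC")
print(translate(first_bases))  # MRLTGTVVAD
```

The printed residues match the first block of the protein sequence. Passing the full cleaned gene sequence to `translate` and `gc_fraction` would allow the same comparison against the complete protein sequence and the reported GC content.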