Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hmuk_2080 |
Symbol | |
ID | 8411616 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halomicrobium mukohataei DSM 12286 |
Kingdom | Archaea |
Replicon accession | NC_013202 |
Strand | + |
Start bp | 1990400 |
End bp | 1991710 |
Gene Length | 1311 bp |
Protein Length | 436 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 645020419 |
Product | N-ethylammeline chlorohydrolase |
Protein accession | YP_003177900 |
Protein GI | 257388127 |
COG category | [F] Nucleotide transport and metabolism [R] General function prediction only |
COG ID | [COG0402] Cytosine deaminase and related metal-dependent hydrolases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 33 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 32 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGACTCA CCGGGACCGT CGTCGCCGAC GCCGACACCG TCTACGAGGA CGGCGCGGTC GTCACGAGCG GTGACGAGAT CGTCGCAGTC GGGGACCGCC AGCGTCTGGT CCGGCAGTAC CCGGACCACG ACAGCCGATC GTTCGACATC GTCGCGCCGG GGCTGGTCGG CTCGCACGTC CACTCGGTAC AGAGCCTCGG ACGGGGGATT GCCGACGACG AAGCGCTGCT TGACTGGCTC TTCGATCACG TCCTCCCGAT GGAGGCTGCC ATGGACGCCG AGCAGATGCG GACCGCGGCG ACGCTTGGCT ACATGGAGTG TCTGGCCAGC GGCGTCACGA CCGTCGTCGA CCACCTCTCG GTCGCGCACG CCGACCAGGC TTTCGAGGCG GCCGGCGAGA TCGGGATCCG CGGCCTGCTC GGGAAGGTGT TGATGGACTA CGACGCCGGG GCGCTACAGG AAGACACCGA CGCGGCACTG GCCGAGTCCG AGCGCCTCAT CGAGCGCTAT CACGGGGCCT TCGACGACCG GATTCGCTAC GCAGTCACGC CGCGCTTTGC CGTCTCCTGT ACCGAGCGCT GTCTCAGGGG GGCTCGCGAT CTCGCCGACG CCTACGACGA CGTGCGCATC CACACGCACG CCAGCGAGAA CCGCGACGAG ATCCAGACGG TCGAGGACCG GACGGGCATG CGCAACGTCG AGTGGCTCGA CGAGGTCGGC CTGACGGGGC CAGACGTGAC GCTCGCCCAC TGCGTCTGGA CGGACGAGAC CGAGCGGGCG ATCCTCGCCG AGACCGACAC CACCGTCGTC CACTGTCCCA GCTCGAACAT GAAACTCGCC AGCGGGATCG CGCCCGTCGA GGCCTACTTG CAGCGGGGGA TCACCGTCGC GCTGGGCAAC GACGGCCCGC CCTGCAACAA CACGCTGGAT CCGTTCACCG AGATGCGCCA GGCGGCCCTG CTGGCGAAGG TCGGCGAACT CGACGCGACG GCGCTGCCGG CGGCGACCGC CTTTCGGATG GCGACCGAGC ACGGGGGGCA GGCGACTGGC TTCGACGTGG GCGTCCTCGC GCCGGGTCGG CCGGCCGACG TGATCGGCCT CGCGACGGAC ACCGCCCGGG CGACGCCGGT TCACGACCCG CTCTCGCACC TCGTCTTCGC CGCCCACGGC GACGACGTTC GGTTCACGAT GGTCGACGGC GAGGTCGTCT ACGACGACGG CTCGTTCGCG AACGTGGACG GCGCAGCCGT CCGCGCGGAC GCTCGACGGC ACGCAGATTC GATCTACGCT GAGATCTCGT CCAAGGATTA A
|
Protein sequence | MRLTGTVVAD ADTVYEDGAV VTSGDEIVAV GDRQRLVRQY PDHDSRSFDI VAPGLVGSHV HSVQSLGRGI ADDEALLDWL FDHVLPMEAA MDAEQMRTAA TLGYMECLAS GVTTVVDHLS VAHADQAFEA AGEIGIRGLL GKVLMDYDAG ALQEDTDAAL AESERLIERY HGAFDDRIRY AVTPRFAVSC TERCLRGARD LADAYDDVRI HTHASENRDE IQTVEDRTGM RNVEWLDEVG LTGPDVTLAH CVWTDETERA ILAETDTTVV HCPSSNMKLA SGIAPVEAYL QRGITVALGN DGPPCNNTLD PFTEMRQAAL LAKVGELDAT ALPAATAFRM ATEHGGQATG FDVGVLAPGR PADVIGLATD TARATPVHDP LSHLVFAAHG DDVRFTMVDG EVVYDDGSFA NVDGAAVRAD ARRHADSIYA EISSKD
|
| |