Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hmuk_0754 |
Symbol | |
ID | 8410268 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halomicrobium mukohataei DSM 12286 |
Kingdom | Archaea |
Replicon accession | NC_013202 |
Strand | + |
Start bp | 721972 |
End bp | 723270 |
Gene Length | 1299 bp |
Protein Length | 432 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 645019089 |
Product | amidohydrolase |
Protein accession | YP_003176592 |
Protein GI | 257386819 |
COG category | [F] Nucleotide transport and metabolism [R] General function prediction only |
COG ID | [COG0402] Cytosine deaminase and related metal-dependent hydrolases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 38 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 0.558016 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTGAAC TCCTCGTTTC CGGCGGGCGG GTCCTCCGAC CGGACCTGAC CGTCGAACGC GCGGACGTAC TGGTCGATCA GGACAGCGGC GACGTCGCGG CCGTCGGCGA GCCCGGCGCG CTCTCCGGCG ACGACGAGCT GGACGCCAGC GAGGGACTGG TCGTGCCGGG GCTGGTCAAC GCCCACACGC ACGTGGCGAT GACGCTGTTG CGGGGGTACG CCGACGACAA GCCACTGGAC GCGTGGCTGC AAGAGGACAT CTGGCCAGTC GAGGCGGAGT TGACCCCGAA GGACGTGCGG GCCGGTGCCG AACTCGGCCT GGTCGAGATG ATCAAATCCG GGACGACGGC GCTCTCGGAC ATGTACTTCC ACGTCGACGA GATCGCCGGG GCCGTCGAGC AGGCCGGCCT GCGGGCCGTG CTGGGCCACA CCGCGGTCAC CGTCGGCAAG GACGAGGCGG ACGCACGCGA GGACGTCCAG CAGAGCCTCG ACGTGGCCGA GCGCCTCGAC GGCGCGGCCG ACGGGCGAAT TCGGACGACC TTCCAGCCCC ACTCGCTGAC GACCGTCGGG GAGGAGCTCC TCCGGGAGTT CGTCCCCGCG GCGAACGACG CCGGCCGGCC GATCCACCTG CACGCCAACG AGACGAGCGA CGAGGTCGGC CCGATCGTCG ACGAGCACGG CAAGCGGCCA CTGGAGTACG CCGACGACCT GGGGGTGCTG GGTCCGGACA CCTGGATCGC CCACGGCGTC CACGTCGACG AGCGAGAGAT CGAGTTGCTG GCCGACACCG ACACCGGCGT CGCCCACTGC CCGGCCTCGA ACATGAAGCT CGCCAGCGGG ATGGCACCCG TCCAGGAGCT GCTCGATGCC GGGGTCACGG TCGGGCTGGG GACGGACGGC GCGGCCTCGA ACAACGACCT CTCGATGTTC GACGAGATGC GCGACGCCGC GATGATCGGC AAGCTCGCCG CCGAGGACGC GAGCGCGATG GCGGCAGCGA GCGTCGTCGA GATCGCCACG GCCGGCGGGG CCGAACTGCT CGGGTTCGAC AGCGGGCGGA TCGAAGCGGG CGCGAACGCC GACCTCGCCG TGGTCGACCT CGACCAGCCA CACCTGACGC CGGCCCACGA CCTCGTGAGC CACCTCGTCT ACGCCGCCAG CGGGAGCGAC GTGCGCCACA CCGTCTGTGA CGGAACGGTC CTGATGCGCG ACCGGGACGT GAAACCGTTC GACGAAGCGA CTGTCGTCGA GCGTGCAGAC GAGCACGCGA CGGCGCTGGT CGGGCGCGCG ACGGAGTAG
|
Protein sequence | MSELLVSGGR VLRPDLTVER ADVLVDQDSG DVAAVGEPGA LSGDDELDAS EGLVVPGLVN AHTHVAMTLL RGYADDKPLD AWLQEDIWPV EAELTPKDVR AGAELGLVEM IKSGTTALSD MYFHVDEIAG AVEQAGLRAV LGHTAVTVGK DEADAREDVQ QSLDVAERLD GAADGRIRTT FQPHSLTTVG EELLREFVPA ANDAGRPIHL HANETSDEVG PIVDEHGKRP LEYADDLGVL GPDTWIAHGV HVDEREIELL ADTDTGVAHC PASNMKLASG MAPVQELLDA GVTVGLGTDG AASNNDLSMF DEMRDAAMIG KLAAEDASAM AAASVVEIAT AGGAELLGFD SGRIEAGANA DLAVVDLDQP HLTPAHDLVS HLVYAASGSD VRHTVCDGTV LMRDRDVKPF DEATVVERAD EHATALVGRA TE
|
| |