Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hlac_2201 |
Symbol | |
ID | 7401136 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorubrum lacusprofundi ATCC 49239 |
Kingdom | Archaea |
Replicon accession | NC_012029 |
Strand | + |
Start bp | 2184699 |
End bp | 2186024 |
Gene Length | 1326 bp |
Protein Length | 441 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 643709273 |
Product | N-ethylammeline chlorohydrolase |
Protein accession | YP_002566848 |
Protein GI | 222480611 |
COG category | [F] Nucleotide transport and metabolism [R] General function prediction only |
COG ID | [COG0402] Cytosine deaminase and related metal-dependent hydrolases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.419012 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 29 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTGATCG CCGGAACCGT CATCGCCGAC CCCGAGACCG TGATCCCCGA CGGCGCCGTC GTCGTCGAGG GCGAGACGAT CGCCGCGGTC GGCGACGCCG AGATCCTCCG CGAGGCGTAC CCCGACCACG AGCGACGCGA TATCGACATC GTCGCCCCCG GGCTGGTCGG CGGCCACGTG CACTCGGTGC AGTCGCTCGG CCGCGGGATC GCCGACGACG CCGCCCTCCT CGACTGGCTG TTCGACGCCG TGCTCCCGAT GGAGGCCGCG ATGGACGCCG GCGCGACCCG GGCCGCCGCC GAACTCGGAT ACCTCGAATG CCTCGAATCG GGAACGACGA CGGTCGTCGA CCACCTCTCC GTCAACCACG CAGAGGAGGC GTTCGAGGCC GCGATCGAGA CGGGGATCCG CGCACGACTC GGAAAGGTGC TGATGGACCG CGACTCGCCC GAGGGGCTGC TGGAGGACAC CGACGCCGCG CTCGCCGAGA GCGAGGCGCT GATCGAGGAG TATCACGGCG CCGCCGACGG CCGGGTGCGG TACGCGGTGA CACCCCGGTT TGCCGTCACC TGCTCGGAGG CGTGTTTGCG GGGCTGTCGC GACCTCGTCG ACCGCCACGA CGGGGTGACG ATCCACACCC ACGCCAGCGA GAACGAGGAC GAGATCGAAA CGGTGGAGGC CGACACCGGG AAGCGGAACG TCCTCTGGCT CGACGAGGTC GGGCTGACGG GGCCGGACGT GACGCTCGCG CACTGCGTCC ACACCGACGA GCGCGAGCGC GAGGTGCTCG CCGAGACCGA CACGGTCGTC ACCCACTGCC CCTCGTCGAA CATGAAGCTC GCGTCCGGGA TCGCTCCGGT TCAGGACTAC CTCGACCGCG GAATCACCGT CGCGCTCGGC AACGACGGAC CGCCCTGCAA CAACACGCTC GACCCGTTCA CCGAGATGCG GCAGGCGAGC CTCCTGGGGA AGGTCGACGC CCGCGACCCG ACTCGACTCC CGGCCTCGAC GGTGTTGGAG ATGGCGACGA CGAACGGGGC GCACGCCGCC GGCTTCGACC GTCTCGGCAC CCTTCGGGAG GGTCAGCGCG CCGACGTGAT CGGGATCACC ACCGACCGCA CCCGCGCCAC CCCGCTTCAC GACCCGCTCT CGCATCTGGT GTACGCCGCC CACGGCGACG ATGTGGTGTT CACCATGGTC GACGGCCGGA TCCGGTACGA CGACGGCGAG CACGTCGGGA TCGACGCCGA CGCGGTCCGC GAGCGCGCCA CGCGCCACGC GAAGCGGGTC GTCGAGGAAG CGGGTATCGA CACGGCCGAG TCGTAA
|
Protein sequence | MLIAGTVIAD PETVIPDGAV VVEGETIAAV GDAEILREAY PDHERRDIDI VAPGLVGGHV HSVQSLGRGI ADDAALLDWL FDAVLPMEAA MDAGATRAAA ELGYLECLES GTTTVVDHLS VNHAEEAFEA AIETGIRARL GKVLMDRDSP EGLLEDTDAA LAESEALIEE YHGAADGRVR YAVTPRFAVT CSEACLRGCR DLVDRHDGVT IHTHASENED EIETVEADTG KRNVLWLDEV GLTGPDVTLA HCVHTDERER EVLAETDTVV THCPSSNMKL ASGIAPVQDY LDRGITVALG NDGPPCNNTL DPFTEMRQAS LLGKVDARDP TRLPASTVLE MATTNGAHAA GFDRLGTLRE GQRADVIGIT TDRTRATPLH DPLSHLVYAA HGDDVVFTMV DGRIRYDDGE HVGIDADAVR ERATRHAKRV VEEAGIDTAE S
|
| |