Gene Information |
Locus tag | Hlac_1568 |
Symbol | |
ID | 7401501 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorubrum lacusprofundi ATCC 49239 |
Kingdom | Archaea |
Replicon accession | NC_012029 |
Strand | - |
Start bp | 1586171 |
End bp | 1587235 |
Gene Length | 1065 bp |
Protein Length | 354 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 643708635 |
Product | amidohydrolase |
Protein accession | YP_002566225 |
Protein GI | 222479988 |
COG category | [F] Nucleotide transport and metabolism; [R] General function prediction only |
COG ID | [COG0402] Cytosine deaminase and related metal-dependent hydrolases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 32 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCACCTTG AAGGGACCGT TCTGGTCGGT CCGGAGTTCG AGCCGGTCGA AGGCCGGGTC ATCGTCGCCG ACGACGAGAT CGTCCGGATC GAGGAGACGA GCGTCGACTC CGACGACGTG ATCCTCCCGG CGTTCGTCAA CGCCCACACC CACATCGGTG ACTCCATCGC CAAGGAGGCC GGCGAGGGCC TCTCGCTCGA CGAGCTGGTC GCACCGCCGG ACGGGCTCAA ACACCGCCTC CTGCGGGAGG CGAGCCACGA GGCGAAAGTC GCCGCGATGG CGCGGAGCCT CCGGTACATG GAGTCGACCG GGACCGGCAC GTTCCTGGAG TTCCGCGAGG GCGGCGTCAA GGGCGTGGCC GCGCTCCGGG ACGCGGTCGC GGGCGAGGGC GTCGACTTCG GCGAGCGCGC GATCGATCCG GTCGTGTTCG GTCGTGACGA CCCCGACGTG CTCTCGGTCG CTGACGGGTA CGGCGCCTCC GGTGCCCGCG ACGCCGACTT CGACGCGGTG CGCTCGGAGA CTCGCGAGGC GGGCAAGCTG TTCGGGATCC ACGCCGGCGA GCGCGACGCC GACGACATCA ACGCCGCGAT GGATCTCGAC CCCGACTTTC TCGTCCACAT GGTCCACGCC GAGCCGATCC ACCTCGAGCG GCTCGCCGAC CGCGGGACGC CCGTGGCCGT CTGCCCCCGG TCGAATCTCG TGACGAACGT CGGCGTGCCG CCGATCCGCG ACCTCGCCGA GCGGACGACG GTCGCGCTCG GCACCGACAA CGTCATGCTC GACTCGCCGT CGATGTTCCG CGAGATGGAG TTCGCCGCGA AGCTCTCCGA TCTCCCGGCC CGAGAGATCC TGCGGATGGC GACGGTGAAC GGCGCGGCGA TCGCGGGGCT GAACCGCGGC GTGATCGAGC TGGGCGCGGA TGCCGATCTG TTGGTGCTCG ACGGCGACTC CGACAACCTT GCCGGCGCGC ACGACCTCGT TCGCGCGATC GTCAGGCGCG CCGGCGCGGC CGACGTGTCT CGGGTCGTAA TCGGCGGCGA GCCGGCCGGG AGAGAAACGG TTTAG
|
Protein sequence | MHLEGTVLVG PEFEPVEGRV IVADDEIVRI EETSVDSDDV ILPAFVNAHT HIGDSIAKEA GEGLSLDELV APPDGLKHRL LREASHEAKV AAMARSLRYM ESTGTGTFLE FREGGVKGVA ALRDAVAGEG VDFGERAIDP VVFGRDDPDV LSVADGYGAS GARDADFDAV RSETREAGKL FGIHAGERDA DDINAAMDLD PDFLVHMVHA EPIHLERLAD RGTPVAVCPR SNLVTNVGVP PIRDLAERTT VALGTDNVML DSPSMFREME FAAKLSDLPA REILRMATVN GAAIAGLNRG VIELGADADL LVLDGDSDNL AGAHDLVRAI VRRAGAADVS RVVIGGEPAG RETV
|
| |
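The Start bp, End bp, Gene Length, and Protein Length fields in this record can be cross-checked against each other. The sketch below assumes two conventions that the record itself does not state: that the coordinates are 1-based and inclusive, and that the final codon (TAG in the gene sequence shown) is a stop codon not counted in the protein length. The function names are hypothetical and used only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming a trailing stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon (an assumption).
    return cds_length_bp // 3 - 1


# Values copied from the Hlac_1568 record.
length = gene_length_bp(1586171, 1587235)
residues = protein_length_aa(length)
print(length, residues)
```

Under these assumptions the computed values agree with the listed Gene Length (1065 bp) and Protein Length (354 aa). The strand field does not affect either calculation, since only the span of the coordinates is used.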