Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hlac_0425 |
Symbol | |
ID | 7401043 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorubrum lacusprofundi ATCC 49239 |
Kingdom | Archaea |
Replicon accession | NC_012029 |
Strand | + |
Start bp | 442997 |
End bp | 444424 |
Gene Length | 1428 bp |
Protein Length | 475 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 643707490 |
Product | metal dependent phosphohydrolase |
Protein accession | YP_002565098 |
Protein GI | 222478861 |
COG category | [R] General function prediction only |
COG ID | [COG1078] HD superfamily phosphohydrolases |
TIGRFAM ID | [TIGR01353] deoxyguanosinetriphosphate triphosphohydrolase, putative |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.410662 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 0.214403 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCCACCA CCCAGATCAA AGATCCCGTC CACGGCTACG TCGAGCTCCC GGACGCGCTC GTCGAGGGCG TCGTCGACAC CCGGCCGTTC CAGCGGCTGC GGTACGTCCG CCAGCTCTCG GCGACGCACC TCGTGTACCC GGGCGCGAAT CACACCCGGT TCGAGCACTC GCTGGGCGTC TACCACCTCG GCCGAACCGT CTTCGAGAAC CTCCGACAGC AGTCGTACTT CGCACGGGAG GCGACCGTCG ACGAACTGGA AGAGATCCAG CGCACCTTAG AGTGCGCCTG CCTGCTCCAC GACGTGGGCC ATCCGCCCTT CTCGCACCTC TCCGAGGGGT TCCTCGACGA GGGGGTACTC CGGGAGCGCG TCGCGGAGAC GGGCTTAGTC GACGCCTTCG ACGCGGCCGG CGTCGGCGGC GCCCCGCTCC GCTCGGCGAA CCCGCACGAG CTACTCGGCT GCGTGATTAT CGTCGAGGAG TACGGCGACG CGCTCCGGGC GTTCGATGTC GACCCCTTCG AGGTGTGCGC GTACGTGCTC GGCTACAGCC TCGCGTACGA GCGCGGCGAA CCGTGGCAGT ACGGGGTCGG CGCCCAGCTG CTCCACTCAC CCATCGACGT GGACCGGCTC GACTACATCA CTCGGGACAA CTACATGACC GGTGCCGGCG TGTTGAGCTT CGACGTCGAC CGTATGGTCG ACGCCTACAC CGCTCACCCC GAGGAGGGCC TGGCGCTCAC CGAGAAGGCG CTCTCGACCA TCGGCAACTA CCTCGAAGGG CGGATCGCGC TGTACATGTG GGTCACCCAG CACCACAAGT CGGTGTACGC GAACCGGCTC CTCCAGGCGA TGCTCGGCGA ATACGCCGCC GAGACCGGCG AGAGCCCGGT TACGGTCAAC GGCGTGCTCT CCCGAGAGCT CGACGACAAT GCGGTGCTTG AGCGCCTCCG GATCGCCGCC CGCGATCGCC CCGATTCGAC GCTGGCGTCG ATGTACGATC GCTTCCGGGG GCGGCGCTTC CCGGCCACCT GCTGGAAACA CCGGATCGCG CTCGCCGACC GGGTGGGCCG AGACCTCGAC GGCGACCTCG GCGGGGACGG CGGCGAAGCC CTCGACGAGT TCACGGCGTG GCTCACCGAG GGCGACGATC GGCTGGAACG ACTCCTCGCC GACGCCCTCG ACGTGCCGGT CCACGAGGTG TGGATCGACC GGTCGTACGT GCCGGCCTAC GACCCCGACG AACTGGAGGA CATCCCCATC GCGTACGGCG GGACGACGCG GTCCGTCGGC GATTGGGGGC TGTACGGCGA CCGCGCGTTC GACGTGCCGA TCCCCTTCGT GTTCGTCCCC GACGGGACGA AGCGGCGGGC GATCCGCGTG CTCACGGAGG CGTTCGAGCG GGAGGTCGGG GAGACGAAGC AAGCTTGA
|
Protein sequence | MPTTQIKDPV HGYVELPDAL VEGVVDTRPF QRLRYVRQLS ATHLVYPGAN HTRFEHSLGV YHLGRTVFEN LRQQSYFARE ATVDELEEIQ RTLECACLLH DVGHPPFSHL SEGFLDEGVL RERVAETGLV DAFDAAGVGG APLRSANPHE LLGCVIIVEE YGDALRAFDV DPFEVCAYVL GYSLAYERGE PWQYGVGAQL LHSPIDVDRL DYITRDNYMT GAGVLSFDVD RMVDAYTAHP EEGLALTEKA LSTIGNYLEG RIALYMWVTQ HHKSVYANRL LQAMLGEYAA ETGESPVTVN GVLSRELDDN AVLERLRIAA RDRPDSTLAS MYDRFRGRRF PATCWKHRIA LADRVGRDLD GDLGGDGGEA LDEFTAWLTE GDDRLERLLA DALDVPVHEV WIDRSYVPAY DPDELEDIPI AYGGTTRSVG DWGLYGDRAF DVPIPFVFVP DGTKRRAIRV LTEAFEREVG ETKQA
|
| |