Gene Information |
Locus tag | Hlac_0335 |
Symbol | |
ID | 7399725 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorubrum lacusprofundi ATCC 49239 |
Kingdom | Archaea |
Replicon accession | NC_012029 |
Strand | + |
Start bp | 358436 |
End bp | 359299 |
Gene Length | 864 bp |
Protein Length | 287 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 643707397 |
Product | HAD superfamily (subfamily IA) hydrolase, TIGR01548 |
Protein accession | YP_002565009 |
Protein GI | 222478772 |
COG category | [R] General function prediction only |
COG ID | [COG0546] Predicted phosphatases |
TIGRFAM ID | [TIGR01548] haloacid dehalogenase superfamily, subfamily IA hydrolase, TIGR01548; [TIGR01549] haloacid dehalogenase superfamily, subfamily IA, variant 1 with third motif having Dx(3-4)D or Dx(3-4)E |
|
|
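The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (which is consistent with the listed gene length), and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and do not come from the record.

```python
def gene_length_bp(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS ending in one stop codon (assumed)."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1  # the stop codon encodes no residue


length = gene_length_bp(358436, 359299)
print(length, protein_length_aa(length))  # 864 287
```

Both values agree with the Gene Length and Protein Length fields of the record.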
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.0232711 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAGGTCG ACGCAGTCGT CCTCGACATC GACGGGGTGT TGGTCGACGT AGCGGACTCC TACCGGCGGG CGATCATCGA GTCCGTCGAA CGGGTGTGTG ACAAGACGAT CGATCGCGAC GCGATTCAGT TGTTCAAGGA CGCCGGTGGG TTCAACAACG ACTGGGAGCT GACTGACGCG GCCGCCCTCT ACGTGCTGGC CCGCCGGGAG GGGCTCGGGA TGGATGTCGA GACCTTCACC GGTCGGATCG CCGAGGGCGG GGGCGGGCTT GACGCTGCGA AGGAGGTCGT CGGCGACCTC CCCCGAGTCG CGCAGGCGCG GGTCCGCGAC CAGTGGGATA CCGAGCGACT CCGCGAGACG TTTCAGGCGC TGTACCTCGG CGAGGAGCTG TACCGCGAGC TGGAGGGCGG CGCCCCCCCG CTGTCGGCGC CGGGGTACAT CCACGACGAG CCGACCTTGG TGGAGCCGGA GACGATCGCT GACCTGACCG ACCGGTTCGA CGTTGGCGTG TTGACGGGGC GGCCCGCCGC CGAGGCGGAG ATCGCCTTGG AGCGCGTCGG CCTCGACGTG CCCGAAGATC GGCGGTTCAC GATGGACGAC TGGGAGGAGG GGAAGCCGCA TCCGCGGGCG CTGGTGACTC TCGCGGAGCG GTTCGACGCC GACCGGATCG CCTTCATCGG GGACACCCTC GACGACGTGC GAACCGCGCG CAACGCCGAC GATGAAGACG CGAGCCGCGT CTACTACGGG ATCGGCGTCC TCACCGGCGG CCTGACCGGT GACGAGGGGC GCCGGAAGTT CGCCGAAAAC GGAGCCGACG CGGTCGTCGA GGACGTGAAC GAACTCGTCG AGCTGTTGGA GTAA
|
Protein sequence | MQVDAVVLDI DGVLVDVADS YRRAIIESVE RVCDKTIDRD AIQLFKDAGG FNNDWELTDA AALYVLARRE GLGMDVETFT GRIAEGGGGL DAAKEVVGDL PRVAQARVRD QWDTERLRET FQALYLGEEL YRELEGGAPP LSAPGYIHDE PTLVEPETIA DLTDRFDVGV LTGRPAAEAE IALERVGLDV PEDRRFTMDD WEEGKPHPRA LVTLAERFDA DRIAFIGDTL DDVRTARNAD DEDASRVYYG IGVLTGGLTG DEGRRKFAEN GADAVVEDVN ELVELLE
|
| |
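As a rough illustration of how the gene sequence relates to the protein sequence and the GC content field, the following sketch translates DNA with the amino-acid assignments of NCBI translation table 11 and computes a GC percentage. It assumes the sequence is displayed in space-separated 10-character blocks, as above, and contains only A, C, G and T. The helper names (`clean`, `gc_percent`, `translate`) are hypothetical, and the sketch does not handle alternative start codons.

```python
from itertools import product

# Amino-acid string for NCBI translation table 11, with codons enumerated
# in T, C, A, G order at each of the three positions.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace used for display blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    if not dna:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon.

    Raises KeyError for codons containing characters other than A, C, G, T.
    """
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three display blocks of the gene sequence listed above.
prefix = "ATGCAGGTCG ACGCAGTCGT CCTCGACATC"
print(translate(prefix))  # MQVDAVVLDI
print(gc_percent(prefix))
```

Passing the full gene sequence to `translate` and `gc_percent` would allow the result to be compared against the Protein sequence and GC content fields.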