Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpen_0021 |
Symbol | |
ID | 4600487 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008698 |
Strand | + |
Start bp | 16955 |
End bp | 17656 |
Gene Length | 702 bp |
Protein Length | 233 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 639772774 |
Product | HAD family hydrolase |
Protein accession | YP_919434 |
Protein GI | 119718939 |
COG category | [R] General function prediction only |
COG ID | [COG1011] Predicted hydrolase (HAD superfamily) |
TIGRFAM ID | [TIGR01509] haloacid dehalogenase superfamily, subfamily IA, variant 3 with third motif having DD or ED [TIGR01549] haloacid dehalogenase superfamily, subfamily IA, variant 1 with third motif having Dx(3-4)D or Dx(3-4)E |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.191967 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACTAGGG CGGTTTTCTT CGACATGGGA GGGGTGCTCG TATTCGACAG AGGGTTTGCG CACCATCTCG CCAGGAACGT CAGTCTGGCT CTACGCGAAG CAGGGCTGGA ATACTCCGAG GAGGAGGTGC TCAGAGCCTG GAAGGAGTCA AGTGTGCACG GTGACGAGCT GGAAACCTGG GACCTTGTCA GATCCATGGT GTTGCTTAGA AAGCTGGGCG TAACCCCGAA ACCCCTGCTC GCAGAGAAAG TGTACAAGGC TGTGCTCGAG AGCTACGTTC AAGGCTTCTC GCTGGACGAA GAGGCTGAGC ACGCGCTGAG CCTGTCCCGT AGCCTAGGCT TCACCGTGGG GCTAATAACC AACGTTGGGA GCTACGAGAT CGTGCGGAGG AGGCTTGAGG AAGCCGGGCT CCTGAAGTAC GTAGACGTCA TTGTAGCCTC GCAGGCTGTT GCCTGGAAAA AGCCTTCACC GAGGATATTC GAACTAGCGT GCTACCTGGC CGGCGTCGAG CCGGGCAACG CCGTCCACGT GGGCGACGAC CCACGCATAG ATGTGGAGGG CGCCAAGAAA GCCGGGCTCA GGGCCGTACA AGTGCTGAAA GCCGGGCCTC CGAGAAGCCC CTACGCTGAT GCGTGGGTCA ACTCCGTAGG AGAAGTTCCA GGGATTCTAG AAGACTGGGT CTCCAAGGGG CTTTTTCGCT GA
|
Protein sequence | MTRAVFFDMG GVLVFDRGFA HHLARNVSLA LREAGLEYSE EEVLRAWKES SVHGDELETW DLVRSMVLLR KLGVTPKPLL AEKVYKAVLE SYVQGFSLDE EAEHALSLSR SLGFTVGLIT NVGSYEIVRR RLEEAGLLKY VDVIVASQAV AWKKPSPRIF ELACYLAGVE PGNAVHVGDD PRIDVEGAKK AGLRAVQVLK AGPPRSPYAD AWVNSVGEVP GILEDWVSKG LFR
|
| |