Gene Information |
Locus tag | Tpen_0952 |
Symbol | |
ID | 4601292 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008698 |
Strand | + |
Start bp | 905399 |
End bp | 906127 |
Gene Length | 729 bp |
Protein Length | 242 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 639773730 |
Product | HAD family hydrolase |
Protein accession | YP_920355 |
Protein GI | 119719860 |
COG category | [R] General function prediction only |
COG ID | [COG1011] Predicted hydrolase (HAD superfamily) |
TIGRFAM ID | [TIGR01509] haloacid dehalogenase superfamily, subfamily IA, variant 3 with third motif having DD or ED; [TIGR01549] haloacid dehalogenase superfamily, subfamily IA, variant 1 with third motif having Dx(3-4)D or Dx(3-4)E |
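
The Start bp, End bp, Gene Length and Protein Length fields above are tied together by simple arithmetic. The following is a minimal sketch of that relationship, assuming 1-based inclusive coordinates and a complete CDS whose final codon is a stop codon that is not counted as a residue. The function names (`gene_length`, `protein_length`, `gc_percent`) are hypothetical helpers for illustration, not part of any database API.

```python
# Values copied from the Gene Information table above.
START_BP = 905399
END_BP = 906127


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming a complete CDS whose last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def gc_percent(seq: str) -> float:
    """Percent G+C in a nucleotide string; whitespace is ignored."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return 100.0 * sum(b in "GC" for b in bases) / len(bases)


if __name__ == "__main__":
    n = gene_length(START_BP, END_BP)
    print(n, protein_length(n))  # prints: 729 242
```

Both results agree with the Gene Length (729 bp) and Protein Length (242 aa) fields. `gc_percent` could be applied to the gene sequence listed in the Sequence section to compare against the reported GC content.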
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGCTACCTG GCCAGGCTTG GAGGGGCGTA GTCGTCGACC TCTGGGGCAC GTTGCTCTAC CCCTCGGTAA GCCTGGAGGA GTACTCGAGG GAGAGGGCCA GGAGGATAGC CGAGGCCCTC AGGGCCCGCG GCGCCTCTCT CGGGGAGGAC GAGGTTCTCG AAGCCTACAA GCGGGCTAGG AGGCTCGCGG ACAGGGTGAG GAACATAACG ATGATCGAGG TTAGCCTGGA GGGCGAAGTA GTGATGCTCT TAGACGAGCT GGGGCTAGAG CCCTCCGAGG AGCTGGTATC CGAGGTCTCC GAGGCCTTCA TACAGCCGTA CCTCTCGCTC GTGAAGCCAG CCGAGGGGGC GCGGGGCTTC CTGGAGGGCG CCAAGAAGAT GGGCTACAGG CTCGTACTAG CCTCCAACAC TATGAGCACG AGGCACAGCG TGGAGCTCTT GAGGAGGCAC GGGCTCGCCG AGCTATTCGA CTACATGGCC TTCTCGGACA GCGTCGGCTT CAGGAAGCCC CACCCCAGGT TCTTCGCGCA CATAGTCGCC GAGGCTGGCA TAGCGCCGCG GGAAAGCTTC TTCGTGGGGG ACGAGGAGGC GGACATAAGG GGCGCCAAGC TCTGCGGCTT CAAAACGATA GCGTACACGG GCTTCCACCC CTACACCGGC AACACCCAGC CCGACTGCAC CGCGCACAGC TTCGACGCCG TGGCAGAGTG CATGGAGAAA CTGGCCTAA
|
Protein sequence | MLPGQAWRGV VVDLWGTLLY PSVSLEEYSR ERARRIAEAL RARGASLGED EVLEAYKRAR RLADRVRNIT MIEVSLEGEV VMLLDELGLE PSEELVSEVS EAFIQPYLSL VKPAEGARGF LEGAKKMGYR LVLASNTMST RHSVELLRRH GLAELFDYMA FSDSVGFRKP HPRFFAHIVA EAGIAPRESF FVGDEEADIR GAKLCGFKTI AYTGFHPYTG NTQPDCTAHS FDAVAECMEK LA
|
| |
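
The gene sequence above is printed in space-separated blocks of ten bases, begins with GTG and ends with TAA. The sketch below shows one way such a listing might be sanity-checked, assuming NCBI translation table 11 (the table named in this record), in which GTG is an accepted alternative start codon that is translated as methionine at the initiation position. The names `normalize` and `check_cds` are hypothetical, and the example string is a shortened toy fragment, not the full gene.

```python
# Start and stop codons of NCBI translation table 11
# (bacterial, archaeal and plant plastid code).
TABLE11_STARTS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}
STOPS = {"TAA", "TAG", "TGA"}


def normalize(seq: str) -> str:
    """Remove the block spacing and line breaks, and upper-case the bases."""
    return "".join(seq.split()).upper()


def check_cds(seq: str) -> dict:
    """Basic structural checks on a coding sequence listing."""
    dna = normalize(seq)
    return {
        "length_bp": len(dna),
        "multiple_of_3": len(dna) % 3 == 0,
        "start_ok": dna[:3] in TABLE11_STARTS,
        "stop_ok": dna[-3:] in STOPS,
    }


if __name__ == "__main__":
    # Toy fragment for illustration only: the first block of the gene
    # followed by a few in-frame codons and the terminal TAA.
    toy = "GTGCTACCTG GCCTGGCCTAA"
    print(check_cds(toy))
```

Running the same check on the full gene sequence should report a length matching the Gene Length field if the listing was copied without loss.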