Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Htur_2904 |
Symbol | |
ID | 8743521 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haloterrigena turkmenica DSM 5511 |
Kingdom | Archaea |
Replicon accession | NC_013743 |
Strand | - |
Start bp | 2977653 |
End bp | 2978369 |
Gene Length | 717 bp |
Protein Length | 238 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 646513489 |
Product | HAD-superfamily hydrolase, subfamily IA, variant 1 |
Protein accession | YP_003404446 |
Protein GI | 284166167 |
COG category | [R] General function prediction only |
COG ID | [COG1011] Predicted hydrolase (HAD superfamily) |
TIGRFAM ID | [TIGR01509] haloacid dehalogenase superfamily, subfamily IA, variant 3 with third motif having DD or ED [TIGR01549] haloacid dehalogenase superfamily, subfamily IA, variant 1 with third motif having Dx(3-4)D or Dx(3-4)E |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCGCCG ATGAGCGCAC GATCGACGAC GAATCCGCCG ACTGGGAGGC CGTCTTCTGG GACATCGGCG GCGTCATCCT CGATCTCGAG TCCGTCCAGG GCGCCCACGC GGCGTTCGTC GAGGGGCTGG TCGAGGAGCA CGGCCTCGAG ATGAGCGTCG AGGAAGCCGT CGACGTCTGG CGGACGGCCG TCGGCGACTA CTTCCGCGAG CGCGATGGCA CCGAGTTTCG CTCGGCCCGC GAGGGGTACG CGGCGGGCGT CGAGGCCCTC GTCGGCGAGA AACTCCCGCG AGAGCGATGG GAACCCGACT TCGAGGAAAT CGTCAACTCG TCGATCGAAC CAGTCCCGGG CGCTCCGGAG ACCATCGCGA AACTCGCCGA CCGCGAGATC CACGTCGGCG TCATCAGCGA CGTCGACGAC GAGGCGGGAA AGGAGATGCT CGAGCAGTTT GGCGTCCGCG AACGGTTCGA TTCGATCACC ACGTCGGAGG AGGTCGGCCG GACCAAGCCC GATCCCGAAA TCTTCGAGAC GGCGCTCGCG AAGGCCGGCG TCGCCCCCGA ACGATCGCTG ATGATCGGCG ACCGGTACGA CCACGACGTG AAGGGCGCTG ACGAGATGGG AATCCGCGGT GTCGCCTTCG GCGCCGAGGA CGGCCCGGCC GTCTCCTACC GGATCGAATC GCCCGCGGAG GTGCTCGAGA TCGTCGATGG GACGTGA
|
Protein sequence | MSADERTIDD ESADWEAVFW DIGGVILDLE SVQGAHAAFV EGLVEEHGLE MSVEEAVDVW RTAVGDYFRE RDGTEFRSAR EGYAAGVEAL VGEKLPRERW EPDFEEIVNS SIEPVPGAPE TIAKLADREI HVGVISDVDD EAGKEMLEQF GVRERFDSIT TSEEVGRTKP DPEIFETALA KAGVAPERSL MIGDRYDHDV KGADEMGIRG VAFGAEDGPA VSYRIESPAE VLEIVDGT
|
| |