Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hhal_0261 |
Symbol | |
ID | 4711115 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhodospira halophila SL1 |
Kingdom | Bacteria |
Replicon accession | NC_008789 |
Strand | - |
Start bp | 300204 |
End bp | 300941 |
Gene Length | 738 bp |
Protein Length | 245 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 639854721 |
Product | HAD family hydrolase |
Protein accession | YP_001001857 |
Protein GI | 121997070 |
COG category | [R] General function prediction only |
COG ID | [COG1011] Predicted hydrolase (HAD superfamily) |
TIGRFAM ID | [TIGR01509] haloacid dehalogenase superfamily, subfamily IA, variant 3 with third motif having DD or ED [TIGR01549] haloacid dehalogenase superfamily, subfamily IA, variant 1 with third motif having Dx(3-4)D or Dx(3-4)E |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATCCGTC CCCCCCTGCG GGCCCTGACC TTCGACCTCG ACGACACCCT CTGGTCCGTC GACGATGTCC TCGCGGCGGC CGAGCAGACC CTCCACGACT ATCTGCGCGA CGCCTACCCG GTGGTGGCCG AGCGCTTCGA TCCCGACACC ATGCGGCAGC TGCGCCGCGA ACTGGCCGCC GCGCGCCCGG AACTCACACG CAACGCCACC ACCCTGCGCC GTGCCACCAT GGCTGAGATC GCCCGCAGCT GCGGGATCGA CGGAGCCGCG GCGGAGCGGT TCGTGGAGCA GACCTTCGCC GTGTTCCTCG AGGCGCGGCA GAGCCGGGTG GCCCCGTTCC CCGAAGTGCT GCCGGCCCTG CGCGATCTGG CCGGGCGGTT CCGTATCGGC GTGATCACCA ACGGCAACGC CGATGTCTAC CGCACCGAAC TCGGCCCCTA CGTCCACTTC GTGGTGCGCG GCGTGGACAT CGACATTCCC AAGCCCGAGC CGGAGATCTT CGCCCTGGCC TGCGAGCACG CCGGGGCGCC CCCCGGCGAG GTTCTGCACG TGGGGGATGA CCCGGACAGC GACGCCGCCG GGGCGCTGGC CGCCGGCATG CAGGCGGCGT TGATCTGCCG TTACGGCCCG CCGGAGCCGT CCCACGGCGT GCCCGACTGC CTGATGGTCC CGGACATGGC GGCGCTGCGC CGGGCGATCC TGCGCAGCGT GGATGCAAGC GTGGATGCGG ATGGCTAG
|
Protein sequence | MIRPPLRALT FDLDDTLWSV DDVLAAAEQT LHDYLRDAYP VVAERFDPDT MRQLRRELAA ARPELTRNAT TLRRATMAEI ARSCGIDGAA AERFVEQTFA VFLEARQSRV APFPEVLPAL RDLAGRFRIG VITNGNADVY RTELGPYVHF VVRGVDIDIP KPEPEIFALA CEHAGAPPGE VLHVGDDPDS DAAGALAAGM QAALICRYGP PEPSHGVPDC LMVPDMAALR RAILRSVDAS VDADG
|
| |