Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_4881 |
Symbol | |
ID | 5736958 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 6217437 |
End bp | 6218072 |
Gene Length | 636 bp |
Protein Length | 211 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641282047 |
Product | HAD family hydrolase |
Protein accession | YP_001547639 |
Protein GI | 159901392 |
COG category | [R] General function prediction only |
COG ID | [COG1011] Predicted hydrolase (HAD superfamily) |
TIGRFAM ID | [TIGR01509] haloacid dehalogenase superfamily, subfamily IA, variant 3 with third motif having DD or ED [TIGR01549] haloacid dehalogenase superfamily, subfamily IA, variant 1 with third motif having Dx(3-4)D or Dx(3-4)E [TIGR02247] Epoxide hydrolase N-terminal domain-like phosphatase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCCAAG CCATCCTATT CGATTGGGGC GGCGTTTTCA ACCCGCAACA TGAGTCACTT GACGGCTATC GTAGTATTGC TCAGCGCTAT GGACATTCGG CAGAATCGTT GTATGCCTTG TTGTATAACG GCGATGAGTG GCGACAAGCT CGCATCGGCG AACTAACCAG CCAAGCCTAT TGGTCGAGTA TGCAGCAAAA ATTGGGCGTT ACTGGCGAGC TAGCCAACTT TATGGGCGAG CTTTTCGCTG GCGAACAACT CAATCAACAG ATGATACGGA TTGCTCAGGT GTTGCATCGA CGCTATCGCA CAGGTTTGCT CTCGAATGCC CTCGATGATC TGGAGACGAT TTTGGAGCGC TGGCAGGTTG CCAATTTATT CGATGTGGTG ATCAATTCGG CCCGCGTTGG CGTGGCCAAG CCCAATCCCC ATGCCTTTGA ATTGGCGGTT GCCGCCTTGG GCGTGCAAAT TCGCGACATT ATTTTTATTG ATGATAAATT GCGTAACGTG CTAGCAGCGC GGGCTTTCGG ACTACCGACC GTGCATTTCA CCACCACCAC AGCGCTGATC GACGAACTAG GAACTTTGGG TGTACTCAAA CCCCATGAAC GGGCCATGCT GCGCGAGGAG CTTTGA
|
Protein sequence | MTQAILFDWG GVFNPQHESL DGYRSIAQRY GHSAESLYAL LYNGDEWRQA RIGELTSQAY WSSMQQKLGV TGELANFMGE LFAGEQLNQQ MIRIAQVLHR RYRTGLLSNA LDDLETILER WQVANLFDVV INSARVGVAK PNPHAFELAV AALGVQIRDI IFIDDKLRNV LAARAFGLPT VHFTTTTALI DELGTLGVLK PHERAMLREE L
|
| |