Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_1592 |
Symbol | |
ID | 5733479 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 1845412 |
End bp | 1846212 |
Gene Length | 801 bp |
Protein Length | 266 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 641278731 |
Product | HAD family hydrolase |
Protein accession | YP_001544363 |
Protein GI | 159898116 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0647] Predicted sugar phosphatases of the HAD superfamily |
TIGRFAM ID | [TIGR01452] phosphoglycolate/pyridoxal phosphate phosphatase family [TIGR01458] HAD-superfamily subfamily IIA hydrolase, TIGR01458 [TIGR01460] Haloacid Dehalogenase Superfamily Class (subfamily) IIA [TIGR01509] haloacid dehalogenase superfamily, subfamily IA, variant 3 with third motif having DD or ED |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTCAATC TCAAGCAGCT CAAGCTTGTT TTGCTCGATA TGGATGGGGT GCTGCATCGC GGCGGGGAGA TTTTACCAGG TGCGGCAGAG TTGACGACTG TGCTTGATCG CTTAGGCTTA GGCTATGCCT GTTTGACCAA TAATTCATCG CAATTGCCTG CCACCTTCGC CCGTCATCTG CAAGATTTAG GGGTTGCGAT TGCGCCTGAG CATGTGATTA CCTCGTCAAC TGCTACGGCT ACGCTGTTGC GCACGCGCTA CCCGCAAGGC ACGCGCTTGC TGGCAATCGG CATGGATGGG ATTCAGTCGT CGTTATTTGC TGATCGCTAT TTTGTATCAG CCGAAACCGA TGTAGCAGCA GTGGTGGTTG GGGTTGATTT TAACCTGACC TATGCCCGCT TGAAAACTGC AACCTTGGCG TTACGCGCAG GCGCAGCGTT TATTGCTACC AACAGCGACC GTACATTTCC TGCACCTGAA GGCTTGATTC CTGGGGCTGG CTCGATTGTA GCAGCCTTGG CAGCCGCTAG CGATTGCACG CCCGAAGTGA TTGGCAAACC TGAACCAGCC ATGTTCGAAG CGGCCTTGCA GTTGTTTGGA GTAACCGCCG AACAAACCTT GATGGTCGGC GATCGGTTGG ATACCGATAT TGCAGGAGCG CAGCGGGTTG GCATTGCCAC GGCCTTTGTG GGCAGCGGCG TACATAGCAT GCAACAAGCC CAAGCCTGGC AACCAGCAAT CGATTTGGTG GCTGATGATT TGGCAGGCAT TTTGGCCTTG CTCAGGGCTG GGCGGGAGTA G
|
Protein sequence | MLNLKQLKLV LLDMDGVLHR GGEILPGAAE LTTVLDRLGL GYACLTNNSS QLPATFARHL QDLGVAIAPE HVITSSTATA TLLRTRYPQG TRLLAIGMDG IQSSLFADRY FVSAETDVAA VVVGVDFNLT YARLKTATLA LRAGAAFIAT NSDRTFPAPE GLIPGAGSIV AALAAASDCT PEVIGKPEPA MFEAALQLFG VTAEQTLMVG DRLDTDIAGA QRVGIATAFV GSGVHSMQQA QAWQPAIDLV ADDLAGILAL LRAGRE
|
| |