Gene Information |
Locus tag | Haur_2432 |
Symbol | |
ID | 5734313 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 3116400 |
End bp | 3117392 |
Gene Length | 993 bp |
Protein Length | 330 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 641279573 |
Product | aldo/keto reductase |
Protein accession | YP_001545200 |
Protein GI | 159898953 |
COG category | [C] Energy production and conversion |
COG ID | [COG0667] Predicted oxidoreductases (related to aryl-alcohol dehydrogenases) |
TIGRFAM ID | |
|
|
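The coordinate and length fields in the Gene Information table can be cross-checked against each other. The sketch below is illustrative only and is not part of the original record. It assumes that Start bp and End bp are 1-based inclusive positions on the replicon, and that the CDS is unspliced and ends in a stop codon. The function names are hypothetical.

```python
# Values copied from the Gene Information table.
START_BP = 3116400
END_BP = 3117392


def gene_length(start_bp, end_bp):
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # prints: 993 330
```

Both results agree with the Gene Length (993 bp) and Protein Length (330 aa) fields.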
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.060631 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATTTATC AAGCCAAGGA TAGCCGCTAC ACCAGCATGC GCTACAATCG CAGTGGCAAA AGTGGCCTGA AATTATCAGC AGTTTCATTA GGGTTATGGC ACAATTTTGG TGGCGTTGAT GTCTATGAGA ATGGGCGGTC GATGGTGCTG CATGCCTTTG ATCATGGCAT TACTCACTTC GACCTCGCCA ATAACTATGG CCCGCCGCCA GGCTCTGCCG AAGAAAATTT TGGCCGGATG CTCCAGCATG ATCTTAAACC CTACCGTGAT GAATTGGTGA TTTCGACCAA GGCTGGCTAC TATATGTGGG CTGGCCCGTA TGGCGATTGG GGTTCGCGTA AGTATCTGGT TTCCAGCCTT GATCAAAGCT TAAAGCGCAT GGGCTTGGAG TACGTTGATA TTTTCTATCA TCATCGCCCC GACCCCGAAA CGCCGCTCGA AGAAACCATG CAAGCCCTCG ATCAGATTGT ACGTAGTGGT AAGGCGCTCT ATGTGGGGCT TTCCAATTAT TCTGCTGAGC AAACTGCCGC CGCGAGCGCG ATTCTGCGCC AACTTGGCAC ACCATGTTTG ATCCACCAAC CATCCTACAA TATGTTCAAT CGCTGGGTTG AGGGTGGTTT GTTGGCAACC TTGGCCGAGC AAGGCATTGG CTGTATTGCT TTTTCGCCGC TGGCCCAAGG CCTGTTAACT GATCGCTATT TGCAAGGCAT TCCAGGCGAT TCGCGTGCCG CCAAATCGCA TGGCTTTCTC AAGCCTGCCC ATATTACCGA TAGTGCTTTA GCTAAAGTTG CTCAGTTGAA TGATTTGGCG CAAGCTCGCG ACCAAACTTT AGCCCAAATG GCCTTGGCTT GGGTGTTGCG CCACCCAACT ATGACCTCAG TGTTAATCGG AGCTAGCCGT ATTTCGCAGA TCGACGATGC CATTACAGCC CTCAACAATT TGACATTTAG TGATGAAGAA CTTGCTACGA TTGAAACCGT TCTAAGCGAG TGA
|
Protein sequence | MIYQAKDSRY TSMRYNRSGK SGLKLSAVSL GLWHNFGGVD VYENGRSMVL HAFDHGITHF DLANNYGPPP GSAEENFGRM LQHDLKPYRD ELVISTKAGY YMWAGPYGDW GSRKYLVSSL DQSLKRMGLE YVDIFYHHRP DPETPLEETM QALDQIVRSG KALYVGLSNY SAEQTAAASA ILRQLGTPCL IHQPSYNMFN RWVEGGLLAT LAEQGIGCIA FSPLAQGLLT DRYLQGIPGD SRAAKSHGFL KPAHITDSAL AKVAQLNDLA QARDQTLAQM ALAWVLRHPT MTSVLIGASR ISQIDDAITA LNNLTFSDEE LATIETVLSE
|
| |
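The gene sequence is listed in coding orientation (it begins with ATG and ends with TGA) even though the gene lies on the minus strand, so it can be translated directly without reverse-complementing. The sketch below is illustrative only and is not part of the original record. It builds the codon table by hand rather than relying on a library. Translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs only in which start codons are permitted, so the standard assignments are used here. The helper names `clean`, `translate` and `gc_percent` are hypothetical.

```python
from itertools import product

# Standard codon assignments in TCAG order (shared by translation table 11).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the whitespace that splits the record's sequence into blocks of 10."""
    return "".join(seq.split()).upper()


def translate(seq):
    """Translate a coding-orientation DNA sequence, stopping at the first stop codon."""
    dna = clean(seq)
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_percent(seq):
    """Percentage of G and C bases in the sequence."""
    dna = clean(seq)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three 10-base blocks of the Gene sequence field.
print(translate("ATGATTTATC AAGCCAAGGA TAGCCGCTAC"))  # prints: MIYQAKDSRY
```

The printed residues match the first block of the Protein sequence field. Passing the full Gene sequence field to `translate` and `gc_percent` would allow the Protein sequence and GC content fields to be checked in the same way.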