Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_0341 |
Symbol | |
ID | 5732251 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 407333 |
End bp | 408223 |
Gene Length | 891 bp |
Protein Length | 296 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641277465 |
Product | aldo/keto reductase |
Protein accession | YP_001543121 |
Protein GI | 159896874 |
COG category | [C] Energy production and conversion |
COG ID | [COG0667] Predicted oxidoreductases (related to aryl-alcohol dehydrogenases) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.00000752262 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACTTTC CGTTGCTGCG CGAATTTGGC TCAACTGGTT TGCAGGTTTC AGCGCTAGGC TTTGGCGCTG GCCATATTGG TGGAAACGAA TTAACTGAAG CTGAAGCAGC CAAGTTGCTG CATGGCTTGC TCGATTTAGG GATTAACTTA ATTGATACAG CACGCGGCTA TGGCCTTTCC GAAGAGCGGA TCGGGCGACA TCTCCAGCAA CGTCGTCATG AGTTTGTGCT TTCAACCAAA ATTGGCTATG GCATCGAGGG CTACGACGAT TGGACCAGCC CAATCATCAC TGCCGGAATT GATGCAGCGC TGCAACGCAT GCACACCGAT TATTTGGATA TTGTGCATTT TCATTCTTGC CCGGTCGAGA CCTTACGAGC AGGCGAGGTG ATTGAGGCAC TTGAGCGGGC GGTTCAAGCA GGTAAAGTAC GGGTTGCCGC CTATTCTGGC GATAATGCAC CCTTAGCTTG GGCGGTGCAG TCAGGCCATT TTGGTGCGAT TGAATGCTCG GTGAACCTCG CCGATCAACG GGTGATCAGC CAAGTATTGC CCCAAACCCA ACAACGCCAA ATTGGTGTAA TTGCCAAACG CCCAGTGGCC AACGTCGCGT GGCGTTTTGC CCAACGGCCA GTTGGCGATT ATGCTGAAGT GTATTGGGAA CGACTCCAAG CCATGCAGCT TGATTTCGAC CCTGAACGAC TTTTGGATAT TGCCTTGCGG TTTACCGCCT ATACCCCAGG TGTCCATAGC TGTATTGTTG GTTCGCGCAG CCTCGAACAT ATGCAGCATA ATTTAGCATT ATTGCAACAA GGCCCACTTG AACCCCAACT CTATGCCTAT CTCTCCAGCC AATTTCAGAC GAACGATCAA GGTTGGGTTG GTCAGATCTA A
|
Protein sequence | MNFPLLREFG STGLQVSALG FGAGHIGGNE LTEAEAAKLL HGLLDLGINL IDTARGYGLS EERIGRHLQQ RRHEFVLSTK IGYGIEGYDD WTSPIITAGI DAALQRMHTD YLDIVHFHSC PVETLRAGEV IEALERAVQA GKVRVAAYSG DNAPLAWAVQ SGHFGAIECS VNLADQRVIS QVLPQTQQRQ IGVIAKRPVA NVAWRFAQRP VGDYAEVYWE RLQAMQLDFD PERLLDIALR FTAYTPGVHS CIVGSRSLEH MQHNLALLQQ GPLEPQLYAY LSSQFQTNDQ GWVGQI
|
| |