Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_1172 |
Symbol | |
ID | 5733065 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 1345674 |
End bp | 1346669 |
Gene Length | 996 bp |
Protein Length | 331 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641278312 |
Product | aldo/keto reductase |
Protein accession | YP_001543948 |
Protein GI | 159897701 |
COG category | [C] Energy production and conversion |
COG ID | [COG0667] Predicted oxidoreductases (related to aryl-alcohol dehydrogenases) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.0170373 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAACAAC GAACGCTTGG CACGACTGGC TTGGTTGTCT CGGAAATTGG CTTGGGCTGT ATGGGCATGA GCCAAAGCTA TGGCCCTGGC GGCGATAAAC AATCAGCAAT TAACTTAATC CATACAGCGG TTGAACGCGG CGTAACCTTC TTCGATACCG CCGAAGTCTA TGGCCCCTAT ATCAACGAGG AATTGGTTGG CGAGGCGCTT GAGCCATTTC GCAACCACGT CGTGATTGCT ACCAAGTTTG GTTTTAACTT ACACCCCGAT GGCAAACCAG GCTGGTCGGG AGTCAACAGC CATCCTGATC AGATTAAACG GGTGGCGGAA GCATCGCTCA AACGCCTACG GATCGAAGCG ATCGATTTGT TCTATCAACA TCGCGTCGAT CCGAATGTGC CAATTGAAGA AGTTGCGGGT GCAGTCAAGG ATCTCATTCA GCAAGGCAAA GTTAAGCATT TTGGGCTTTC TGAGGCAGGC GCACAAACGA TTCGCCGTGC CCACGCGGTT CAGCCTGTCG CCGCCTTGCA AAGCGAATAT TCCTTGTGGA CCCGCGAACC AGAAGCTGAA ATTATGCCGA CGCTGGCAGA ATTGGGGATT GGCTTTGTGC CATTCAGCCC ACTAGGCAAG GGCTTTTTGA CTGGTAAAAT CGACCAAAGC ACCGTCTTTG CCCAAGGCGA TATTCGTAAT CGAATTCCAC GATTTTCGCC TGAAGCCCTG CAAGCCAACC AAGCCTTGAT CGATTTGCTC GAAGCGATTG CCGCGCAAAA ACAGGCCACT ACTGCGCAAA TTGCCTTGGC ATGGTTACTA GCGCAAAAAC CATGGATCGT GCCGATTCCA GGCACGCGCC GCGTAGAACG CTTGGAAGAA AATCTGGGCG CGGCAGCAAT TCGCCTAAAC GAGGCTGATT TACAAGCAAT CGAGCAAGCC GCCGCTTCAG TGAACATCCA AGGCGCACGC TACCCCGAAG ATTTAGAAAA AATGACTGGT TTGTAG
|
Protein sequence | MQQRTLGTTG LVVSEIGLGC MGMSQSYGPG GDKQSAINLI HTAVERGVTF FDTAEVYGPY INEELVGEAL EPFRNHVVIA TKFGFNLHPD GKPGWSGVNS HPDQIKRVAE ASLKRLRIEA IDLFYQHRVD PNVPIEEVAG AVKDLIQQGK VKHFGLSEAG AQTIRRAHAV QPVAALQSEY SLWTREPEAE IMPTLAELGI GFVPFSPLGK GFLTGKIDQS TVFAQGDIRN RIPRFSPEAL QANQALIDLL EAIAAQKQAT TAQIALAWLL AQKPWIVPIP GTRRVERLEE NLGAAAIRLN EADLQAIEQA AASVNIQGAR YPEDLEKMTG L
|
| |