Gene Information |
Locus tag | Haur_1797 |
Symbol | |
ID | 5733699 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 2085790 |
End bp | 2086776 |
Gene Length | 987 bp |
Protein Length | 328 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641278940 |
Product | aldo/keto reductase |
Protein accession | YP_001544568 |
Protein GI | 159898321 |
COG category | [C] Energy production and conversion |
COG ID | [COG0667] Predicted oxidoreductases (related to aryl-alcohol dehydrogenases) |
TIGRFAM ID | |
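The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene ends in a single stop codon that is not counted in the protein length; the record states neither convention explicitly. The function names are hypothetical.

```python
# Values copied from the record above.
START_BP = 2085790
END_BP = 2086776


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 987
print(protein_length(length_bp))  # 328
```

Under these assumptions the results agree with the Gene Length (987 bp) and Protein Length (328 aa) fields.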
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGATTATC GTGATTTAGG TCGAACTGGC TGGCAGATTT CGACGATTGG CTTTGGGGCA TGGGCGATTG GCGGCGATGC TTGGGGCAAA ACCGACGATG CTACCTCGCT GGCCACAATT CACACAGCGA TTGATGCTGG GGTGAATTTT ATTGATACTG CTGATGTCTA TGGTGATGGC CATTCCGAGC GCCTGATTGC CCAAGTATTG CGTGAACGTC CAGGCGAGAT TGTCGTTGCA ACCAAGGCTG GTCGCCGTTT GAATCCGCAT ACAGCGGCTG GCTATAATCG CGAAAACCTA ACTGCCTTTG TTGAACGTAG TTTGCATAAT TTGAATACCG AAGCCCTCGA TTTGCTGCAA CTGCATTGCC CACCAACCGA CGTTTATTCA ATGCCCGAAG TCTTCGAGGT GCTTGATAAT TTGGTGCGAG CTGGCAAGGT GCGTTACTAC GGTGTAAGTG TCGAGCGCGT CGATGAGGCT TTGGCCGCGA TTGAATATCC CAATGTGCAA AGCGTGCAAA TTATCTTCAA TCTTTTTCGC TACAAACCAA TTGAGCAATT TTTTGCGGCG GCCAAAGCCA AGCGGGTTGG TATTCTGGCG CGGGTTCCCT TAGCCAGCGG TTTGCTATCA GGCAAAATCA GCGCTGCAAC CACGTTTGAA GCTAGCGATC ATCGCAATTA CAACCGCCAT GGCGAATCGT TTGATCAAGG TGAAACCTTC TCAGGGGTTG ATTATGCTAC CGGATTGCAA GCCGTTGAGG AATTACACGC TTTAGTGCCA ATCGATGTCA GCATGGCCCA ATTTGCCTTG CGCTGGATTT TGATGTTCGA TGCGGTGACG ACGGCAATTC CTGGGGCCAA AACGCCTGAA CAAATGCGAG CCAACGCTGC TGCTGCCGAG CTAGCCCCGC TCGATGCGGC AACCATGGCC CAAGCCCAAG CGATTTATGA TCGTTTGATT CGCCCGTTAG TGCACGAGCG CTGGTAA
|
Protein sequence | MDYRDLGRTG WQISTIGFGA WAIGGDAWGK TDDATSLATI HTAIDAGVNF IDTADVYGDG HSERLIAQVL RERPGEIVVA TKAGRRLNPH TAAGYNRENL TAFVERSLHN LNTEALDLLQ LHCPPTDVYS MPEVFEVLDN LVRAGKVRYY GVSVERVDEA LAAIEYPNVQ SVQIIFNLFR YKPIEQFFAA AKAKRVGILA RVPLASGLLS GKISAATTFE ASDHRNYNRH GESFDQGETF SGVDYATGLQ AVEELHALVP IDVSMAQFAL RWILMFDAVT TAIPGAKTPE QMRANAAAAE LAPLDAATMA QAQAIYDRLI RPLVHERW
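The relationship between the two sequence fields can be illustrated with a short translation sketch. This is an illustration only, not the database's own pipeline. It assumes the space-separated 10-base blocks are purely a display convention, and it uses the fact that NCBI translation table 11 assigns the same amino acids to codons as the standard code (the tables differ in permitted start codons). The helper names `clean`, `translate`, and `gc_percent` are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (first base varies slowest).
# Translation table 11 shares these assignments with the standard code.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the display whitespace between blocks and uppercase the bases."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon.

    Raises KeyError on codons containing characters other than A, C, G, T.
    """
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


# First 30 and last 15 bases of the gene sequence in the record.
print(translate("ATGGATTATC GTGATTTAGG TCGAACTGGC"))  # MDYRDLGRTG
print(translate("CACGAGCG CTGGTAA"))                  # HERW
```

The two fragments translate to the first ten and last four residues of the protein sequence above. Passing the full gene sequence to `gc_percent` would give a value to compare against the GC content field.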