Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_3226 |
Symbol | |
ID | 5735094 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 4083664 |
End bp | 4084875 |
Gene Length | 1212 bp |
Protein Length | 403 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641280372 |
Product | 2-alkenal reductase |
Protein accession | YP_001545991 |
Protein GI | 159899744 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGCAAC GCAGCATGCG TATTTTGATT CCAGCATTTC TTATTCCACT AGGCCTTTGT ATTGTGGCAA GCCTGTTGAT TTGGATCTTC GCCCCAGCTA ACGATGCTGC TGGAGAAAGC TCAAATAACG ATTCAACTCT TTCAGTTGCG CCAACCACAG CTCCAGTTAG CGCTTTGAGC GAAGCACCGG GCAATGGCTT GAGCGACGAA CAAGCGCGGA TCGATTTGTA TAAGCGAGTT GGCCCAGCGG TTGTCAGCAT CGATACCGAA GTTACCGGTG AAGGTAGCGA GGCGGCAACT GGCGAGGCGC TTGGCTCAGG CTTTTTGGTC GATGATCAAG GCCATATTGC TACCAACAAC CATGTAATCG AAGGTGCAAC CCGTATTTTC GTCACTTTTG CTGATGGCCG CCAAGTGCCA GCCACTCTGC GCGGAACCGA TGAAGATAAC GATATTGCGG TGATTAAGGT TGATGCCGCA GCCGTGAGCA AGATTGCGCC GATGGTATTT GGCAACTCAC GCGAAGTTCA AGTTGGTCAA GATACGATTG CAATTGGCAA TCCGTTTGGC TTGCAAAATA CCATGACTTT GGGGATTGTG AGCGCGGTCG AAGGGCGTTC GTTGCCAGGC CGTACCCTTG CTAATGGTGG TCAATTTCGC ATCAGCCGAA TTATTCAGAC CGATGCGGCG ATCAACCCAG GCAATAGCGG TGGGCCATTG CTCAATAGTA AAGGCGAAGT GATTGGGATT AACACCGCGA TTCGGGTGAG TGACCCAACT GCTGCACCAG CCTTTGCCGG AGTTGGTTAT GCCGTGCCCG CCAACACGGT CAAAGTGATC GTCGAAGATT TGATTAAAAC TGGCAAGCAC GATTCAGCCT ATCTTGGCGT GAGCATGCTA ACGATCAGCG CTCAATTGGC GCAAGAATTG AAATTACCAG TCAGCCAAGG CGCGTTGGTG ACCAATGTAG TCGTTGATGG TCCCGCCGAT CAAGCTGGAA TTCGCTTGGG TACAACCAGT ATCGAAGTTG ATGGTGCTGC ATTAATTATT GATAGCGACA TCGTGACTGC CTTCAACGGC GAAACGATTC GCTCCAGTGA CGATTTGATT GCCCATATCA ACGATTCGCG AGTTGGCGAT AAAGTCATTT TGACGATCAT ACGGGGTGAT GAAGAGCAAA CGATCGAGGT AACCTTGGGG GCACGCCCTT AA
|
Protein sequence | MKQRSMRILI PAFLIPLGLC IVASLLIWIF APANDAAGES SNNDSTLSVA PTTAPVSALS EAPGNGLSDE QARIDLYKRV GPAVVSIDTE VTGEGSEAAT GEALGSGFLV DDQGHIATNN HVIEGATRIF VTFADGRQVP ATLRGTDEDN DIAVIKVDAA AVSKIAPMVF GNSREVQVGQ DTIAIGNPFG LQNTMTLGIV SAVEGRSLPG RTLANGGQFR ISRIIQTDAA INPGNSGGPL LNSKGEVIGI NTAIRVSDPT AAPAFAGVGY AVPANTVKVI VEDLIKTGKH DSAYLGVSML TISAQLAQEL KLPVSQGALV TNVVVDGPAD QAGIRLGTTS IEVDGAALII DSDIVTAFNG ETIRSSDDLI AHINDSRVGD KVILTIIRGD EEQTIEVTLG ARP
|
| |