Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_2848 |
Symbol | |
ID | 5736885 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 3612794 |
End bp | 3613729 |
Gene Length | 936 bp |
Protein Length | 311 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 641279991 |
Product | aldo/keto reductase |
Protein accession | YP_001545614 |
Protein GI | 159899367 |
COG category | [C] Energy production and conversion |
COG ID | [COG0667] Predicted oxidoreductases (related to aryl-alcohol dehydrogenases) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.0082562 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAATTAC GTCAACTCGG TAAGTCGGAA TTACGGATTC CTAAAGTAGT GATTGGGGGC AATGTATTTG GCTGGACGGT TGATCGCGCC CAAACATTTC GTTTGCTTGA TGCCTTTGTT GCTGCAGGGC TTAACACAAT TGATACTGCC GATGTTTATT CGGCATGGGT CGAAGGTAAC CAAGGTGGCG AATCGGAAAC GCTGATTGGA GAATGGATTA AGCAACGCCA GCGTCGCGAT GATTTGATTA TTCTGAGCAA AGTTGGAGCT GGCACAGGTT TAGCCAAGGC GCATATTGCC CAGGCAATCG ATGCCTCATT ACAACGGCTT CAAACTGACT ATCTTGATTT ATATCAAGCG CATGTTGATG ATGCCAATAC TCCGTTAGAT CAAACGCTTA GCGCATTTGC CGAGCTAATC CAACAAGGCA AAATACGCGT GATTGGGGCA TCGAACTATA GCGCTGAACG ACTACAAGCT GCCTTAGCAA TCAGCCAAGA GCTGAATATA CCACGCTATG AAAGCCTTCA ACCGCTATAC AACCTATACG ATCGTGAAGC ATACGAAGCA CAGTTAGAAA CAGTTTGCCA AGAAGCAGAT TTGGGTGTTA TTCCCTATTC AACCTTAGCA TCAGGCTTTT TAACCGGCAA ATACCGTAGC GAAGCCGATT TGAGCCAAAG CGCCCGTGGT TCGCGAGTGC GGAGCTATCT CAATCCACGT GGCTGGATTA TTCTTAAGGC GCTTGATGAG GTGGCGGCTG AAGTTCAAGC GACACCTGGC CAAGTTGCCT TAGCGTGGCA AATTGCCCGT CCTAGCATCA CCGCGCCAAT TGCTAGTGCC ACCAACCTCG ATCAAGTCAA CGATCTGATT AAGGCGGCCC AACTTGAATT AACCTCAAGC ATGATTGATT ATTTAAACCA AGCTAGCCAA GCCTAA
|
Protein sequence | MQLRQLGKSE LRIPKVVIGG NVFGWTVDRA QTFRLLDAFV AAGLNTIDTA DVYSAWVEGN QGGESETLIG EWIKQRQRRD DLIILSKVGA GTGLAKAHIA QAIDASLQRL QTDYLDLYQA HVDDANTPLD QTLSAFAELI QQGKIRVIGA SNYSAERLQA ALAISQELNI PRYESLQPLY NLYDREAYEA QLETVCQEAD LGVIPYSTLA SGFLTGKYRS EADLSQSARG SRVRSYLNPR GWIILKALDE VAAEVQATPG QVALAWQIAR PSITAPIASA TNLDQVNDLI KAAQLELTSS MIDYLNQASQ A
|
| |