Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_0971 |
Symbol | |
ID | 5732857 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 1113703 |
End bp | 1114557 |
Gene Length | 855 bp |
Protein Length | 284 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641278103 |
Product | short-chain dehydrogenase/reductase SDR |
Protein accession | YP_001543747 |
Protein GI | 159897500 |
COG category | [I] Lipid transport and metabolism [Q] Secondary metabolites biosynthesis, transport and catabolism [R] General function prediction only |
COG ID | [COG1028] Dehydrogenases with different specificities (related to short-chain alcohol dehydrogenases) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.374881 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTTCGTT CAGATCTGCT CAAAGACAAA GTGATCATCA TCACTGGTGG GGGTTCGGGC TTGGGGCGTT CAATGGGTGA GCGCTTCCTG GAGTTGGGGG CAAAGTTGGC AATTACCAGC CGCAATGCCG AAAAATTAAG CACTGTGGCC AACGAAATGA TGGCAGCCAA AGGCGGCGAG GTCTTTACCG TGCCCTGCGA TGTGCGCGAC CCCGAAGCAG TTGACCAGAT GATCGAGGCC GTTTGGAACC ATTTCGGGAC AGTCGATATT TTGGTCAACA ATGCTGCTGG CAACTTTATC AGCCCAACCG AGAAGCTTTC ACATCGCGCA GTTGATGCTG TTTTGGGGAT TGTGCTGCAT GGCACGTTCT ACTGCACCTT GGCTTTAGGC AAAAAGTGGA TCGAAGCTGG ACGTGGTGGT CAATGTTTGA ACATCGTTAC AACCTATGCT TGGTCGGGTA GTGGCTTTGT TGTGCCATCG GCGGCAGCCA AAGCTGGGGT TTTGGCCCTG ACCCGCTCGT TGGCGGTCGA ATGGGCACGC TATGGTATTC GCATGAATGC AATCGCACCT GGCCCCTTCC CCACTCAAGG CGCGTGGGAA CGACTTGCCC CAACCCCCGA ATTGGCCGAA CAAGCACTCA ATCGCGTGCC TTTACGCCGT GTTGGCGAAC ATATTGAGCT GGCCAATTTG GCCGCCTATA TGTTGGCCGA CGAAGCAGGC TATATCAACG GCGAATGCAT CACGATTGAT GGCGGCGAGT GGCTGTATGG CGCTGGCCAA TTCTCAGGGC TTGATCGTTT GCCCAACGAA ATGTGGGATA TGCTCTCCAA AATGACCAAG AAAAGCGGCA GCTAA
|
Protein sequence | MFRSDLLKDK VIIITGGGSG LGRSMGERFL ELGAKLAITS RNAEKLSTVA NEMMAAKGGE VFTVPCDVRD PEAVDQMIEA VWNHFGTVDI LVNNAAGNFI SPTEKLSHRA VDAVLGIVLH GTFYCTLALG KKWIEAGRGG QCLNIVTTYA WSGSGFVVPS AAAKAGVLAL TRSLAVEWAR YGIRMNAIAP GPFPTQGAWE RLAPTPELAE QALNRVPLRR VGEHIELANL AAYMLADEAG YINGECITID GGEWLYGAGQ FSGLDRLPNE MWDMLSKMTK KSGS
|
| |