Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_5203 |
Symbol | |
ID | 5737161 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009973 |
Strand | - |
Start bp | 291859 |
End bp | 292728 |
Gene Length | 870 bp |
Protein Length | 289 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 641282367 |
Product | short-chain dehydrogenase/reductase SDR |
Protein accession | YP_001547958 |
Protein GI | 159901712 |
COG category | [I] Lipid transport and metabolism [Q] Secondary metabolites biosynthesis, transport and catabolism [R] General function prediction only |
COG ID | [COG1028] Dehydrogenases with different specificities (related to short-chain alcohol dehydrogenases) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCGTCGT CTCCTGCTTC CCGCATCATG CTCGTGACAG GAGCCAACAG TGGCTTAGGA CGCGCCACGG CGCTCGGCTT GGCATGCGAC GGTGCGACGG TGATTATGAT TTGTCGTGAT GCAGCACTGG GGGCCGCCGC GCAAGCCGCC ATTATCCGCG AGAGTGGCAA TCCGGCGGTT GATTTAATCA TCGCGGATTT ATCCTCACAG CATGCGATTC GACAGCTGGC CCAGACCGTG CTCGCCCGCT ACCCGCAGCT GCATGGACTG ATTAATAATG TTGGAGCATC GTTTCCGACG CGTCGCGTCA CGGTGGATGG CATCGAGTCT TCGCTGGCGA TCAATCACCT AGCGGCCTTT TTACTCACCA ATCTCCTGTG TGATCGGCTC ATTGCGAGCG CCCCTGCTCG TATCATTAAC GTTGGAACAC GGATTACAAC GCGGATGGAT TTCGATGATC TCCAGTTTGA AAAACGACCA TATCGAGCCC TGGCGGCCTA TAGTCAGACG AAATTGGGCA CCATCCATTT TACCTATGAA CTGGCTCGCC GCCTTGCGGG AACCGGAGTC ACGGTCAATT GCGTGCATCC CGGGGTTTTT AAATCGCGTT TAGGCCAAGA CGACGGTCGT CAATCATGGT TTTTTCGGAT GCTTGGCCTC CTTGGCCAGT ATGTCCTCCC TGATGCTGCC CAAGCCGCCA AACAGATAGT GTATCTCGCC ACATCGCCAG CAGTTGCGGA CATCACGGGT TCCTATTTCG CCGCGATGCG CCCCATTTCA TCACCACCAC AGACCTATGA TCGCGCGGCC AATGCCCGGC TCTGGGATCT GAGTGCGACC TTAACCAAGC TCGACACCGA GGGGGAGTGA
|
Protein sequence | MSSSPASRIM LVTGANSGLG RATALGLACD GATVIMICRD AALGAAAQAA IIRESGNPAV DLIIADLSSQ HAIRQLAQTV LARYPQLHGL INNVGASFPT RRVTVDGIES SLAINHLAAF LLTNLLCDRL IASAPARIIN VGTRITTRMD FDDLQFEKRP YRALAAYSQT KLGTIHFTYE LARRLAGTGV TVNCVHPGVF KSRLGQDDGR QSWFFRMLGL LGQYVLPDAA QAAKQIVYLA TSPAVADITG SYFAAMRPIS SPPQTYDRAA NARLWDLSAT LTKLDTEGE
|
| |