Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_3982 |
Symbol | |
ID | 5735843 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 5081069 |
End bp | 5081908 |
Gene Length | 840 bp |
Protein Length | 279 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641281132 |
Product | short-chain dehydrogenase/reductase SDR |
Protein accession | YP_001546742 |
Protein GI | 159900495 |
COG category | [I] Lipid transport and metabolism [Q] Secondary metabolites biosynthesis, transport and catabolism [R] General function prediction only |
COG ID | [COG1028] Dehydrogenases with different specificities (related to short-chain alcohol dehydrogenases) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAATTGC TCGATAAAGT AATTGTCATT ACTGGTGGCA GCCGTGGCTT GGGCTTGGCG ATGGCCGAAG CAATGCTCAG CCAAGGAGCC AAAGTTGTGA TCGCTGGCCG CGATCAAGCT AGCCTTGAAC AAGCCCTTGC TCAACTCAAA CGCCAATCGA GCCATGTGCT GGCAACAACT TGCGATGTTG GCGATTTGGC AGCAATCGAA GCCTTGCGCG ATCAAACTAT CGCCCAGTTT GGCAAACTCG ATGTCTGGGT CAACAATGCT GGCGTGGCTG GGCCTTATGG CGCGACTGTG GCAATTCATC CCCGCGATTA TCGGCGTGTG ATCGACACCA ATATTTTTGG CACGTACCAT GGCTCAATTA CTGCACTCAA ATATTTCCAA CAACAAGGCC ATGGCAAATT GATCAATCTG TTTGGGCGTG GTGATACTGG GCCTGTGCCG TTTCAAACCG CCTATGGCGC TAGCAAAAGC TGGGTGCGCA ACTTTACCTT GGCCCTCGCC AAAGAACATC GTAACCAAGG CATCGAAATT TTGGGCTTTA ACCCAGGCTT GATGACCACC GATATGCTGA CCGATGTGCA AGTGATGGCG GGCTACGAGT CCAAACTTGA AGCCCTCAGT ACCATTTTGC GCATGTGGGG CAATCCGCCT AGTGTGCCGG CGCAAAAAGC GGTTTGGCTT GCATCGAAGG CCACCGATGG CCGCAATGGC TTGAGCATCA GCCTACTTTC GCCACCACGC TTGTTGCTGG GCGCACTCAA AGAAGCAATT CGGCGTGTAC GCGGCAAGCC TGCTCCAGCA TTACCAATCA AACTGACCAC CATCGATTAA
|
Protein sequence | MKLLDKVIVI TGGSRGLGLA MAEAMLSQGA KVVIAGRDQA SLEQALAQLK RQSSHVLATT CDVGDLAAIE ALRDQTIAQF GKLDVWVNNA GVAGPYGATV AIHPRDYRRV IDTNIFGTYH GSITALKYFQ QQGHGKLINL FGRGDTGPVP FQTAYGASKS WVRNFTLALA KEHRNQGIEI LGFNPGLMTT DMLTDVQVMA GYESKLEALS TILRMWGNPP SVPAQKAVWL ASKATDGRNG LSISLLSPPR LLLGALKEAI RRVRGKPAPA LPIKLTTID
|
| |