Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cphamn1_0763 |
Symbol | |
ID | 6374428 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium phaeobacteroides BS1 |
Kingdom | Bacteria |
Replicon accession | NC_010831 |
Strand | + |
Start bp | 811774 |
End bp | 812724 |
Gene Length | 951 bp |
Protein Length | 316 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 642683270 |
Product | short chain dehydrogenase |
Protein accession | YP_001959196 |
Protein GI | 189499726 |
COG category | [I] Lipid transport and metabolism [Q] Secondary metabolites biosynthesis, transport and catabolism [R] General function prediction only |
COG ID | [COG1028] Dehydrogenases with different specificities (related to short-chain alcohol dehydrogenases) |
TIGRFAM ID | |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.920637 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.460734 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCTTTTG TTAACGCTCT TTGTTTTCAG GGCAGAACAC TGATGACTAT GGCAAAGAAA GAACAATGGG ATACCCGGTT GATGCTCGAT CAGAGCGGGA AGGTGGCGAT CGTTACCGGA GCGACGAGCG GGCTCGGGTA TGAAACTGCC AGAGCTCTTG CGGGAAAGGG AGCCAGGGTG ATCATTGCTG CGCGCGATAC AGCAAAGGGA GAAAGCGCGA AAGAAAAACT TAAAAAAGAG TATCCAGAGG CGGATGTTGC GGTTATGAAG CTCGATCTTG CTGATCTTCA GTCAGTGAGG AAGTTCAGTG ATGATTTCAG CAAACGCTAC TCCCGTCTTG ACCTGCTGAT CAACAACGCG GGGGTTATGG CTCCTCCCCA CGGAAAAACA GCGGATGGTT TCGAGCTGCA GTTCGGCACC AACCATCTCG GTCACTTTGC GTTGACAATT CTTCTGCTCG AAATGCTGAA AAAAGTGCCT GGAAGCAGGG TCGTGACGGT CAGTAGCGGT GCCCATGCGT TCGGGATGCT TGATTTTGAC GATCTTAACT GGGAAAAGCG AAAGTATAAC AAGTGGCAGG CATATGGAGA CAGTAAGCTT GCGAATCTGT ATTTTACGAG AGAGCTGCAG CGTCTTCTTG ACCAGGCCGG GGTAAACGTG TTTTCCGTCG CGGCCCATCC CGGCTGGGCG GCAACGGAAC TCCAGCGATA TCAGGGATGG CTTGTCTTGC TGAACAGTTT TTTCGCGCAG CCTCCTGGTA TGGGGGCGCT GCCGACGCTC TACGCGGCGA CAGCGCCCGA TGTGCACGGA GGGGATTTTT TCGGTCCTGA CGGTTTCGGG GAGATGCGCG GCTATCCGGT AAAAGTACAG TCAAGCAGGC GCTCACGCGA TATGGATGCT GCCCGCAAGT TATGGGAGGT TTCTGAAAAA ATGACCGGGA TCAGGTGGTA G
|
Protein sequence | MSFVNALCFQ GRTLMTMAKK EQWDTRLMLD QSGKVAIVTG ATSGLGYETA RALAGKGARV IIAARDTAKG ESAKEKLKKE YPEADVAVMK LDLADLQSVR KFSDDFSKRY SRLDLLINNA GVMAPPHGKT ADGFELQFGT NHLGHFALTI LLLEMLKKVP GSRVVTVSSG AHAFGMLDFD DLNWEKRKYN KWQAYGDSKL ANLYFTRELQ RLLDQAGVNV FSVAAHPGWA ATELQRYQGW LVLLNSFFAQ PPGMGALPTL YAATAPDVHG GDFFGPDGFG EMRGYPVKVQ SSRRSRDMDA ARKLWEVSEK MTGIRW
|
| |