Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cwoe_3977 |
Symbol | |
ID | 8734435 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Conexibacter woesei DSM 14684 |
Kingdom | Bacteria |
Replicon accession | NC_013739 |
Strand | + |
Start bp | 4218779 |
End bp | 4219609 |
Gene Length | 831 bp |
Protein Length | 276 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 646504602 |
Product | short-chain dehydrogenase/reductase SDR |
Protein accession | YP_003395769 |
Protein GI | 284045429 |
COG category | [I] Lipid transport and metabolism [Q] Secondary metabolites biosynthesis, transport and catabolism [R] General function prediction only |
COG ID | [COG1028] Dehydrogenases with different specificities (related to short-chain alcohol dehydrogenases) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.00568978 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCACGG AGAGCCAGAA GCTCGAGGGC AAGGTCGCGA TCATCACCGG GGCGGGCCAG GGGATCGGGT TGGCGATCGC GCGCCGCTAC GCGCGGGAGG GCGCCAAGGT GGTGCTCGCC GACATCAACG CCGAGCGCGT GGAAGCCGCC GCGGACGCGA TCGCGTCCGA GGGCTACGAT GCGCTTGCCG TCCCGACCGA CGTCGCCAGC AGCGACGAGG TCGACCGGCT CTTCGACAGG ACGCTCGAGA CGTTCGACGA CGTCGACATC ATGGTCAACA ACGCCGCCTA CACGGCCGAT GCGATCCGCC ACGTGCTGGA GGCCGACGAG GCATGGTGGG ACCGCATGAT CGACGTCAAC CTCAAGGGCC ACTTCCTCTG CTCGCTGCGG GCCGCGCGCA TCATGGCGCC CAAGCGCTCC GGCGTGATCA TCGCGACGTC GAGCGGCGGC GCGACGAAGT CACATCGCGG GATGGTGCCC TACGACGCGT CCAAGGGCGG GATCGAGGCG CTCGCGCGGG CGCTCGCGCT CGACCTCGCG CCGTACGGCA TCCGCGTCGT CACGCTCGTC CCCGGGCTGA TCGCACCGAA CCGCACCGAC GTGCCGCAGG AGCTGCTCGA CGCCACCGAC GCGACGGTGC CGCTCCAGCG CGCCGGCCTG CCCGAGGACC TCGCCGGTCC CGCCGTCTTC CTCGCATCCG ACGACGCGGC GTACGTGACG GGCGCCAAGC TCGTGGTCGA CGGCGGCGTG CTCGCGCAGC AGCGCTCGCC GCAGGTCGAG CGCTTCCCCG TCTCGAGCTT TCCGGTCGTT CCGGAGCGCG AAGCGGCGTG A
|
Protein sequence | MSTESQKLEG KVAIITGAGQ GIGLAIARRY AREGAKVVLA DINAERVEAA ADAIASEGYD ALAVPTDVAS SDEVDRLFDR TLETFDDVDI MVNNAAYTAD AIRHVLEADE AWWDRMIDVN LKGHFLCSLR AARIMAPKRS GVIIATSSGG ATKSHRGMVP YDASKGGIEA LARALALDLA PYGIRVVTLV PGLIAPNRTD VPQELLDATD ATVPLQRAGL PEDLAGPAVF LASDDAAYVT GAKLVVDGGV LAQQRSPQVE RFPVSSFPVV PEREAA
|
| |