Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_4649 |
Symbol | |
ID | 5902111 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | + |
Start bp | 5025362 |
End bp | 5026147 |
Gene Length | 786 bp |
Protein Length | 261 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 641565168 |
Product | short chain dehydrogenase |
Protein accession | YP_001686267 |
Protein GI | 167648604 |
COG category | [I] Lipid transport and metabolism [Q] Secondary metabolites biosynthesis, transport and catabolism [R] General function prediction only |
COG ID | [COG1028] Dehydrogenases with different specificities (related to short-chain alcohol dehydrogenases) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 0.21279 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGCAGA GCACGGGCCG GGTCGCCGGC AAGAAGGCCT TCATCACCGG CGGGGCCCAG GGACTGGGCG CGGCGACAGC GCGGCTGCTG GCCGAGCACG GCGCCAAGGT CACGGTGGCC GACATCAACT TCGCCGGGGC CAAGGCCGTG GCGGACGAGC TGAACGCCGC CCACGGCGCT GGCACGGCCT TCGCCTTCGA GCTGGACGTC ACTCAGGAAG ACCAGTGGAT CGAGGTGCTG GAAAAGGCCG CCGAGGCGAT GGGCGGCCTG TCGGTGCTGG TCAACAACGC CGGTATCGGC GGCGACGGTC CGATCGAGAC GCTGGACTTC GGCCTCTGGA AGAAGGTGAT GTCGGTCAAT GTCGACTCGG TGTTCCTGGG CGCCAAGCAC GCCCTGACCC ACATGCGCGC CCACCAGCCG GGCTCGATCG TGAACCTCTC CTCGATCGCC GGCCTGATCG CCAACGGCAA TTCCCCAGCC TACAACGCCT CGAAGGCGGC GGTGTGGCTG CTCAGCAAGA ACATCGCCCT CTACTGCGCC AAGCAGGGGC TGAACATCCG CTCCAACTCG ATCCACCCGA CCTTCATCGA CACCCCGATC CTTGATGGGT TCACCGCGCG GTTCGGCAAG GAGGAGGCTC ACGCCAAGCT GGCGCGGCAG GTCCCCATGG GCCGCATCGG CGAGCCGCTG GATATCGCCA ACGCGGTGCT CTACCTGGCC AGCGACGAGA GCAAGTTCAT GACCGGCGCC GAGATCAAGG TCGACGGCGG CATTTCGGCG ATGTGA
|
Protein sequence | MAQSTGRVAG KKAFITGGAQ GLGAATARLL AEHGAKVTVA DINFAGAKAV ADELNAAHGA GTAFAFELDV TQEDQWIEVL EKAAEAMGGL SVLVNNAGIG GDGPIETLDF GLWKKVMSVN VDSVFLGAKH ALTHMRAHQP GSIVNLSSIA GLIANGNSPA YNASKAAVWL LSKNIALYCA KQGLNIRSNS IHPTFIDTPI LDGFTARFGK EEAHAKLARQ VPMGRIGEPL DIANAVLYLA SDESKFMTGA EIKVDGGISA M
|
| |