Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_0688 |
Symbol | |
ID | 5898143 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | + |
Start bp | 751155 |
End bp | 752012 |
Gene Length | 858 bp |
Protein Length | 285 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 641561170 |
Product | short chain dehydrogenase |
Protein accession | YP_001682319 |
Protein GI | 167644656 |
COG category | [I] Lipid transport and metabolism [Q] Secondary metabolites biosynthesis, transport and catabolism [R] General function prediction only |
COG ID | [COG1028] Dehydrogenases with different specificities (related to short-chain alcohol dehydrogenases) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 35 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.902774 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCCTGC AGAACAAGAC CCTCTTCATC ACCGGCGCCT CGCGCGGCAT CGGCCTGGCC ATCGCCCTGC GCGCGGCGCG CGACGGGGCC AATATCGTCA TCGCCGCCAA GACCGCCGAG GCACACCCCA AGCTGCCCGG CACCATCTAC ACCGCCGCCG CCGAGATCGA GGCGGCCGGC GGCAAGGCCC TGCCGCTGGT GGTCGACGTG CGCGAGGAGG CCAGCGTCCA GTCGGCCGTC GACAAGGCGG TCGAGACGTT CGGGGGCATC GACATCTGCG TGAACAACGC CTCGGCGATC TCGCTGACCG GCACGCTGTC GACCGACATG AAGCGCTACG ACCTGATGCA CCAGATCAAC GGCCGGGGCA CGTTCCTGAC CTCGAAGCTC TGCCTGCCGC ATCTGCGCAA GGCGGCCAAC CCGCACATCC TGGCCCTGTC GCCGCCGCTG GACATCAAGC CCCGCTGGTT CGCGCCGCAC GTCGCCTATT CGATGGCCAA GTACAATATG AGCCTGTGCA TGCTGGGCAT GGCCGAGGAG TTTCGCGCTG ACGGCGTGGC CTGCAACGCC CTGTGGCCGC GCACCGCCAT CGCCACCGCC GCCATCCAGT TCGCCCTGAC CGGTGAGGAG GGCCTCAAGC ACTGTCGCAC CGTCGAGATC ATGGCCGACG CCGCCCACGC GATCTTCTGC AAGCCTTCGC GCGAGTTCAC CGGCCAGTTC GTGATCGACG ATACGTTCCT GTATGGCGAG GGCGTGCGGG ATTTCGACCA GTACCGCGTC GACCCGACCG CGACCCTGAT GCCCGACTTC TTCGTACCCG AGGACAGCGT CCCGCCGCCG GGGGTGGTGA TCGGGTAA
|
Protein sequence | MSLQNKTLFI TGASRGIGLA IALRAARDGA NIVIAAKTAE AHPKLPGTIY TAAAEIEAAG GKALPLVVDV REEASVQSAV DKAVETFGGI DICVNNASAI SLTGTLSTDM KRYDLMHQIN GRGTFLTSKL CLPHLRKAAN PHILALSPPL DIKPRWFAPH VAYSMAKYNM SLCMLGMAEE FRADGVACNA LWPRTAIATA AIQFALTGEE GLKHCRTVEI MADAAHAIFC KPSREFTGQF VIDDTFLYGE GVRDFDQYRV DPTATLMPDF FVPEDSVPPP GVVIG
|
| |