Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_3319 |
Symbol | |
ID | 5900774 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | - |
Start bp | 3599864 |
End bp | 3600784 |
Gene Length | 921 bp |
Protein Length | 306 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 641563825 |
Product | histone deacetylase superfamily protein |
Protein accession | YP_001684944 |
Protein GI | 167647281 |
COG category | [B] Chromatin structure and dynamics [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG0123] Deacetylases, including yeast histone deacetylase and acetoin utilization protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 35 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.804262 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGATGTCG CGCTCTATAC CCATCTGGAT ATGCTCGACC ATCGGCCCGG CGACAACCAC GCCGAACGGC CCGAGCGCCT GCGGGCGGTG ATCGACGCCT TGCAGGACGA CGCCTGCCTG GATCTGGAAA GCGTCGAGGC CCCGCTGATC GAGCTTGCCG ACCTGGCCCG CGTGCATTCC CAGGGCTTCA TCGACGCGAT CCTGGCCGCC GCGCCCAGCG CCGGCCGCCA CGCCCTGGAC CCGGACACGG TGCTGTCGAC CGGCAGCCTG ATCGCCGCGC GCCGGGCCGC CGGGGCGGTG GCCGCCGCGA CCCGGGCGGT GGCGAGCGGG CAGGGGACAC GGGCGTTCTG CGCCGTGCGG CCGCCGGGCC ACCATGCCGA GCCGGGCGTG GCCATGGGTT TTTGCGTGTT CTCCAACATC GCCGTGGCCG CCCGGGTGGC CCAGGCCTCG GGGTTGAAGC GCGTGGCGAT CGTCGACTTC GACGTCCACC ACGGCAACGG CACCCAGGCG GCGTTCGAGC ACGACGCTTC GGTGTTCTTC GCCTCGATCC ACCAGTCGCC GCTCTATCCG GGCACCGGCG ATCCGTCGGA GACCGGAGTC GGCAATGTCG CCAACGCCAC GGTCGCGCCC CACGCCCCTC GCGAGGTCTG GCGCAAGGGA TTCGAAGGGC TGATGGACCG GGTCGACGGA TTCGCGCCGG ACCTGATCCT GGTCTCGGCC GGATTCGACG CCCACGTCCG CGATCCCCTG GCCGCCCAGA GCCTGGAGGC CGAGGATTTC GCCTGGGCGA CGCGGGCGAT CGCCTCGGTG GCGAACCGCC GTTGCGGCGG GCGTATTGTT TCCTCGCTCG AGGGCGGCTA CGACCTAGAA GCGCTCGGCC GCTCGGCGGC GGCGCATGTC AAAGCGTTGC AGGAGGGGTG A
|
Protein sequence | MDVALYTHLD MLDHRPGDNH AERPERLRAV IDALQDDACL DLESVEAPLI ELADLARVHS QGFIDAILAA APSAGRHALD PDTVLSTGSL IAARRAAGAV AAATRAVASG QGTRAFCAVR PPGHHAEPGV AMGFCVFSNI AVAARVAQAS GLKRVAIVDF DVHHGNGTQA AFEHDASVFF ASIHQSPLYP GTGDPSETGV GNVANATVAP HAPREVWRKG FEGLMDRVDG FAPDLILVSA GFDAHVRDPL AAQSLEAEDF AWATRAIASV ANRRCGGRIV SSLEGGYDLE ALGRSAAAHV KALQEG
|
| |