Gene Caul_3319 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_3319 
Symbol 
ID5900774 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp3599864 
End bp3600784 
Gene Length921 bp 
Protein Length306 aa 
Translation table11 
GC content72% 
IMG OID641563825 
Producthistone deacetylase superfamily protein 
Protein accessionYP_001684944 
Protein GI167647281 
COG category[B] Chromatin structure and dynamics
[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG0123] Deacetylases, including yeast histone deacetylase and acetoin utilization protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones35 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.804262 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATGTCG CGCTCTATAC CCATCTGGAT ATGCTCGACC ATCGGCCCGG CGACAACCAC 
GCCGAACGGC CCGAGCGCCT GCGGGCGGTG ATCGACGCCT TGCAGGACGA CGCCTGCCTG
GATCTGGAAA GCGTCGAGGC CCCGCTGATC GAGCTTGCCG ACCTGGCCCG CGTGCATTCC
CAGGGCTTCA TCGACGCGAT CCTGGCCGCC GCGCCCAGCG CCGGCCGCCA CGCCCTGGAC
CCGGACACGG TGCTGTCGAC CGGCAGCCTG ATCGCCGCGC GCCGGGCCGC CGGGGCGGTG
GCCGCCGCGA CCCGGGCGGT GGCGAGCGGG CAGGGGACAC GGGCGTTCTG CGCCGTGCGG
CCGCCGGGCC ACCATGCCGA GCCGGGCGTG GCCATGGGTT TTTGCGTGTT CTCCAACATC
GCCGTGGCCG CCCGGGTGGC CCAGGCCTCG GGGTTGAAGC GCGTGGCGAT CGTCGACTTC
GACGTCCACC ACGGCAACGG CACCCAGGCG GCGTTCGAGC ACGACGCTTC GGTGTTCTTC
GCCTCGATCC ACCAGTCGCC GCTCTATCCG GGCACCGGCG ATCCGTCGGA GACCGGAGTC
GGCAATGTCG CCAACGCCAC GGTCGCGCCC CACGCCCCTC GCGAGGTCTG GCGCAAGGGA
TTCGAAGGGC TGATGGACCG GGTCGACGGA TTCGCGCCGG ACCTGATCCT GGTCTCGGCC
GGATTCGACG CCCACGTCCG CGATCCCCTG GCCGCCCAGA GCCTGGAGGC CGAGGATTTC
GCCTGGGCGA CGCGGGCGAT CGCCTCGGTG GCGAACCGCC GTTGCGGCGG GCGTATTGTT
TCCTCGCTCG AGGGCGGCTA CGACCTAGAA GCGCTCGGCC GCTCGGCGGC GGCGCATGTC
AAAGCGTTGC AGGAGGGGTG A
 
Protein sequence
MDVALYTHLD MLDHRPGDNH AERPERLRAV IDALQDDACL DLESVEAPLI ELADLARVHS 
QGFIDAILAA APSAGRHALD PDTVLSTGSL IAARRAAGAV AAATRAVASG QGTRAFCAVR
PPGHHAEPGV AMGFCVFSNI AVAARVAQAS GLKRVAIVDF DVHHGNGTQA AFEHDASVFF
ASIHQSPLYP GTGDPSETGV GNVANATVAP HAPREVWRKG FEGLMDRVDG FAPDLILVSA
GFDAHVRDPL AAQSLEAEDF AWATRAIASV ANRRCGGRIV SSLEGGYDLE ALGRSAAAHV
KALQEG