Gene TM1040_2066 details

Gene Information

Locus tag: TM1040_2066
Symbol:
ID: 4077993
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ruegeria sp. TM1040
Kingdom: Bacteria
Replicon accession: NC_008044
Strand:
Start bp: 2169776
End bp: 2170804
Gene Length: 1029 bp
Protein Length: 342 aa
Translation table: 11
GC content: 63%
IMG OID: 638007385
Product: histone deacetylase superfamily protein
Protein accession: YP_614060
Protein GI: 99081906
COG category: [B] Chromatin structure and dynamics; [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG0123] Deacetylases, including yeast histone deacetylase and acetoin utilization protein
TIGRFAM ID:
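
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (the record does not state this) and that the reported protein length excludes the stop codon. The variable names are illustrative and are not part of the record.

```python
# Values copied from the gene record; names are illustrative only.
start_bp = 2169776
end_bp = 2170804
gene_length_bp = 1029
protein_length_aa = 342

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# A coding sequence should be a whole number of codons.
codons, remainder = divmod(span, 3)

# Assumption: the final codon is a stop codon and is not counted in the protein length.
expected_protein_length = codons - 1

print(span == gene_length_bp)                        # span agrees with Gene Length
print(remainder == 0)                                # length is a multiple of three
print(expected_protein_length == protein_length_aa)  # agrees with Protein Length
```

Under these assumptions all three checks agree with the reported values: 1029 bp corresponds to 343 codons, one of which is the stop codon.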


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.229511
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAGCGA TCTATGATCC CCGCCAACGC GCCCATAACC CCAGCCAGTT CATGGCCTTT 
GGGGTGATGA AGCCCAATCC CGAACAACCC GAACGCACCG AGATCCTGCG CAGCGGCGCG
GAGGCGGCAG GCTGTACCTT CACAGCGCCC GAGGATGCGG GCCTTGGCCC CATCGCGGCG
CTGCATTCGC CGGAATACCT GACCTTCCTG CAGACCATCC ACGCCCGCTG GAGCGAGATC
GAAGGCGCAG GCCCCGAGGT GATTTCTCAT ATCAAGCCGG GAGATCGCCG TGACAGTTAT
CCGCGGTCTG CGCTAGGGCA GGCGGGCTAT CATCAGGCCG ATACCTCCTG TCCGATCAAT
GCCGACACTT GGGGCTCTGC CTATTGGTCG GCGCAGACTG CGATCACCGC CGCCGACCTG
ATTGCAAAGG GCGAGCGCGC CGCCTATGCG CTCTGCCGCC CGCCGGGGCA TCACGCGTTT
GGAGATATGG CGGGGGGGTT TTGTTTCCTC AATAACTCCG GCATCGCGGC GCAGCTGTTG
CGGGATCGGG GCCTCAGGCC CGCAATTCTG GACGTGGATG TCCACCACGG CAACGGCACG
CAGGGGCTGT TTTATGATCG CGACGACGTG CTGACGCTCT CGATCCACGC CGACCCTGCG
GACTTCTACC CGTTCTTCTG GGGCCACAGT TCTGAGCGCG GCGAGGGCCG GGGGCGGGGC
TATAACCTCA ACCTGCCGCT GCCGCGTGGT ACCGAAGATG CACCGTTCCT GGATGCACTG
GACACCTGTC TTGACCGGGT GCGCGCCTTT GGGTGCGATG TGCTTGTGAT CGCGCTGGGC
CTTGATGCCT CCGTGGACGA TCCGTTTCAG GGCTTTCAGG TTACCGGCGA CGGGTTTTCG
CGCATTGGCG AGGCAATTGC CCGCGCCGGC CTTCCAACGC TTTTTGTGCA GGAAGGCGGT
TACATCTCCG ACAGCCTCGG TCATAACCTC ACCCGCGTGC TTGGCGGATT TACCTCTGCT
GCACGCTGA
 
Protein sequence
MKAIYDPRQR AHNPSQFMAF GVMKPNPEQP ERTEILRSGA EAAGCTFTAP EDAGLGPIAA 
LHSPEYLTFL QTIHARWSEI EGAGPEVISH IKPGDRRDSY PRSALGQAGY HQADTSCPIN
ADTWGSAYWS AQTAITAADL IAKGERAAYA LCRPPGHHAF GDMAGGFCFL NNSGIAAQLL
RDRGLRPAIL DVDVHHGNGT QGLFYDRDDV LTLSIHADPA DFYPFFWGHS SERGEGRGRG
YNLNLPLPRG TEDAPFLDAL DTCLDRVRAF GCDVLVIALG LDASVDDPFQ GFQVTGDGFS
RIGEAIARAG LPTLFVQEGG YISDSLGHNL TRVLGGFTSA AR
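
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation routine. This is a minimal sketch, not the method used to generate the record: it builds the codon table from the standard NCBI base ordering, relying on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs only in which start codons are permitted. The `translate` function and the variable names are hypothetical helpers introduced for illustration; only the first three blocks and the last block of the gene sequence are used here.

```python
from itertools import product

# Codon table in NCBI order (first base varies slowest over T, C, A, G).
# The amino-acid assignments are shared by translation tables 1 and 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon.

    Whitespace (such as the 10-base block spacing in the record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks and the final block of the gene sequence above.
first_blocks = "ATGAAAGCGA TCTATGATCC CCGCCAACGC"
last_block = "GCACGCTGA"

print(translate(first_blocks))  # MKAIYDPRQR
print(translate(last_block))    # AR
```

The two outputs match the first block and the final two residues of the protein sequence above; the trailing TGA is read as a stop codon and produces no residue.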