Gene Acid345_3902 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_3902 
Symbol 
ID4072239 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp4616603 
End bp4617676 
Gene Length1074 bp 
Protein Length357 aa 
Translation table11 
GC content56% 
IMG OID637985928 
Producthistone deacetylase superfamily protein 
Protein accessionYP_592976 
Protein GI94970928 
COG category[B] Chromatin structure and dynamics
[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG0123] Deacetylases, including yeast histone deacetylase and acetoin utilization protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000000251706 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.03889 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGCTGCTT TGCCCGTCCG GTCGTATTGC GAACCTCGCG CTAATCAACG AAAATGCAGT 
GTTTCTATGT TGCCCTTCAA GCTCGTTTAT AGCGACCACT ACCGCTTACC TCTGGGCGAG
CATGTGTTCC CCACGCAGAA ATACGAACTC GTAAAACAGG AGCTGCTGGA AGAAGGGGTC
GCCTCGACGC AGGATTTCCT TACGCCGACG CCTGCTACAG AAGCCGATGT TCTGCTGGTG
CACTCCCATT TCTACGTAGA TAAGCTGATC GAAGGAACAT TAACAGCGCG TGAGGAACTG
GCCCTCGAGA TCCCCTACTC CCACGAAGCC GTGCAAGCGT TCTTGTGGCA CACCGGAGGC
ACCATCCTGG CGGCGGAGCG CGCACTTTCC GATGGAGTGG CGTTCAACCT CGGCGGCGGA
TTTCACCACG CGTATCCCGA CCACGGCGAA GGGTTCTGCA TGATTCACGA CGTGGCGGTG
GCGATCAGGA AACTGCAAAA ACAAGGCAGA ATCCAGCGTG TGATGACGCT CGACTGCGAT
GTTCATCAGG GAAATGGAAC TGCCGTAATT TTCGCAAAAC ACAGAGATGA GAATTCCGAG
GCCCTGCCTT CGCGTTCTAC TTCGACGATC GGCAACAGGC TCAGCGGAAC GATGTTGGAG
CGCGGTGCCG ACGATGTCTT CACGATCTCA TTGCATCAGG AGAACAACTA CCCACTGCAA
AAGCCGCCGT CGTCCATAGA CGTCAATCTC CCCGACGGTA CAACAGATTC CGAATACATC
GCGTGGCTCG ACAACGCGAT AAGTTCGGGG TTCCGGCAGT TCCAACCAGA TTTGCTTTGC
TATATCGCCG GCGCGGACCC TTACAAGGAA GATCAACTCG GCGGCCTGAA CCTCACTATT
GACGGTTTGA AACATCGTGA TGAGCTCGTA TTCCAAGCGG CACGCGCAAA GGGAATTCCC
GTCATGGTGA CATTTGCCGG CGGCTATGCG CGCAAGATCC AGGACACCGT GCGGATACAC
CGCAACACGG TCGGGGCAGC GAAGGAAGTT TTCTCAGGAG CAGGGAAGAG CTAA
 
Protein sequence
MAALPVRSYC EPRANQRKCS VSMLPFKLVY SDHYRLPLGE HVFPTQKYEL VKQELLEEGV 
ASTQDFLTPT PATEADVLLV HSHFYVDKLI EGTLTAREEL ALEIPYSHEA VQAFLWHTGG
TILAAERALS DGVAFNLGGG FHHAYPDHGE GFCMIHDVAV AIRKLQKQGR IQRVMTLDCD
VHQGNGTAVI FAKHRDENSE ALPSRSTSTI GNRLSGTMLE RGADDVFTIS LHQENNYPLQ
KPPSSIDVNL PDGTTDSEYI AWLDNAISSG FRQFQPDLLC YIAGADPYKE DQLGGLNLTI
DGLKHRDELV FQAARAKGIP VMVTFAGGYA RKIQDTVRIH RNTVGAAKEV FSGAGKS