Gene GSU1222 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGSU1222 
Symbol 
ID2685273 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sulfurreducens PCA 
KingdomBacteria 
Replicon accessionNC_002939 
Strand
Start bp1326135 
End bp1327292 
Gene Length1158 bp 
Protein Length385 aa 
Translation table11 
GC content61% 
IMG OID637125896 
Producthistone deacetylase/AcuC/AphA family protein 
Protein accessionNP_952275 
Protein GI39996324 
COG category[B] Chromatin structure and dynamics
[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG0123] Deacetylases, including yeast histone deacetylase and acetoin utilization protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0627011 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGCCAGCCA GAACAGCCCT CATATACTCG AACGATTTCG CCCGGTTCAG CTACGGCGAC 
GATCACCCGT TCAAGATCCA GCGTTTCATT CTCGCCTTTG AGCTCATGCG CGCCTATGGC
CTCATGGAGC TTCCGAACGT CAAAATCCTC GACTGCCCCC GAGCTGCGGA AGAGGCACTG
CTTACCTTTC ACGCGCCCGA CTATCTCGAT CGTCTCAGGG AATTCAGCGA GTCGGACGAT
GCCCGCGCCG ATTTCCGGTA CGGTCTCGGC GATCTGGACA ACCCGGTTTT CCGGGGGCTC
TACGACTGGG CACGGCTGGG TGCCGGAGGG ACCATCGAGG CGGCCCGGCT GGTTGCCGAG
GAGGGCTATG ACATCGCCTT CAATCTTGCG GGGGGGTGGC ATCACGCCCA TCGGGCCAAG
GCATCGGGAT TCTCCTATCT GAACGACGCG GTCGTGGCCA TCAACCTGCT CCTGGAAAAG
GGCCTGCGGG TGGCGTACCT CGATATCGAT GCCCACCACG GCGACGGAGT GCAGGAAGCG
TTTTACGATA CGGACCGGGT CCTGACCATT TCGATTCACG AGAGCGGCAT GTACTTCTTT
CCCGGCACCG GTTTCGAGGG GGAAACCGGC ACCGGCGCGG GCACGGGGTA TTCGGTCAAT
ATCCCGCTGG TGGCCCACGC CGACGATGCG CTTTTCATGA AGGCCTTCGA CGAAGTGGCG
TTTCCGCTTC TCGCCGCCTA TAATCCCGAC GTCCTCGTGA CTCAACTGGG CGCCGACACC
TTCCGTACCG ATCCTCTCAC GCGGCTTGAG GTGACGACTC ATAGCTACAC CTATATCCTG
CGCAAGCTCA AGGCGCTCGG CATCCCCTGG GTTGCCGTGG GAGGGGGCGG ATACAACCTG
GTCAATGTGG CCAGGGCCTG GACCCTTGCC TGGGGGGTGA TGAACGGGGT CGAACTGCCG
CCCCGACTGC CGGATTCGTT TGTGTCGATC ATCGGCCGGC TCGGCTATCC CAACAGGATG
CTCCTCGATG CCATGCACTG GGCCCAGGAG GACGACCGCA ACCAGGCACT GGACGCGGTG
GAGCGAAGCA TAGCTGTCAT CCGGAAGACG ATTTTTCCGG TGATCATCGG TTCCTATGGC
GAGACTTCCG GGGAATGA
 
Protein sequence
MPARTALIYS NDFARFSYGD DHPFKIQRFI LAFELMRAYG LMELPNVKIL DCPRAAEEAL 
LTFHAPDYLD RLREFSESDD ARADFRYGLG DLDNPVFRGL YDWARLGAGG TIEAARLVAE
EGYDIAFNLA GGWHHAHRAK ASGFSYLNDA VVAINLLLEK GLRVAYLDID AHHGDGVQEA
FYDTDRVLTI SIHESGMYFF PGTGFEGETG TGAGTGYSVN IPLVAHADDA LFMKAFDEVA
FPLLAAYNPD VLVTQLGADT FRTDPLTRLE VTTHSYTYIL RKLKALGIPW VAVGGGGYNL
VNVARAWTLA WGVMNGVELP PRLPDSFVSI IGRLGYPNRM LLDAMHWAQE DDRNQALDAV
ERSIAVIRKT IFPVIIGSYG ETSGE