Gene Pcal_1126 details


Gene Information

Locus tag: Pcal_1126
Symbol:
ID: 4909224
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pyrobaculum calidifontis JCM 11548
Kingdom: Archaea
Replicon accession: NC_009073
Strand:
Start bp: 1057101
End bp: 1058111
Gene Length: 1011 bp
Protein Length: 336 aa
Translation table: 11
GC content: 58%
IMG OID: 640124880
Product: histone deacetylase superfamily protein
Protein accession: YP_001056017
Protein GI: 126459739
COG category: [B] Chromatin structure and dynamics; [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG0123] Deacetylases, including yeast histone deacetylase and acetoin utilization protein
TIGRFAM ID:
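
The coordinates and lengths listed above agree with each other. The following is a minimal Python sketch of that arithmetic, assuming the coordinates are 1-based and inclusive and that the final codon of the gene is a stop codon. The function names are illustrative and not part of the source database.

```python
# Values copied from the Gene Information record above.
START_BP = 1057101
END_BP = 1058111


def gene_length(start: int, end: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count, assuming the last codon is a stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_len // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, "bp")  # 1011 bp, matching "Gene Length"
print(length_aa, "aa")  # 336 aa, matching "Protein Length"
```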


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value: 0.546018
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGGTGG TGTTCTCCCC CGTGTTTAAG TTGCACAAAC CGCCGTACAG CCACCCCGAG 
GCTCCCGACA GACTCGACGC AGTGTTAAAA GGCGCCGAGG AGGCCGGCGC CGCCGTCCAA
GCGCCAAGGG CCAGAGAGGA CGTGTGGAAC CTCGTGTCGC TGGCCCATGA GAGGGGGTAC
ATAGAGTTTG TGAGAAGGCT GTGTAAAGAG GACTATGCCC AAATCGACGG CGACACGTAC
GTCTCCTCTG GCACATGCGA GGCAGCGGCC TTGGCAGTAA GCGCTATGGC AGACGCCGTG
GACGCAGGCG AGACAGTTTT GCTCGCAGTT AGGCCGCCCG GCCACCATGC GGGGTTTGTG
GGGAGGGCGC TGACGGCCCC AACGCAGGGC TTCTGTATCT TCAATACTGC GGCAATTGGC
GCATTGTACA AGGGGGAGGG CGTCGCTGTG GTAGACATCG ACGTACACCA CGGAAACGGC
ACACAAGAAA TTCTCTACGA CAAGGACCTA CTCTACATCT CTACGCACCA GCACCCAGCC
ACGCTGTACC CAGGCACGGG CTACCCAGAC GAAGTAGGCA CAAGCAAGGG GGAAGGCTTC
AACGTCAATA TACCGCTCCC GCCTCTCACT GGAGACGACG CGTACGCCGA GGCAGTAGAG
GAGGTTATAG AGCCAATCTT GCGGCAGTAC GACCCCAAGA TCATAATCGT GTCCCTGGGA
TGGGACGCAC ACAGAGACGA CCCCTTGGCC AACATGGGGC TGACGATAAA CGGCTACCTA
CGCGCAATTT CCACCATACT CAAACTCCAA AAGCCCACGA TCTTCTTACT CGAGGGAGGC
TACAACAGAG ACGTGTTGAG GAGAGGCACT GCGGCGCTGA TAAGGCTGGT GGACAAGGGC
GAAGAAGCCG CCGGAGAAGC GCCTTCAAAG ACCGACGCCA AGGTATACGA GAGGTTTAGG
AGAGAGCTCG GCGAAGTAAA GAGGCACGTC TCAAAACACT GGCGCCTATA G
 
Protein sequence
MKVVFSPVFK LHKPPYSHPE APDRLDAVLK GAEEAGAAVQ APRAREDVWN LVSLAHERGY 
IEFVRRLCKE DYAQIDGDTY VSSGTCEAAA LAVSAMADAV DAGETVLLAV RPPGHHAGFV
GRALTAPTQG FCIFNTAAIG ALYKGEGVAV VDIDVHHGNG TQEILYDKDL LYISTHQHPA
TLYPGTGYPD EVGTSKGEGF NVNIPLPPLT GDDAYAEAVE EVIEPILRQY DPKIIIVSLG
WDAHRDDPLA NMGLTINGYL RAISTILKLQ KPTIFLLEGG YNRDVLRRGT AALIRLVDKG
EEAAGEAPSK TDAKVYERFR RELGEVKRHV SKHWRL
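
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The sketch below shows one way to do that and to compute GC content in plain Python. The helper names `clean_sequence` and `gc_percent` are hypothetical, and only the first line of the gene sequence is used as sample input, so the printed percentage describes that fragment rather than the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence listed above (60 bases).
first_line = (
    "ATGAAGGTGG TGTTCTCCCC CGTGTTTAAG TTGCACAAAC CGCCGTACAG CCACCCCGAG"
)

fragment = clean_sequence(first_line)
print(len(fragment))                   # number of bases in the fragment
print(fragment[:3])                    # first codon of the gene
print(round(gc_percent(fragment), 1))  # GC content of this fragment only
```

Passing the full gene sequence through the same two helpers gives a value that can be compared with the "GC content" field in the record.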