Gene Msed_1556 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsed_1556 
Symbol 
ID5104001 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMetallosphaera sedula DSM 5348 
KingdomArchaea 
Replicon accessionNC_009440 
Strand
Start bp1513108 
End bp1514133 
Gene Length1026 bp 
Protein Length341 aa 
Translation table11 
GC content48% 
IMG OID640507442 
Producthistone deacetylase superfamily protein 
Protein accessionYP_001191635 
Protein GI146304319 
COG category[B] Chromatin structure and dynamics
[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG0123] Deacetylases, including yeast histone deacetylase and acetoin utilization protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0460927 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGACGTATG GGGACTCATT AGGGATAGTC TGGGATGAAA GGTTTAGGGA GATCTCCTTC 
TCCCACCCCA TGATAAGGGA CGTTTCCAAG AGCAGAATCG TGAGGTTCAG GGAGCTGGTC
TCAACTCTCA ATGTCTACTT CATATCTCCT AAACCTGCCA CGTACGAGGA TCTGCTCGAG
GTTCACGATG AGAGCCTTCT AAGAAAGATC AAGGAAGTTA GTTCCCTCCC TTACATTGGC
TTCCTAGATT CGGGGGACAC GGTTCATTAC CCTGGCATGT TTGACGATAT TCTTCTAGTT
GCGGGTTCGA CTCTCACGGC AATCTTCATG TCAAGGTTCT TTGACTCAAT TTACATTCCG
CTGGGGGGAT TTCATCATGC CACTAGATCA AGATCCATGG GTTTTTGCCC CATAAACGAC
GTTAACTTAG CGATATCGAG ACTCATGAAA ATGGGAGAAA GGGTCGCACT GGTTGACGTA
GACGCCCACC ACGCGAATGG AGTGGAGGAA ATGTTCTATG ACAAACCAGT GCTCAAGATC
AATATTTTCG CTTATGATGG CAAATTCTTC CCGGGAACCG GTGATTACAG GAGAAGGGGG
AAAGGAGAAG GGACGGGTTA TAACTTCAAT GTCGGGTTAC CTCTGGGCTC TGCTGACGAC
TCGTTTCAGG AGGCATTAAG GTTGCTCGAT GTGGTTGAAG ACTTCAAACC ATCAGTCCTA
CTCGTGGTTG CCGGGGTAGA TGGGCATAAG GATGACGGAC TCAAATCCCT CAATCTGACC
ACGAACTCCT TCAATTTACT GGGCCTAAAG GTGAGCAGGT TAGCTAGGAG GGTTAACGCA
AGGATTATTT CCTTTGGAGG AGGAGGCTAC GGTCCAGGTT CAGCACCCTC TATGTTTTCC
TTTGTGAAGG GACTCATGGG AAACAGGTCA GAGGATGAAA TGCCCACACA AGATGAGGAG
AAAAAGGCTT ACGTGAAAAA ATTAGTGGAT CTACTCCTTG AACGCCTTCC CAGTAGCCTT
GAGTAG
 
Protein sequence
MTYGDSLGIV WDERFREISF SHPMIRDVSK SRIVRFRELV STLNVYFISP KPATYEDLLE 
VHDESLLRKI KEVSSLPYIG FLDSGDTVHY PGMFDDILLV AGSTLTAIFM SRFFDSIYIP
LGGFHHATRS RSMGFCPIND VNLAISRLMK MGERVALVDV DAHHANGVEE MFYDKPVLKI
NIFAYDGKFF PGTGDYRRRG KGEGTGYNFN VGLPLGSADD SFQEALRLLD VVEDFKPSVL
LVVAGVDGHK DDGLKSLNLT TNSFNLLGLK VSRLARRVNA RIISFGGGGY GPGSAPSMFS
FVKGLMGNRS EDEMPTQDEE KKAYVKKLVD LLLERLPSSL E