Gene Ssol_1460 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSsol_1460 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSulfolobus solfataricus 98/2 
KingdomArchaea 
Replicon accessionCP001800 
Strand
Start bp1344455 
End bp1345876 
Gene Length1422 bp 
Protein Length473 aa 
Translation table11 
GC content37% 
IMG OID 
Producthistone acetyltransferase, ELP3 family 
Protein accessionACX91692 
Protein GI261602089 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAAGTAA TAAGGAAACC GACTAGGATG CTTTCTGGCG TTACAATAGT ATCTATCATG 
ACTCATCCAC ACTCTTGCCA GCATGGGAAG TGTATCTTCT GCCCTGGAGG AGCAGATATA
GGCACTCCTC AAAGTTATTA TGGAAGGGAA CCAACTTTAA TGAGAGCAAT AGAAAACAAT
TACGATCCAT TTTATCAAGT ACAGTCTAGA TTAAGACAAT ATGTTGAGAA TGGACATACA
CCAAGCAAAG TAGAGCTAAT AATTATGGGT GGTACTTTTC TATCACTTCC CATGGATTAT
CAAGATTGGT TCGTGACCTA TGCCTTAGAA GCTATGAACA GATTTCCTAA CTCCAATAAA
CCTCCCTTTG TGTATTTAGA AGACGCGCAG TTAAATAACG AAAAAGCGGA TATACGATGC
GTAGGAATGA CAATTGAAAC AAAACCCGAT TGGGCTAAGG AATGGCATGC AGATCAAATG
CTAAGATTAG GTGCGACTAA AGTTGAATTA GGAGTACAAA CTGTATACGA TGACATTTTA
AAGTTCACCA ATAGGGGTCA TACTGTAAAA GACTCCATAG AATCTACTAG AATACTAAAG
GACTCGGGAT TCAAAGTGGT TTATCATATC ATGCTTGGCC TCCCTAAATC GGACCCCGAT
AAAGATCTGG AGGCGTTCAA GACGATATTT TCAGACCCTA ATTTTAGGCC GGATATGTTA
AAAATATATC CTACCTTAGT AGTTGAAACA GCACCATTAG TAAATTTGTG GAAAAGAGGA
TTGTATAGAC CTTACGATAC GGAAACTTTA GTTGACTTAA TATCTGAAAT GTACAAGTAT
ATTCCTAAAT GGGTTAGAGT AATGAGAATA CAAAGAGACA TCCCCGCAAA TGTAATCTTA
GACGGTAATA AAAAGGGGAA TTTGAGAGAA CTAGTAGAGA AGAGAGTTCT AGAAAAGGGA
ATGAAGATTA AAGAGATTAG GTTTAGGGAA GTCGGAATGA TGTGGCAACA CAGAGGATTA
TTACCAGATG ATAGCAAAAT TCACCTTTAC AAGGAAATTT ATGAGGCAAG TGAAGGTACT
GAAATTTTTC TATCTTTTGA AGACAATAAA GAGATCTTAA TAGGTTATTT ACGATTAAGA
ATTCCGTCAA ATAAAGCTCA CAGGAAAGAG ATAGATGGAA AGACTGCGAT AGTGAGGGAA
CTTCATGTAT ATGGAATAGA AGTACCCATA GGTAGCTGGG ATGAACTGGG CTTCCAACAT
AAGGGTTACG GAAGTAAGTT GCTAAGCGAA GCTGAAAGAA TTGCGAGAGA GGAGTTTGAC
ATGAAGAAAA TTTCGGTTTT ATCTGGTATA GGTGCTCGGG AATACTATGC TAAAAAAGGC
TATATAAAAG AAGGTCCTTA TATGTCTAAG AAGTTGATTT AA
 
Protein sequence
MQVIRKPTRM LSGVTIVSIM THPHSCQHGK CIFCPGGADI GTPQSYYGRE PTLMRAIENN 
YDPFYQVQSR LRQYVENGHT PSKVELIIMG GTFLSLPMDY QDWFVTYALE AMNRFPNSNK
PPFVYLEDAQ LNNEKADIRC VGMTIETKPD WAKEWHADQM LRLGATKVEL GVQTVYDDIL
KFTNRGHTVK DSIESTRILK DSGFKVVYHI MLGLPKSDPD KDLEAFKTIF SDPNFRPDML
KIYPTLVVET APLVNLWKRG LYRPYDTETL VDLISEMYKY IPKWVRVMRI QRDIPANVIL
DGNKKGNLRE LVEKRVLEKG MKIKEIRFRE VGMMWQHRGL LPDDSKIHLY KEIYEASEGT
EIFLSFEDNK EILIGYLRLR IPSNKAHRKE IDGKTAIVRE LHVYGIEVPI GSWDELGFQH
KGYGSKLLSE AERIAREEFD MKKISVLSGI GAREYYAKKG YIKEGPYMSK KLI