Gene Ssol_0554 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSsol_0554 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSulfolobus solfataricus 98/2 
KingdomArchaea 
Replicon accessionCP001800 
Strand
Start bp500463 
End bp502199 
Gene Length1737 bp 
Protein Length578 aa 
Translation table11 
GC content35% 
IMG OID 
Productglycoside hydrolase 15-related protein 
Protein accessionACX90830 
Protein GI261601227 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000579959 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAACAC TTGGATTCAT CTCAAATCAG ATAACATCAG CCCTAATTGA TTTATCTTCA 
ATTGTTTGGT TTCCAGTTCC TAAGTTCGAC TCCCCGTCAG TGTTCACTAG GTTGTTAGAT
GAAGATGGAG GAGAGTTTTC TATATTACCA GAAGGACAAG AAATAATAGC TGTTAAACAA
GAATATGTTT ATCCATTAGT ATTAATGACT GCCATACGTA CTAAACAAGG CGAAATAAGT
ATTACTGATC TCATACCATT GGGTGAAACA GTAATTATAA GAAAGGTTGA ATCGGAAATT
CCTTTTAAGG TCGTATTTAA ACCTAGATTC TATTATTCTC TATACAAGCC CATAATCGAT
GGTAATAGAT TTGTAAATCC AAGAGGAAGG GATTGTATGG CGTTTCTTTA CGACTTTTCG
GGCGAGGTCA AAAGATCTGG AAATTATGTT TGGAATTTTA GTAATGGGAA AGGATATTTA
ATAGCCAACT ATGCCTCTGA CGTTAAACAT GGGGTTTTCA GTGAAAGGGG TTCAACGTTA
AATGCCATAT ATGAAAGATC GTTTGAAAAT ACGATAAACT ATTGGAAAAG TATTGATGTG
AAAGACGCTA AATCATTTAA TGACCTTTAT AAGGCATCCA TATATACAAT GCTAGGCTCT
ATTTATGCGC CTTCTGGAGG AGTAATTGCA GCTCCTACAA CTTCTTTACC AGAAGTTGAA
GGTGGAAAGA GAAATTGGGA TTATAGATTT GCATGGGTAA GAGATTCTTC GATCATAGCT
GAAGCCTTGT TAGAAGCTGG ATCCATTGTA GAAGCTAGAA GAATAATAAA CTTCTTACTT
TCGCTCATAA ATTTCTCGTC AAAGCCATTT TACTATCCCC TATATACGAT AGAGGGCACA
ATTCCTCCCC CTGAGAGGGA ATTACGATGG CTATCTGGAT ACAAGAACTC TAAACCAGTA
AGAATAGGAA ACGGAGCTTC TTCTCAGATT CAATTAGATA TTGAAGGATT TTTCATTTCG
GCTCTTTATA AATATGTAAA GATGACTAAT GATCAAGTGT TTCTGAAAGA CGTTTTTAGT
AAAGTGAAGT ACATTGGGGA TTGGATATCA GAGAATTGGA GCTTAAAAGA TTCTGGTATT
TGGGAGGATA GGGGGAGTCC TCAACACTAT ACTCACTCTA AAATTATGAT GTGGATAGCA
CTAGATAAAA TAGGGAAACT AGCAAACTTA ATCGGATATG CGGACATTTG GGCTAAAGAG
AGGGAAAAGC TTAGAAACTG GATATTCACT AACTGTGTAA AGAACAATTA TTTTATCAGA
TATTGTGGGA ATACTGATGA CGTAGATTCA TCATTATTAT CAGCACCATT GTATGGGTTC
ATTGAAGTTA GTGATAGTAC ATTTATTAAT ACACTAACGA AAATCGAAAA CGATCTAAAA
ACCGACGTAT TTGTGAAAAG ATACAAAACT GATTTCATGG GAGAAGCTAA ACACCCATTT
TTGTTGACTA CAGTGTGGCT TGCTAGAGTT TATATGAGAT TAGGAAAAAT AGATAGTGCT
ATAGAAATCT TGAATAAGAT CAATAAGGTT TCAAGAGAAC TACATTTAGT AGGTGAACAC
GTTGATGTGG AAAAAGGGGA GTTTACGGGT AACTTTCCTC AGATTTTTGT TCATGCGCAA
TTGGTAATTG CAATAAAAGA GCTTAACGAC ACGTTAACTG ATAAAAATAT TATATAG
 
Protein sequence
MKTLGFISNQ ITSALIDLSS IVWFPVPKFD SPSVFTRLLD EDGGEFSILP EGQEIIAVKQ 
EYVYPLVLMT AIRTKQGEIS ITDLIPLGET VIIRKVESEI PFKVVFKPRF YYSLYKPIID
GNRFVNPRGR DCMAFLYDFS GEVKRSGNYV WNFSNGKGYL IANYASDVKH GVFSERGSTL
NAIYERSFEN TINYWKSIDV KDAKSFNDLY KASIYTMLGS IYAPSGGVIA APTTSLPEVE
GGKRNWDYRF AWVRDSSIIA EALLEAGSIV EARRIINFLL SLINFSSKPF YYPLYTIEGT
IPPPERELRW LSGYKNSKPV RIGNGASSQI QLDIEGFFIS ALYKYVKMTN DQVFLKDVFS
KVKYIGDWIS ENWSLKDSGI WEDRGSPQHY THSKIMMWIA LDKIGKLANL IGYADIWAKE
REKLRNWIFT NCVKNNYFIR YCGNTDDVDS SLLSAPLYGF IEVSDSTFIN TLTKIENDLK
TDVFVKRYKT DFMGEAKHPF LLTTVWLARV YMRLGKIDSA IEILNKINKV SRELHLVGEH
VDVEKGEFTG NFPQIFVHAQ LVIAIKELND TLTDKNII