Gene Ssol_0802 details


Gene Information

Locus tag: Ssol_0802
Symbol:
ID:
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Sulfolobus solfataricus 98/2
Kingdom: Archaea
Replicon accession: CP001800
Strand:
Start bp: 749222
End bp: 750708
Gene Length: 1487 bp
Protein Length: 495 aa
Translation table: 11
GC content: 39%
IMG OID:
Product: glycoside hydrolase family 29 (alpha-L-fucosidase)
Protein accession: ACX91055
Protein GI: 261601452
COG category:
COG ID:
TIGRFAM ID:
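
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (an assumption, although it reproduces the listed gene length); the variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 749222
end_bp = 750708
protein_length_aa = 495

# Assumption: coordinates are 1-based and inclusive.
gene_length_bp = end_bp - start_bp + 1

# One uninterrupted reading frame would need three bases per residue
# plus one stop codon.
single_frame_bp = 3 * (protein_length_aa + 1)

print(gene_length_bp)                    # 1487
print(gene_length_bp % 3)                # 2
print(single_frame_bp - gene_length_bp)  # 1
```

The gene is one base shorter than a contiguous 496-codon frame and its length is not a multiple of three. This is consistent with the record flagging the gene as spliced, which suggests the protein is not read from a single uninterrupted frame.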


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.246166
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCACAAA ATTCTTACAA AATCTTGAAA TCACTTCCAG TACCATCTAA TGGTCCTTTC 
AAACCTACTT GGAGTTCATT AAAAAAGTAT ATAGTCCCAT CGTGGTTTAC CACCTCTAAA
TTCGGTATTT TTATCCATTG GGGAGTATAC TCAGTACCAG CATTTGGTAA TGAATGGTAC
CCTAGATACA TGTACATGCC AGATAGACCA GAACACCAAT ATCACCTAAA AAATTCGGCC
CAGTAACCGA TTTCGGATAT AAGGATTTCA TACCGATGTT CACTGGAGAG AATTGGGATC
CATATGAGTG GGCTAAGGTC TTTAAGAAAA GTGGAGCTAA ATTCGTAGTC CTAGTTGCAG
AACATCACGA TGGATTTGCA CTATGGGAAT CAAATTACAC TAGGTGGTGT GCAACCAAGA
TTGGACCTAA AAGGGACATT GTTAGAGAAC TTAAGGAAGC TGTTGAAGGT CAAGGGCTAA
TATTTGGCAT TTCGTATCAT AGGGCTGAGC ACTGGTGGTT TTTCGATCAA GGGATGAAAA
TAGAGTCTGA TGTAAAGGAC CCCAGATATC TTGATTTATA TGGCCCAGCT CAGTCTGCTT
CCCTAAATCC TAGAGATCCA CCTTCACTGG ATAATGTACA GCCAAATGAT GAGTTTCTAA
TGGATTGGTT GCTTAGAATT GTTGAGGCTG TTGAAAAGTA TAGGCCATGG CTAGTCTATT
TCGACTGGTG GATTGCCAAT CCCTCTTTCC AACCATATTT GAAGGCCTTT GCGTCCTATT
ACTATAATAG GTCATATAAA TGGGGAATAG AACCCGTAAT AATTTACAAG CAAGGGGCAT
TTGGGGAAGG TACAGCCATA CCGGATTTAG CTGAAAGGGG AACAATAAAG AACGTATATC
CCTCCACATG GTTAGCTGAC ACTTCTATAG ACTACAAATC CTGGGGTTAC ATCAAAGATG
CTGAATACAA GCTACCTAGT GTTATATTAT CCCATTTAGG TGATGTTGTT AGTAAAAATG
GAGTTTTTCT CTTGAATATA GGACCTAAAG CTGACGGTAC GATACCAGAA GAGGCTAAGA
GAATTCTACT TGATGTTGGG GATTGGCTAA ATGTAAATGG CGAAGCGATT TTCGGATCAA
AACCGTGGAG AGTTTACGGA GAAGGTCCTT CTGGAATTAA TGAAGGGGGA TTCTTTACAG
AGAGAAAAAT TACTTTAGGC TATCAAGATG TGAGATACAC TGTGAAAGAC TATTATCCGC
GACAAAGGCA TATTTACGCT ATTCTCTTCG GAAAGCCTAA GGAAATTACG TTAAGGTCGT
TTATGAAAAA TCTAAAGCTA ATAGAAGAAG CTGTAATAGT AGATGTAAGC AGATTAGATG
GGAAAGGTAA GTTAGAGTGG AGTTTAAGTG ATGAAGGTTT AAAGATAAAA ATAGAGGAAG
TTATAAGGGC TCCTCTTGTT ATAAGGGTTA TCCTAGATTA TAGATAG
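
The gene sequence is printed in space-separated blocks of ten bases, so it has to be joined before its length or GC content can be computed. The following is a minimal sketch; the helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first line of the sequence is used as input here.

```python
def clean_sequence(text: str) -> str:
    """Join a block-formatted sequence into one uppercase string."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases that are G or C (0.0 if empty)."""
    if not seq:
        return 0.0
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence, copied from the record.
first_line = "ATGTCACAAA ATTCTTACAA AATCTTGAAA TCACTTCCAG TACCATCTAA TGGTCCTTTC"
seq = clean_sequence(first_line)

print(len(seq))
print(round(100 * gc_fraction(seq), 1))
```

Passing the full sequence block to the same helpers gives values that can be compared with the listed gene length (1487 bp) and GC content (39%).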
 
Protein sequence
MSQNSYKILK SLPVPSNGPF KPTWSSLKKY IVPSWFTTSK FGIFIHWGVY SVPAFGNEWY 
PRYMYMPDRP EHQYHLKKFG PVTDFGYKDF IPMFTGENWD PYEWAKVFKK SGAKFVVLVA
EHHDGFALWE SNYTRWCATK IGPKRDIVRE LKEAVEGQGL IFGISYHRAE HWWFFDQGMK
IESDVKDPRY LDLYGPAQSA SLNPRDPPSL DNVQPNDEFL MDWLLRIVEA VEKYRPWLVY
FDWWIANPSF QPYLKAFASY YYNRSYKWGI EPVIIYKQGA FGEGTAIPDL AERGTIKNVY
PSTWLADTSI DYKSWGYIKD AEYKLPSVIL SHLGDVVSKN GVFLLNIGPK ADGTIPEEAK
RILLDVGDWL NVNGEAIFGS KPWRVYGEGP SGINEGGFFT ERKITLGYQD VRYTVKDYYP
RQRHIYAILF GKPKEITLRS FMKNLKLIEE AVIVDVSRLD GKGKLEWSLS DEGLKIKIEE
VIRAPLVIRV ILDYR
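
The start of the protein sequence can be checked against the start of the gene sequence by translating codon by codon. The following is a minimal sketch, assuming the standard amino-acid assignments; translation table 11 uses the same assignments and differs only in which codons may act as starts. The `translate` helper and constant names are illustrative. Because the record flags the gene as spliced and its length is not a multiple of three, a single-frame translation of the whole sequence is not expected to reproduce all 495 residues, so only the first 20 are compared.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate complete codons in frame 0; trailing bases are ignored."""
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# First 60 bases of the gene and first 20 residues of the protein,
# copied from the record with the block spacing removed.
gene_start = "".join(
    "ATGTCACAAA ATTCTTACAA AATCTTGAAA TCACTTCCAG TACCATCTAA TGGTCCTTTC".split()
)
protein_start = "".join("MSQNSYKILK SLPVPSNGPF".split())

print(translate(gene_start) == protein_start)
```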