Gene Ssol_1968 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSsol_1968 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSulfolobus solfataricus 98/2 
KingdomArchaea 
Replicon accessionCP001800 
Strand
Start bp1753534 
End bp1755402 
Gene Length1869 bp 
Protein Length622 aa 
Translation table11 
GC content33% 
IMG OID 
Productglycoside hydrolase 15-related protein 
Protein accessionACX92179 
Protein GI261602576 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGAGTTT CCTCCATAGG AAATGGCAGA ATGCTGATAA ACTTTGATGA GAAAGGAAGA 
ATAGTCGATA TTTATTATCC TTATATAGGA ATGGAGAACC AGACTTCTGG AAACCCAATT
AGGTTAGCTA TTTGGGACAA AGATAAGAAA GTGGCATCTC TAGATGAGGA TTGGGAAACT
ACTGTATTAT ATATAGATGA AGCTAATATG GTTGAGATTA GGAGTGATGT TAAGGAGTTA
GGACTTTCTC TTCTCTCTTA TAACTTTCTA GATTCTGATG ATCCGATATA TATGTCTATT
GTAAAAATAG CAAATAACGA AAATAATAGC AGAAATATAA AAGTATTTTT TATACATGAT
ATAAATTTAT ATTCAAACCC TTTTGGGGAC ACTGCATTCT ATGATCCCCT ATCCCTTTCA
ATTATACATT ATAAGTCTAA ACGATATTTA GCCTTTAAAG TGTTTACCAC GGTATCGACA
CTTTCTGAGT ATAACATAGG CAAAGGTGAC TTAATTGGAG ATATTTATGA TGGCAATTTA
GGACTTAATG GTATAGAAAA TGGTGATGTA AATTCAAGTA TGGGTATAGA GATAAATATA
GATCCTAATT CCTATTTGAA ATTATACTAC GTAATAGTCG CAGATAGAAA CTTGGAAGGC
TTAAGGCAAA AAATAAGGAA AATAAACTTT GCAAACGTAG AGACATCGTT TACGTTAACC
TATATGTTTT GGCGGAATTG GTTAAAGAAA AATAAACTCT TCAGAAATAA TTTAATGCAG
GATATTAAGA GAGTCTATGA TGTGAGTCTT TTTGTGATAA GAAATCACAT GGACGTTAAC
GGGTCAATAA TAGCTTCCTC AGACTTCTCC TTCGTCAAGA TTTATGGGGA CTCATATCAG
TATTGTTGGC CTAGAGATGC GGCAATTGCA GCTTATGCTC TAGATCTAGC TGGCTATAAG
GAACTAGCAT TAAAACACTT CCAGTTCATT TCTAATATTG CAAATTCTGA AGGCTTCCTA
TATCATAAAT ATAATCCAAA TACAACTCTA GCTAGTTCTT GGCATCCTTG GTATTATAAA
GGTAAAAGGA TATACCCAAT TCAAGAGGAT GAGACGGCAT TAGAAGTATG GGCAATAGCT
AGTCATTACG AAAAATATGA AGATATTGAC GAAATACTTC CATTATATAA GAAGTTCGTG
AAGCCAGCCT TAAAATTTAT GATGTCTTTT ATGGAAGAAG GATTGCCAAA ACCTTCTTTT
GACCTATGGG AAGAAAGGTA TGGTATACAT ATTTACACAG TATCTACGGT TTACGGCGCA
TTAACAAAGG GAGCAAAGTT AGCTTATGAT GTAGGTGATG AAATATTAAG TGAAGATTTA
AGTGATACAT CGGGTTTATT AAAAGGAATG GTTTTGAAAA GAATGACTTA TAATGGAAGA
TTTGTTAGAA GAATAGACGA GGAAAATAAC CAAGATCTAA CTGTGGACTC AAGTCTCTAT
GCTCCATTCT TCTTTGGTCT TGTTAATGCA AATGACAAAA TCATGATAAA TACCATTAAC
GAGATTGAAA GCAGATTAAC TGTGAATGGT GGGATAATAA GGTATGAGAA TGATATGTAT
CAGAGGAGGA AAAAACAACC AAACCCTTGG ATAATTACGA CATTATGGCT ATCTGAATAT
TATGCAACAA TTAACGATAA AAATAAGGCA AACGAGTACA TAAAATGGGT AATTAATAGG
GCATTACCAA CCGGCTTTTT ACCAGAACAA GTTGATCCAG AAACTTTTGA GCCAACTTCA
GTTACACCTT TGGTATGGTC TCATGCTGAA TTCATAATAG CAATTAATAA GCTCTTAAAC
CATATATAA
 
Protein sequence
MRVSSIGNGR MLINFDEKGR IVDIYYPYIG MENQTSGNPI RLAIWDKDKK VASLDEDWET 
TVLYIDEANM VEIRSDVKEL GLSLLSYNFL DSDDPIYMSI VKIANNENNS RNIKVFFIHD
INLYSNPFGD TAFYDPLSLS IIHYKSKRYL AFKVFTTVST LSEYNIGKGD LIGDIYDGNL
GLNGIENGDV NSSMGIEINI DPNSYLKLYY VIVADRNLEG LRQKIRKINF ANVETSFTLT
YMFWRNWLKK NKLFRNNLMQ DIKRVYDVSL FVIRNHMDVN GSIIASSDFS FVKIYGDSYQ
YCWPRDAAIA AYALDLAGYK ELALKHFQFI SNIANSEGFL YHKYNPNTTL ASSWHPWYYK
GKRIYPIQED ETALEVWAIA SHYEKYEDID EILPLYKKFV KPALKFMMSF MEEGLPKPSF
DLWEERYGIH IYTVSTVYGA LTKGAKLAYD VGDEILSEDL SDTSGLLKGM VLKRMTYNGR
FVRRIDEENN QDLTVDSSLY APFFFGLVNA NDKIMINTIN EIESRLTVNG GIIRYENDMY
QRRKKQPNPW IITTLWLSEY YATINDKNKA NEYIKWVINR ALPTGFLPEQ VDPETFEPTS
VTPLVWSHAE FIIAINKLLN HI