Gene Ssol_2223 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSsol_2223 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSulfolobus solfataricus 98/2 
KingdomArchaea 
Replicon accessionCP001800 
Strand
Start bp2001631 
End bp2003385 
Gene Length1755 bp 
Protein Length584 aa 
Translation table11 
GC content38% 
IMG OID 
Productsugar isomerase (SIS) 
Protein accessionACX92412 
Protein GI261602809 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.22495 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGAGGAA TTTTCGCTTT CGTATGCAAG GATTCCATTG ATGTTTCAAT TATTAACAAG 
GGTTTAAAGA AGTTGATTTA TAGGGGATAT GATAGCGCTG GTATTGCTTA CCTTGAAGAC
GATAGCTTAG TAATAAAGAA AATTTTAGGT AATATCTCAA AGAATGAGAT AAGCGTTAGT
GACAAGGCAA GAGTTGCAAT AGGTCACACT AGATATGCGA GTAGGGGTTG GCCAACTTTG
GAAAACGCTC ACCCACTGAC TGATTGTAAT GGGAAGATAG CAGTTGTAAT GGACGGTATT
CTTGACGATT ACGAAAAAAT TAGGGAAGAT CTGATTGCGA AGGGACATAA ATTCGTCTCT
ACAACTGACG CTGAGGTGAT TCCCCACTTA CTTGAGAATT CAACAAACTA TCTAAACTCA
TCATTAAACG TTATGAAAAG GGTAAAGGGC ATTTACTCTC TGGTTTTTGT AACCATAGAC
ATTGATAAAA TATTCGCAAT TAACTCTGGC CAACCCTTGA TGATAGGTAT CACACAAGAG
TGTAAATACG TTTCTAGCGA TTTACCCTCT TTGAGCGGTT TTGCTGAGAA TGCGATAATA
ATGCCAGAAA ATACTGTGGC AGTAATCTCT TGGAATGATG TGCAAGTGTA TAATATTGAA
GGTAATGAGG TAAAACCGGA AATTAAGAGA GTTAAATACA AGGAGGAGAT AGCTGAAAAG
GGTGGATTTC CACACTTCAT GTTAAAGGAG ATATACGATA TCCCACAAGC GTTAATAAAC
TCATTTAACT CTCTAATGGA AAAGTACCTT TCCTTAGCCT CAATGATAGT ATATGGTGCC
AAGAACGTCT ATATAATAGG TAATGGGACT AGTCTTCACG CTGGATTTAT CTCATCATAT
TACTTTTCTG AAATTAGCCT AAATGTTAAT GTTGTAAGTG CAGCGGAGTT TCCCTATTAC
GCCTTGAAAA ACGTGACTAC TGGTTCGGTA ATTATTGCTA TAAGTCAAAG TGGGGAGACA
AGTGATGTTA TAAGGAGTAT TAAAATGGCT AAGCAAAGAG GGGCTGTAAT ATTAGGTATA
ACCAACTCTG TAGGTTCAAG ATTAGCCTTA GAATCTAACG TGTACTTACC AATAACTGCT
GGGCCAGAGA TGGCTGTACC AGCGACAAAA ACTTTCACTT CAACTATTGT AGTATTAAAA
GTGCTTTCGC TATACACTGG ACTTCACTCT GGTAAAAACG ATAGGAGTGA GATCAGTTCG
TTAAAAAGTG AGATTGAAGA ATTGGCTAAA CAGTTAATGG TAAGGTTACC GGAGATGGAG
AAAGAGGCAG AGAAATTGGC TCCTAAATTA GACAAGGAAA GCTTATACAT TTCGAGTAGT
GGTATAAATT ACCCCATAGC CCTAGAAGGA GCTTTGAAGT TTAAGGAAGC TTCGATGACT
CACGCAGAGG GGATTCAGCT GGGAGAACTC CTCCACGGTC CCATTGTTCT AACAAATAAA
GGTTACCCCG TAATTTTAAT AAAACCTGTG GAGGCTGAGG ATTTATATAA CAAGGTTATT
AGATCTATAA AGGAAAGAGG AGATGTAATT GTGACCGTTG CTGAAGATGG TGATATGAAA
AGTATAAAGG CTACTAGGGA TTTAACTCCC ATAAGCAATG TAATACCGTT ACACTTATTG
GCCTATAAAC TGGGAGTTAG GAAAGGGTTG CCGATAGATA CTCCTCCAGG GTTAGTGAAA
GCTGTGATAG TTTAA
 
Protein sequence
MGGIFAFVCK DSIDVSIINK GLKKLIYRGY DSAGIAYLED DSLVIKKILG NISKNEISVS 
DKARVAIGHT RYASRGWPTL ENAHPLTDCN GKIAVVMDGI LDDYEKIRED LIAKGHKFVS
TTDAEVIPHL LENSTNYLNS SLNVMKRVKG IYSLVFVTID IDKIFAINSG QPLMIGITQE
CKYVSSDLPS LSGFAENAII MPENTVAVIS WNDVQVYNIE GNEVKPEIKR VKYKEEIAEK
GGFPHFMLKE IYDIPQALIN SFNSLMEKYL SLASMIVYGA KNVYIIGNGT SLHAGFISSY
YFSEISLNVN VVSAAEFPYY ALKNVTTGSV IIAISQSGET SDVIRSIKMA KQRGAVILGI
TNSVGSRLAL ESNVYLPITA GPEMAVPATK TFTSTIVVLK VLSLYTGLHS GKNDRSEISS
LKSEIEELAK QLMVRLPEME KEAEKLAPKL DKESLYISSS GINYPIALEG ALKFKEASMT
HAEGIQLGEL LHGPIVLTNK GYPVILIKPV EAEDLYNKVI RSIKERGDVI VTVAEDGDMK
SIKATRDLTP ISNVIPLHLL AYKLGVRKGL PIDTPPGLVK AVIV