Gene Ssol_1088 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSsol_1088 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSulfolobus solfataricus 98/2 
KingdomArchaea 
Replicon accessionCP001800 
Strand
Start bp1019452 
End bp1020828 
Gene Length1377 bp 
Protein Length458 aa 
Translation table11 
GC content33% 
IMG OID 
Productglycosyl transferase family 2 
Protein accessionACX91331 
Protein GI261601728 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATACTTA ATATATTACT TAACTTGGCA ATATTTATAA TACCTTCTCT GGCAGTTTGG 
AATCAGATCA TTTTATATAT ATTCGGTAAG AGTGGTGATA TAGCGAATTT GATTTTACTT
AATAACGAGA GTAATCTACC TAAATTATCA ATTATAGTAC CAACAAAGGG TGAACGAATA
GAAGTAATCC AAGGATTAAT TGATAATATA CATGAAGCAA TATGGGATAG AAACAAATTA
GAAATAATAA TTGTATCAGA TGACGAGCAA GAATATTTTG ATAAATTATT ATCGACATTA
ATTATTCCTC CAGACTTAGA TGTAAGAATA TTTAGAAGAG AAAAGAAGCT TGGATATAAA
AGTGGTGCCT TAGCGTTTGG TCTACAGAAA AGCACTGGTG ATTTAATTTT AACATTAGAT
GTAGATGCCA GGATTGAAAA AGATTCGTTA ATAAAAGCTT ACAATCACAT GGTGAATTTA
GGCTGTGATG CAATCACTAT GGAATGGCAT GGCTATTCTA ATATTAGCAC GTCTTTAGCA
AAAGCGTTAA TGGTATCAAC AGTACTTACT AGTAAATCTA TATTAAGAGG AAGAGATAAA
TTAGGACTAA AAGTATTGCC AATAGGATGT GGGACGATCT ATAAGCGTAG TGCATTGGAG
GCGGTAAACG GATGGGATTA CAAAATGATT CAGGATGATT ATGAGCTGGG AGCTAGACTT
ATAAATAAAG GATTTAAGGT TTGCGCATCT TCCTCCCCTG TTTATGTTGA AGTTCCAGAC
AATTTAATAG CGTTTTATGT GCAGCAAACT AGATGGGCAA TGGGGACGAT GGAAGTTCTT
TACTGGAGAT TTAAATATAT TATTAGCAGC GATATAAAAC TTTGGCAAAA ATTAGAGATT
ATAGCTTATC TTTCTCAATA CATTCCAATT ATACTAACAT TTATAAGTGC AATTATTTTT
GCAATAGCTG GTTTTGTAGG TATAAGGCTT AGTATGAATT TGCCAATGTT TATTATATGG
GCCATAACAC TATCAATCTA TGCTTCAATT TTTGTCAATA GCGCAAAGAA GTCAGGGATA
GATACTGTGA CAGCTATAAA AGCCTTGGGA AGGTTGTCTG CATATACTGT TGGAATTTCG
CCATTTTTGC TTATTGGTAC AATTAACGCA TTTAAGAAGA CTAGAACGTA TATTGTTACT
CCTAAAGGCA AGAAGGCTAA AAGTAATATA GGCTATCCAA TCTTAGCCTT TGGTGTATTT
TTCATTCTAT CAGCTTTCCT TTATATGTTT AGAGGCGACT TCCTAACCTT TATTTGGCTA
GCTTACTATT CGATAGCTTT CCTCTACACC TTTATAGCAT ATATTAAAGG ATTATGA
 
Protein sequence
MILNILLNLA IFIIPSLAVW NQIILYIFGK SGDIANLILL NNESNLPKLS IIVPTKGERI 
EVIQGLIDNI HEAIWDRNKL EIIIVSDDEQ EYFDKLLSTL IIPPDLDVRI FRREKKLGYK
SGALAFGLQK STGDLILTLD VDARIEKDSL IKAYNHMVNL GCDAITMEWH GYSNISTSLA
KALMVSTVLT SKSILRGRDK LGLKVLPIGC GTIYKRSALE AVNGWDYKMI QDDYELGARL
INKGFKVCAS SSPVYVEVPD NLIAFYVQQT RWAMGTMEVL YWRFKYIISS DIKLWQKLEI
IAYLSQYIPI ILTFISAIIF AIAGFVGIRL SMNLPMFIIW AITLSIYASI FVNSAKKSGI
DTVTAIKALG RLSAYTVGIS PFLLIGTINA FKKTRTYIVT PKGKKAKSNI GYPILAFGVF
FILSAFLYMF RGDFLTFIWL AYYSIAFLYT FIAYIKGL