Gene Ssol_0530 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSsol_0530 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSulfolobus solfataricus 98/2 
KingdomArchaea 
Replicon accessionCP001800 
Strand
Start bp474884 
End bp476143 
Gene Length1260 bp 
Protein Length419 aa 
Translation table11 
GC content37% 
IMG OID 
Productmajor facilitator superfamily MFS_1 
Protein accessionACX90809 
Protein GI261601206 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.649609 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAGGAA ATCTAAGATG GAAGATCGTT CTGATTATAT TTGCATTAAT TATAGTAGAC 
TACATAGATA GGGGGCTAAT CAATACTGCA CTACCAGTAC TAAAAAGTGA ATTCCACATA
AGTTCTTTTG AAGCTGGAAT AATTGGAGAT GGTTTCACTT TTGGATATCT TATAATGAAC
CCTTTAGTTG GTTATTTTTT GGACAAGTAC GGACCTAAAA GGGTTTTTAG TAGATTTGCA
ATCCTTTGGG GGGCTGTACA AGCTATTAAT GTATTTGCTT TCTCAACTTT TTACTTTATA
GTAACTAGAG TATTATTGGG AATTGGAGAA GCAGTAGGTT TCCCAGGAGT TACAAAGATA
GTCGCAAATT GGCTTAGAAA AGACGAAAAG GCTAGAGGTG GTACCATTTC AGATTCTGGG
GTGAATTTAG GGATAGTTTT CGGTTCCTTA TTTATGTTGG GATTGTTTGC AATAATTCCT
AACCAAGAAT TAGCGTGGAG GTTGGGATTT TTAGTTAGTG GTCTTTTGGC TATAATTCTA
GCTATCATAT TAGGTCGCCT TTTGTATGAT TTACCAGAAC AGCATCCGAA GATTTCTAAG
GAAGAATTAG ACTACATTTT GTCAGGCAGA GAAAAGGTTG GCGTGAAAAC TAAGCTTCCA
TTGTCATATT GGTTAAGAAG TAAAAATTAC TGGGGTTATA TGCAAGGTTT GGGTGCGCAA
GCTGGGATAT TCTTCGGTCT ATTTACTTGG TTGCCTTTAT ATCTTTTTTA CGCTAGACAT
CTTTCTCTAT CGTTTACATT AGAATACACT GCATTGATTT GGAGCTTTGG CTTCATAGGA
GAGTTAGTAG GAGGTTATGT CGTTGACAAA TTAATTAAGA GAAATCCCAA TTTAGGTTTC
AAAGTAGGAT TTGCGGTTAG CTCTTTGGCA GTAACTATAG GTCTTGCAGC TGCTACTCTC
ATTTCTTCAC CGATAGAAGC AGTTGAAATA CTAATGGTAA CTTTCTTCTT CCTAAGATGG
TCGGGGATAC AATGGGCAGC ACCATCCTTT CTAGTTTCTA CTGAATTAGC TGGTCAATTT
GGTGGTCATA TTGGCTTTTG GGAAACTCTA TGGGGTATAA TAGTACCAAT AGTATTTGGT
GCTACTGTGG AAGTCACCAA AGCATATCTC TTAGGAATGG AAATCTTAAT TGGGATAGGT
CTTATATACT TCATCGGAAC AGTAATAATA ACTAACTATA GGCAAATAAA GGTGAGCTAA
 
Protein sequence
MKGNLRWKIV LIIFALIIVD YIDRGLINTA LPVLKSEFHI SSFEAGIIGD GFTFGYLIMN 
PLVGYFLDKY GPKRVFSRFA ILWGAVQAIN VFAFSTFYFI VTRVLLGIGE AVGFPGVTKI
VANWLRKDEK ARGGTISDSG VNLGIVFGSL FMLGLFAIIP NQELAWRLGF LVSGLLAIIL
AIILGRLLYD LPEQHPKISK EELDYILSGR EKVGVKTKLP LSYWLRSKNY WGYMQGLGAQ
AGIFFGLFTW LPLYLFYARH LSLSFTLEYT ALIWSFGFIG ELVGGYVVDK LIKRNPNLGF
KVGFAVSSLA VTIGLAAATL ISSPIEAVEI LMVTFFFLRW SGIQWAAPSF LVSTELAGQF
GGHIGFWETL WGIIVPIVFG ATVEVTKAYL LGMEILIGIG LIYFIGTVII TNYRQIKVS