Gene Ssol_0861 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSsol_0861 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSulfolobus solfataricus 98/2 
KingdomArchaea 
Replicon accessionCP001800 
Strand
Start bp803740 
End bp805191 
Gene Length1452 bp 
Protein Length483 aa 
Translation table11 
GC content37% 
IMG OID 
Productmajor facilitator superfamily MFS_1 
Protein accessionACX91108 
Protein GI261601505 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTATGATT ATTCTGATGG AGAGAGTATG TCACAATACT CTAAAGAAGA AATAGATAGG 
GGTATCAAAA AGATATATGA TACGGTTTTG AATACAAAAA ATATAACCTC AAGATATATA
GTAATTTTAG CATTAGCTAG TCTCTGGCTA GATGCCTATG ACTTCGCAAG TATGACTTTT
GGTACTGCAT CCCTTAAGAG CACTTTCCCT TCTGTACCAT CTGTTTTAAT TTCCTTGGCT
ATAGGTGCAG TACAATTAGG TGCTATTATA GGTGCAGTGG TTGGCGGTTG GCTAAACGAT
CGTATTGGTA GGAGGAACAT GTTCATACTT AACATGATAT TATTTACGGG TATGGCAATC
TTGGGAGGAT TATCTACTAA CATTTTAGAA TTATCAATCT TTAGAGGCTT ATTAGGTTTC
GCTTTAGGGG CTGATACTGC AACAGGTTTT GCTTATATAT TCGAGTATTT AGAGAAAAAA
CAGAGACTAT TTTGGTCGAA TCTATGGCAA ATACAATGGT ATATCATGTA CGAGGTTACC
ATAGCTTTAA TAGTAGTTCC ATTCTTCTTT ACAGTTCACT CATTGACATC TCCCTTATTA
TGGAGAATTA TTATGATAGT TGGAGGTGTT CTAGCTTTAG TAATATTACT ACTAAGAGGT
AGGATTCCAG AATCGGTATT ATGGTTAGCA TACCAAGGTA GATTAGCTAC TGCCAAGAAA
ATATTGAAAC AGACCTACGG AATAGATTTG CCAGAGGTCC CAGACGTAGA TTTGCAATTA
AGGGCACCAG CTAGAGGAGT AAGGAGTGCC TTTAGTATAT TTAGAGCGAG TAAATGGAGG
GAGCTGGTCT ATTCCTTTAA CGGTAACTTT GAACAAGGAT TCATATTCTA TACATTTGGT
TTTTATGTAC CTTATATATT ACTAGCGCTA AAACTGGCTG GGCCACTTGC CTCAATAGTA
GCTTCTGCAT TCCTATATGC CGGAGGCGTA ATAGGTGGCT ATTTGACAGC ATGGCTAACA
CCTAAAATTG GTACTAAATC ACAATATGTA ATAGGAGCAA TAGGTGAGGG AATTTCAGTA
GGGTTAATTG CACTTACGTA TATTTATCAC TTACCACTAA TATATTTTGT AGTGTTCTCT
TTCCTATTTT ACTTCTTCCA TGTCATAGGT CCAGCCAGTC AAGGGATGAC ATCCATAAAT
GCATTTTTTG GAGCCAAAGA AAGAGGTACT GCTGCGGGAT GGGGATATTT CTGGGTAAAA
TTGGCTGCTT TTATAGGATT GCTAATAGGA ATTGTTGGTA TCGCATATAA CCCAGTTTCC
TTGACATTAG GTTTGGCAAT TTATGGTGTA TTAACAGGAA TAGTAGGGTT AATAATAGGT
TATGATACAA GGACATACAA ATTAGCAGAT GTGGAAGAAT TAGGAGAGAA TCCACAAGAG
AGTGTACAGT AG
 
Protein sequence
MYDYSDGESM SQYSKEEIDR GIKKIYDTVL NTKNITSRYI VILALASLWL DAYDFASMTF 
GTASLKSTFP SVPSVLISLA IGAVQLGAII GAVVGGWLND RIGRRNMFIL NMILFTGMAI
LGGLSTNILE LSIFRGLLGF ALGADTATGF AYIFEYLEKK QRLFWSNLWQ IQWYIMYEVT
IALIVVPFFF TVHSLTSPLL WRIIMIVGGV LALVILLLRG RIPESVLWLA YQGRLATAKK
ILKQTYGIDL PEVPDVDLQL RAPARGVRSA FSIFRASKWR ELVYSFNGNF EQGFIFYTFG
FYVPYILLAL KLAGPLASIV ASAFLYAGGV IGGYLTAWLT PKIGTKSQYV IGAIGEGISV
GLIALTYIYH LPLIYFVVFS FLFYFFHVIG PASQGMTSIN AFFGAKERGT AAGWGYFWVK
LAAFIGLLIG IVGIAYNPVS LTLGLAIYGV LTGIVGLIIG YDTRTYKLAD VEELGENPQE
SVQ