Gene Ssol_1970 details

Gene Information

Locus tag: Ssol_1970
Symbol:
ID:
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sulfolobus solfataricus 98/2
Kingdom: Archaea
Replicon accession: CP001800
Strand:
Start bp: 1757293
End bp: 1758483
Gene Length: 1191 bp
Protein Length: 396 aa
Translation table: 11
GC content: 37%
IMG OID:
Product: major facilitator superfamily MFS_1
Protein accession: ACX92181
Protein GI: 261602578
COG category:
COG ID:
TIGRFAM ID:
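
The length fields in this record can be cross-checked against its coordinates. The sketch below assumes the start and end positions are 1-based and inclusive (the record does not state this, but the arithmetic agrees with the reported 1191 bp) and that the unspliced CDS ends in a single stop codon. The function names are illustrative only.

```python
# Values copied from the Gene Information record.
START_BP = 1757293
END_BP = 1758483


def gene_length(start_bp, end_bp):
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp):
    """Residue count for an unspliced CDS that ends in one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # drop the stop codon


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1191
print(protein_length(length_bp))  # 396
```

Both results match the Gene Length and Protein Length fields of the record.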


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAATCA GAATACTAAC CTTAACTTCA TTATCCCACT TTATAAACGA CGGAAATAGC 
TGGGTCCTTC CAGTGACGTT CACATTCCTA ATAACGTATC TTGGTATATC AAAATTCCTA
ATCGGAATAC TATCTGGCGC ATTTTTCTTC GGAATATCAG CATTAGCTTC GCCTTTAGTC
TCCAAGATAG CAGACAAGTT TACCAATTAT TCCAGCATAA TGGGAATAGG AATATTACTA
TGGGGAATTG GATTAATATC ATTCGGTTAC TCAATACAAC TCCACTTTTT GCCATTAGTA
TTCATTTCAG TGGCAATAGC TGGTTTTGCA TCAGCATTCT ATCACCCAAT AGGTGCGGCT
GTTCTATCAA TAACATACAA GGGAAATGCT GGTATTGCAT TAGGCATAAA CGGGTCAATG
GGTAGCCTTG GCAGGGCAAT TTACTCAACA TTAACCCTTT CACTATTTGC AATATTGAAT
AAGGATATGA CCTTAGATAT GTTAATAATA GGTATAATAT CAATAATAGC TGCATTGCCA
TCAATATTCC TAAAGATTTC TATCACGAAA GAGGAGGATC ATAAAACACC TTCCTCTTCC
AATACCACTA GTACCAGAGG CACATTATTT GTAGTAATCT TATTGACTAT CATTGCATTA
CTACGAAGTA TATTTGGTCA AGGAATTTCA CAATTCCTTC CAACATTATT AGTAGAAAAT
TATGGTTATT CTTACAACGT TAACTTAGGT GAAGCAATTA CAATCGCTCT AGCAGCAGCT
ATAGTAGGGC AACCAATACT AGGATTCCTA TCAGATAGAG TAGGGAGAAG GCTAATTTAC
GCTATATCGA CCTTTGGTGC TGCCTTAACA TTACTTTTGT TTCTAAAAAT ACCAAACATA
GCCTTGCTAT CATTATTTGG ATTTTTTAAC TTCAGCGCAT TCCCACTAAT GCTATCAATA
GTAGGAGATT TTGTACCTAG AAATTCAGCG AGTTTTGCCA ATTCACTAGT TTGGGGATTA
GGAGTTACTG GTGGTGGAGT TATTGGTCCA ATAGTAGTGG GAGCAGTATC CCAAGTTTCA
AACTTAGTGT TCGCAAGTGA AATAGTAACC ATAATGGCTT TCGTCGCAGG AGCGTTAACA
GCATTAATTC CTAAACCACC AAAGAGAACC AAAGTACCAT TATTTGGATA A
 
Protein sequence
MKIRILTLTS LSHFINDGNS WVLPVTFTFL ITYLGISKFL IGILSGAFFF GISALASPLV 
SKIADKFTNY SSIMGIGILL WGIGLISFGY SIQLHFLPLV FISVAIAGFA SAFYHPIGAA
VLSITYKGNA GIALGINGSM GSLGRAIYST LTLSLFAILN KDMTLDMLII GIISIIAALP
SIFLKISITK EEDHKTPSSS NTTSTRGTLF VVILLTIIAL LRSIFGQGIS QFLPTLLVEN
YGYSYNVNLG EAITIALAAA IVGQPILGFL SDRVGRRLIY AISTFGAALT LLLFLKIPNI
ALLSLFGFFN FSAFPLMLSI VGDFVPRNSA SFANSLVWGL GVTGGGVIGP IVVGAVSQVS
NLVFASEIVT IMAFVAGALT ALIPKPPKRT KVPLFG
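
The protein sequence follows from the gene sequence by codon-wise translation. As a minimal sketch, the code below builds a codon table in pure Python and translates the first 30 bases and the final line of the gene sequence. It relies on the fact that NCBI translation table 11 assigns the same residues to codons as the standard code (the two differ only in permitted start codons). The names `CODON_TABLE` and `translate` are illustrative, not part of any database tool.

```python
from itertools import product

BASES = "TCAG"
# Residues for the 64 codons in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate an in-frame DNA string; whitespace between blocks is ignored."""
    dna = "".join(dna.split()).upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


# First three 10-base blocks of the gene sequence.
first = translate("ATGAAAATCA GAATACTAAC CTTAACTTCA")
# Final line of the gene sequence (starts in frame at base 1141).
last = translate("GCATTAATTC CTAAACCACC AAAGAGAACC AAAGTACCAT TATTTGGATA A")

print(first)  # MKIRILTLTS
print(last)   # ALIPKPPKRTKVPLFG*
```

The two outputs agree with the first ten and last sixteen residues of the listed protein sequence, with the trailing `*` corresponding to the TAA stop codon.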