Gene Ssol_0518 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSsol_0518 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSulfolobus solfataricus 98/2 
KingdomArchaea 
Replicon accessionCP001800 
Strand
Start bp461435 
End bp463138 
Gene Length1704 bp 
Protein Length567 aa 
Translation table11 
GC content36% 
IMG OID 
Productmajor facilitator superfamily MFS_1 
Protein accessionACX90797 
Protein GI261601194 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGTTCAGT ACAAGTGGAT TGCATTGTCC AATACAAGTC TCGGTGCATT CATGGGATTT 
ATGAACGCTA ATGTAGTATT AATAGCATTA CCAGCAATAT TTAGAGGAAT TAATATTAAT
CCGTTTAACT CTTTCCAATA CTTATTATGG ATCTTATTTG GTTATAGTAT AGTATCAGCA
ATCTTAGTAG TGAATGTAGG TAGGATCTCA GATATCTTCG GAAGGGTGAG AATGTACAAT
TTAGGATTCC TAATTTTTAC AATTGCATCT ATACTATTAT ATTTAACCCC TGGCAAAGGA
GATATAGCCG CAATTCAATT AATAATTTAC AGGATACTTC AAGGTATAGG AGGATCATTC
TTAATGGCTA ATAGCGCTGC AATTTTATCT GAGGCATTTC CACCAAATGA GAGGGGATTT
GCATTAGGAT TAAATGGCGT AATAGGAATA TTCGGTGGAG TAGCTGGAAT AATCATCGGT
GGAATTCTTG CTTCAATATA CTGGCGTGAC GTATTTTTAG TGAGCGTTCC TATAGGTATT
TTAGGAACTA TTTGGTCATA TAAATCCTTA AGGCAATTAA ATAAGCCTAA TAGAAATCAA
AGTATCGATA TAGTTGGAAA TATCCTTTAT GCCGTTTCCC TTATATCAAT ACTGCTTGGG
ATAACGTATG GTATTTTACC ATATGGTAAT CAAGTTACTG GTTGGACAAA TCCATTTGTG
ATATCGGGTA TTGTAGCAGG TGTAGGAATG TTCATCGGAT TCTTGTTCGT TGAGAGTAGA
GTTAAAGACC CCATGTTTAG AATAGAACTA TTTAAAATAA GGGCATTTAC ATCTTCAGCA
GTATCAATAA TCTTAGCACA GCTAGCATTT GGAGGTCTTC AATTGATGCT AGTATTATTA
CTTCAAGCCA TATGGCTACC TTTACACGGT TATAGTTATG AAGTAACACC TTTCTGGGCC
GGAGTATATC TTCTTCCACT ATTAGCTGGA TTTGGAATAA TGGGGTCAAT AGCGGGTAGA
CTAGCAGATC GTTATGGAGC GCGAAGCTTA GCCACTATTG GACTTTTAAT CATGGGAATA
GGAGTTTTGA CACTAACAAC ATTGCCATAT AATTTCAACT ATATAGAATT TGCTGTGATA
ATATTTTTCA TAGGTGTTGG AAATGGACTA TTTGTATCGC CTAATATGAC AGCCTTAATG
AATGCCTCAC CACCACAGCA TAGAGGATCT GCTTCTGGAA TAAGAGCTAT GTTAACTAAT
ACCGGTAGTA CATTAAGTAT TGGGATATTC TTTACTATAG TAATTGACAC CTTATACATT
TCATTACCTC CAGTATTAAC TAATGCATTA ACTGCTGCTG GAGCTCCTCA ACTAGCCCCA
ATATTAAGTA AAATACCTCC AACTGCTGCG ATATTTGCGA GTTTTCTAGG CTATAATCCT
GTTTCGACGA TACTTTCGCA ACTACCGACC TCATTAGTTA ACGCAATTCC TGCCTCTACA
ATAGCTACCA TAACCGGTAC ATATTGGTTC CCTAACGTGA TAGCATCACC CTTCATGGAA
GCTCTAAGAA TTGCATTTTA CGTTTCAGCA TCAATGGCAT TTTTAGCTAC TATAGCTTCA
GCTTTAAGAG GAAAAACGGT GATTTATGAA AGAGATTTAA TGAGACCTAT GGCTGTAAAC
TCCTCAGATG ATAAAAAGGA TTAA
 
Protein sequence
MVQYKWIALS NTSLGAFMGF MNANVVLIAL PAIFRGININ PFNSFQYLLW ILFGYSIVSA 
ILVVNVGRIS DIFGRVRMYN LGFLIFTIAS ILLYLTPGKG DIAAIQLIIY RILQGIGGSF
LMANSAAILS EAFPPNERGF ALGLNGVIGI FGGVAGIIIG GILASIYWRD VFLVSVPIGI
LGTIWSYKSL RQLNKPNRNQ SIDIVGNILY AVSLISILLG ITYGILPYGN QVTGWTNPFV
ISGIVAGVGM FIGFLFVESR VKDPMFRIEL FKIRAFTSSA VSIILAQLAF GGLQLMLVLL
LQAIWLPLHG YSYEVTPFWA GVYLLPLLAG FGIMGSIAGR LADRYGARSL ATIGLLIMGI
GVLTLTTLPY NFNYIEFAVI IFFIGVGNGL FVSPNMTALM NASPPQHRGS ASGIRAMLTN
TGSTLSIGIF FTIVIDTLYI SLPPVLTNAL TAAGAPQLAP ILSKIPPTAA IFASFLGYNP
VSTILSQLPT SLVNAIPAST IATITGTYWF PNVIASPFME ALRIAFYVSA SMAFLATIAS
ALRGKTVIYE RDLMRPMAVN SSDDKKD