Gene Ssol_2108 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSsol_2108 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSulfolobus solfataricus 98/2 
KingdomArchaea 
Replicon accessionCP001800 
Strand
Start bp1885372 
End bp1886523 
Gene Length1152 bp 
Protein Length383 aa 
Translation table11 
GC content37% 
IMG OID 
Productmajor facilitator superfamily MFS_1 
Protein accessionACX92314 
Protein GI261602711 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTTAAAA ACAGAAACGA AGTCTTGAAA ATCTCCTTTT CAGCCTTTTT CGCTGATCTT 
GGATACCAGG CTGTAGTAGC CTCTTTTCCA ATAATTTTCG TATTGATCTT TAAAGCTCCA
ATTCCCCTTT ATGGTTTTGC TGAGGCATTA AATTACGGGA TTGGTACTGT TATGGCTTAT
GCTGGAGGTT TAGCTGGGGA TAGGTTTGGG AGAAAAAGGA TTGCTATTCT TGGAAATGTT
CTTATTCTAT TTACTTCCCT AATAGGACTA TCTAGGGATT ACATTCAAGC CCTCATATTC
TTCATGATAG GGTGGTGGTT TAGGAACTTT AGATCACCAC CAAGAAGGGC GATGATGGCT
GAAGTCACAT CACCGGAAGA GAGATCTGAG GCATTTGGAA TTTTGCATTC TTTAGACATC
GCTGGTGCGT TAATAGCAAT AATTTATCTA ACTGTATTAC TTTACCTTCG CGTTTCCATC
TTTTTTGTTC TTTTATTCAC CTCAATACCT TTGCTTATGT CAACAATTGT TTTAACCATG
GTAAATGCTG GGAAGAAAAG TGAGAAAGCG AAGAGAAAAG AAGCAGAAAG TAAAATAACC
CAAAAAAGGG TCTTCTGGAC TCTTATATTA TCTACAATGT TCTTCGGATT TAGTCAGTAC
AGCTTTGGAT TTCCTATTCT AACTACAACA GAGATTACTG GAAAGGAGTA TTTGGGAGTA
TTATCTTATG GCATATTTCT TGGCGCTTCC TCTTTATTTG GGTACCTATT TGGTAGAATA
AGAATGAAAG AATTTGAAAG TTTAGCATTT CTAGGATATT TAATTGGAGC ACTAGGATCT
CTGGGGTTTG CGTATTTATC GAGTTTTGGA GTGTTTTCCC TTTATCCTCT CTCTTTCTTA
ATGGGAACTA GTGTCGCCTC AACTGAGACT TTTGAACCTA CCATAATATC GAAGATAACT
AAAGAAGAAG CATTTAGTAC AAGTATGGGC TACTTATCAG CAGGTAGAAG TATTGGGATA
TTTCTTGGTA ATGTAATAAT GGGGTTCTTA TATCAAATAA GCTATACATA TGCGTATCTA
TTCGCCGCTA TAACTTCGTT AATCTCCTTT GCACTAATCT TAAACTTGAT CATGAGACCA
AATGCTTCTT AG
 
Protein sequence
MLKNRNEVLK ISFSAFFADL GYQAVVASFP IIFVLIFKAP IPLYGFAEAL NYGIGTVMAY 
AGGLAGDRFG RKRIAILGNV LILFTSLIGL SRDYIQALIF FMIGWWFRNF RSPPRRAMMA
EVTSPEERSE AFGILHSLDI AGALIAIIYL TVLLYLRVSI FFVLLFTSIP LLMSTIVLTM
VNAGKKSEKA KRKEAESKIT QKRVFWTLIL STMFFGFSQY SFGFPILTTT EITGKEYLGV
LSYGIFLGAS SLFGYLFGRI RMKEFESLAF LGYLIGALGS LGFAYLSSFG VFSLYPLSFL
MGTSVASTET FEPTIISKIT KEEAFSTSMG YLSAGRSIGI FLGNVIMGFL YQISYTYAYL
FAAITSLISF ALILNLIMRP NAS