Gene Ssol_2792 details


Gene Information

Locus tag: Ssol_2792
Symbol:
ID:
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sulfolobus solfataricus 98/2
Kingdom: Archaea
Replicon accession: CP001800
Strand:
Start bp: 2554088
End bp: 2555188
Gene Length: 1101 bp
Protein Length: 366 aa
Translation table: 11
GC content: 32%
IMG OID:
Product: major facilitator superfamily MFS_1
Protein accession: ACX92875
Protein GI: 261603272
COG category:
COG ID:
TIGRFAM ID:
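
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (an assumption, though it agrees with the stated gene length) and that the final codon of this unspliced CDS is a stop codon. The function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length(2554088, 2555188)
print(length)                  # 1101
print(protein_length(length))  # 366
```

Both results match the Gene Length (1101 bp) and Protein Length (366 aa) fields listed in the record.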


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATGAATA GACAGTTTAT TTTGTTTACA ATATTAGTCT TTTTTACAGG ATTATATCTA 
GGAACTCTAA GAATAATTAT TCCGGTATTT GAGAAACAAA TAAACATCTC AATAATGTTA
AGCCTATTAT TACCCTTGGT ATCCTTTGGG TTTGTAAAAG GCGCATTTAA CTTCATTGCG
GGAAAGCTCT CTGATGACTT GGGAAGAAAG AGAGTACTCG TAATAGGCTG GTTAGTGGCG
TTGATTTCAG TCCCTTTATT TCTCTCAATT AACATATATA CAGTCATCAT TATTTCGATT
CTGCTTGCAA TAAATCAAGC TTTGACGTGG ACTACTACCG TCACTTCACA AATAGACATT
AGCGGTAAAT TAAGAGCAGG CTTCGCTACT GGAATAAATG AAATGTCGGG ATATTTGGGA
GTCTCTTTTG GAAGTCTCTT CGCTAGTTAT TTATTTAAGC TAAGTAGTAT TTTCATCGGA
ATAATTTGCT TGATAGCATT AATTTCTTCC TTTAACGTAA TTGAGACTAA AACATTAATA
CCAAATGCCA CTTTATCGAA AAAGGAAAAT AATCATATAA ATTACTTTTC CATTACTAAA
ATAAGCATTG CAGGACTCCT AGAGAAGTTT GTAGATTCAG CATTCTTTAT CTTGATACCC
ACATTTCTAT TATTACAACA TTATACGTTA TTTTTGATAG GAATAACTGT ATCTAGCTAT
ACGTTTACTT GGTCACTTTC GCAACCACTA TTCGGGTACT TGGCAGATAC TTACAACAAA
AGAAGACTAA TACTTGTAAT AGGTTTTTTA TTAATGTTTG TTGGCTTTAT AAAATATTCT
GAACTTCCGA TTTTATTTTC AATAATAGAA GGTATTGGTA TGGGCATGAT CTATCCTAAT
TTAATAGCTT TTGTTAACGA TAAGATTAAC GAGAGTGTAA GAGGAAAAGC ATTAGGCTAT
TACAGGTTAT ATAGGGATAG TGGATATGGT GTGGCTGGCT TACTACTACC ATTACTTTAC
TCGTTTTATG GATACGAATA TACTTTATTG ATAGTAGGAA TATTGCAAGT TGTAGCTCTC
TTACTAGTAG TAAGATCTTA A
 
Protein sequence
MMNRQFILFT ILVFFTGLYL GTLRIIIPVF EKQINISIML SLLLPLVSFG FVKGAFNFIA 
GKLSDDLGRK RVLVIGWLVA LISVPLFLSI NIYTVIIISI LLAINQALTW TTTVTSQIDI
SGKLRAGFAT GINEMSGYLG VSFGSLFASY LFKLSSIFIG IICLIALISS FNVIETKTLI
PNATLSKKEN NHINYFSITK ISIAGLLEKF VDSAFFILIP TFLLLQHYTL FLIGITVSSY
TFTWSLSQPL FGYLADTYNK RRLILVIGFL LMFVGFIKYS ELPILFSIIE GIGMGMIYPN
LIAFVNDKIN ESVRGKALGY YRLYRDSGYG VAGLLLPLLY SFYGYEYTLL IVGILQVVAL
LLVVRS
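
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It assumes the sequence blocks are read with spaces and line breaks removed, and it uses the standard codon assignments, which translation table 11 shares for elongation codons (table 11 differs only in its permitted start codons). The helpers `translate` and `gc_content` are hypothetical names introduced for illustration; only the first and last stretches of the gene sequence are used here to keep the example short.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G; third base varies fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(dna: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(dna.split()).upper()


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    seq = clean(dna)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_content(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(dna)
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


# First 30 bases and last 21 bases of the gene sequence above.
print(translate("ATGATGAATA GACAGTTTAT TTTGTTTACA"))  # MMNRQFILFT
print(translate("TTACTAGTAG TAAGATCTTA A"))           # LLVVRS
```

The two outputs correspond to the first ten and last six residues of the protein sequence. Passing the full gene sequence to `gc_content` would give a value to compare against the GC content field in the record.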