Gene Ssol_2623 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSsol_2623 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSulfolobus solfataricus 98/2 
KingdomArchaea 
Replicon accessionCP001800 
Strand
Start bp2404706 
End bp2405902 
Gene Length1197 bp 
Protein Length398 aa 
Translation table11 
GC content39% 
IMG OID 
Productmajor facilitator superfamily MFS_1 
Protein accessionACX92728 
Protein GI261603125 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.600696 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATAAGA TAAGCGTTTT TTCCCTAACA ATCTCAAGGA TTTTGAGAAG TTTGTCAGCA 
GGTATAATAT TCGTAATCCT ACCCTACTTG GTTCTAGTTG AATTGAAATA CTCATCGCTA
ATTCTAGGAT TTATATACAC CGTTGGCACA TTTGCTACAG CTATACTTGG ACTCGCTGTA
GGTTATCTTG CAGACTTATA TGGAAGGAAA AATTCCCTAA TTCTAGTTTC ACTTTTCCTG
CCTTTAAGTG TATTACTTAT TTTCATAAAT CACTCCTTAA TAGCGCTCTT TATTGCCTCA
GCCTTAGGAG GGTATTCCGC CACTGGTGCA GTCGCCGGAG GTGGAATTGG GGGAATAGTA
GCTCCAATTC AGAACGCGCT AATTACTGAG CTAACTGATA ACGAGGATAG AACATTTTAC
TTCTCCCTAT TCACATTTTT ATCTGGAATA TCCGCTTCAA TTGGTTCCCT CGTAGCTGGA
TTCTTTACTT CAAAGCAAGG GTTTCTGTTT GCGATAATAC TAGGATTTCT GTCAGTTTTA
GCACTATTTC CAATAAGGGC TAAAAACATA AAGGCTAGGT CAGCCTCATT GAAAAGTAAA
GTTGTCATAG GGAAGTTCTC AATTACGGGG TTATTAAATG GCATATCTGT AGGTCTAATA
ACTCCATTCC TAATTCCGTA TTTCATTATC GTATATCACA CGCCAAAATC CGAAATGTCA
ATCTACACTT TCCTTAGTAG TCTTATCGCC TCAACAATAA TCCTCTTGGC CCCGGTTTTA
GATAAGAAGA TAGGATTTTT GAAGAGTATA GCTATAACTA GGGGTATAGG AGCACTTCTT
TCAATTATAA TGCCCCTCAT AAGGGTATTC CCAATATCCC TAGGAATTTA CCTTATACTT
CCGGGAATGA GAGTGCTAGC GCTACCCATA CAACAAAGGG CTATGACGGA AATGGTGAGC
CAAGATGAAG TAGGTAGAGC CATGGGAATT AATCAAGTGA CTAGGTTAGC AGCATCCTCT
GGTTCAACTG GTCTCACTGG TTACTTATTC TCGGAATCGC AGATTGATGT ACCATTCTTA
GCCTCCGGGA TCATAATGGC TTTAAATATT TACATGTACT ATAAATTCTT TGGAGGTAAA
AATGAAGTTA ATAGACCTTC ACGAGGATCT AGCTTACTCA AACCAACAAG GGATTGA
 
Protein sequence
MNKISVFSLT ISRILRSLSA GIIFVILPYL VLVELKYSSL ILGFIYTVGT FATAILGLAV 
GYLADLYGRK NSLILVSLFL PLSVLLIFIN HSLIALFIAS ALGGYSATGA VAGGGIGGIV
APIQNALITE LTDNEDRTFY FSLFTFLSGI SASIGSLVAG FFTSKQGFLF AIILGFLSVL
ALFPIRAKNI KARSASLKSK VVIGKFSITG LLNGISVGLI TPFLIPYFII VYHTPKSEMS
IYTFLSSLIA STIILLAPVL DKKIGFLKSI AITRGIGALL SIIMPLIRVF PISLGIYLIL
PGMRVLALPI QQRAMTEMVS QDEVGRAMGI NQVTRLAASS GSTGLTGYLF SESQIDVPFL
ASGIIMALNI YMYYKFFGGK NEVNRPSRGS SLLKPTRD