Gene Ssol_0822 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSsol_0822 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSulfolobus solfataricus 98/2 
KingdomArchaea 
Replicon accessionCP001800 
Strand
Start bp768804 
End bp770075 
Gene Length1272 bp 
Protein Length423 aa 
Translation table11 
GC content37% 
IMG OID 
Productmajor facilitator superfamily MFS_1 
Protein accessionACX91070 
Protein GI261601467 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGGTATC GTTCTCTAGT TGAAACATCA AGAAGTAAAA GATACGTGAG ACTACTCCCT 
ATGCTCTTTT TCTTATATAT AGTAAACTTT CTCGATAGAG TAAATATTTC CTATGCAATA
GATGCCGGAA TGTTCCAATA TCTAGGAGTT TCTGCTAAGG AAGTTGGTAT AATTGCTTCC
CTAGCTTCAT CATTATTCTT TGTAGGTTAT TTTATTCCCC AAATCTTCTC AAATCTCGGT
ATTAACAGAT ATGGTGTTAG GAAAATATTT CTTATAGCAT TTCTAGCGTG GGGCATAATA
ACTATTGCAA CTGGTTTTGT AACCTCAGTT TCGCAAGTAT ACGTACTAAG ATTTCTACTT
GGAATAGCTG AAGGACCATT CTTTGCGGGT GTAATGTTCT ATCTGAGTTT GTGGTTCTTG
AAACCAGAAA GAGCTACTGC CAATAGTTTC TTTACTGCTG CAATTCCAAT ATCTGGAATA
TTTGGTAGTC TAATTGCTGG AGCAATATTT GCAACTTATG GTAACTATCC AGGATGGCGA
TATTTGTTCT GGTATGAAGG AATGTTGGCA ATATTTGGTG GGTTACTTGC GTACTTTATT
TTAACAGACT TCCCAAGTGA TGCCAAGTGG TTAAGCAATG ACGAAAGGAG TGCATTAGAG
CAAGCATTTA AGGAGGAGGA GGTGGAGAAG GTTAAGGTTA GTTGGACTAA AGCATTAGCT
GACTTAAATG TAATATTACT TGCCATTGTT TACTTTTTAG GAGTTACAAG TCTTTACGGC
TATTCGATCT GGCTACCATC AATTATTAGC TTTATTGGTA AAGTTAATGC TACAATTGCT
AGTTTTCTGT CAATAGTACC ATATGTTATT GCCTCGATAT CATTAATTCT CATAGCTCGC
TATGCTGATA GAAGGCAAAA TCATAGATTT GTAACCTTTG CAGTATTTCT AGTTGCTGGA
ATTGGTTTAG CATTAAGTGC TGCAACTCAG AGTATATTTA TTTTATCATT CATGCTTTTC
GCAATAGCTG CTATAGGGAT TTACAGCTTC ATTGCAACGT TTTGGGCGGT ACCTCAAGGT
TATTTATCTG GCGATGCAGC TGCAGCTGCT ATAGGTTTGA TAAACGCTAT TGGAAATCTT
GGAGGAATTG CCGGTCCTAT AGTAGTGGGA TTTCTAAAAT CGTACACTGG ATCATTTGTA
GATGGAATTT ACGTCATGGC GCTATTCTCA ATTTTAGCTG GTGTAATAAC ATTGCTTATC
AGGAGGAGTT GA
 
Protein sequence
MRYRSLVETS RSKRYVRLLP MLFFLYIVNF LDRVNISYAI DAGMFQYLGV SAKEVGIIAS 
LASSLFFVGY FIPQIFSNLG INRYGVRKIF LIAFLAWGII TIATGFVTSV SQVYVLRFLL
GIAEGPFFAG VMFYLSLWFL KPERATANSF FTAAIPISGI FGSLIAGAIF ATYGNYPGWR
YLFWYEGMLA IFGGLLAYFI LTDFPSDAKW LSNDERSALE QAFKEEEVEK VKVSWTKALA
DLNVILLAIV YFLGVTSLYG YSIWLPSIIS FIGKVNATIA SFLSIVPYVI ASISLILIAR
YADRRQNHRF VTFAVFLVAG IGLALSAATQ SIFILSFMLF AIAAIGIYSF IATFWAVPQG
YLSGDAAAAA IGLINAIGNL GGIAGPIVVG FLKSYTGSFV DGIYVMALFS ILAGVITLLI
RRS