Gene Ssol_0334 details

Gene Information

Locus tag: Ssol_0334
Symbol:
ID:
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sulfolobus solfataricus 98/2
Kingdom: Archaea
Replicon accession: CP001800
Strand:
Start bp: 295372
End bp: 296652
Gene Length: 1281 bp
Protein Length: 426 aa
Translation table: 11
GC content: 37%
IMG OID:
Product: major facilitator superfamily MFS_1
Protein accession: ACX90626
Protein GI: 261601023
COG category:
COG ID:
TIGRFAM ID:
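
The start, end, gene length, and protein length fields above are linked by simple arithmetic. The sketch below checks that link, assuming the coordinates are 1-based and inclusive and that the CDS ends in a stop codon; both assumptions are consistent with the values in this record but are not stated by it. The function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming it ends in one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


# Values copied from the record above.
length = gene_length_bp(295372, 296652)
print(length, protein_length_aa(length))  # prints: 1281 426
```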


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAGATG AAACTAAAGT TTCCATAGCT GCCATGATAG GTATCGCCTT CGAGTTTTAT 
GATTTTTTAA TATTTGGTTT CGTATCGAGT ATTCTGGCTA GCTTGTTCTT TCCATCAACC
AACAAGATAG TATCTTTGTT AGATACATTA GCAGTTTTCG CTACGGGATT TGCAGGTAGA
CCGTTGGGCG CTATAGTATT CGGCCACCTT GGTGACAAGA TAGGAAGGAA ATATACCCTA
ATTGTAACTA TGTCGTTAAT GGGGTTATCA TCCTTGTTTA CTGGTTTATT GCCAAGCTAT
GCAGTATTGG GAATCTTAGC TCCTACGTTA TTAACAGTTT TAAGGCTTCT CCAAGGTTTT
TCTTTAGGAG GCGAATTCGG TGGTGGTATA ACATTATCTG CCGAGTTTGC TAAACCCACT
AATAGAGCTT TTTATATAGG AATTGCTCAG ATGGCTCAAG GAGTAGGACC CTTAATGGCT
ACTGGATTAA TATTCATTTT TAGTAGTATT ATGTCTCCTC CTGCTTTTGC CTCTACTGGT
TGGAGAATAC TTTTCATAAT TGGAGCATTT ATTGCTGTAA TTGGAGTAAT TATTAGATTA
AAAATTTCAG AATCACCAGT TTTCAAGAAC GTTAGGGAAA TGGGCCAGAT ATCAAAATTC
CCCCTTGCTG AAGCATTCAG ACTCTACTGG AAGAGAATAT TATTAGGTTT AGGATTTATT
ATAGGTGGAA CTACGTTAAC TTATGCTACT AGTGTTTTCG CGGCTTCTTA TTTAGAAACT
GTAATAGGAG TACCAGCGAA GACTGTTTCA TTAGCGTTAA CAATAGGATA TATTGTGGAA
GCTATATGTA TACTTGCATT TTCACTACTG GCTGATAAGA TAGGGAGGAA ACCAATGATG
ATAACTACCG CAGTCGGTCT ATTAATTCTT GTGTATCCGT ACTTCTACTT AATTTCTACC
GGTCAGTTCT CATTAATATT ATTAGCTCAA ATTTTATACT CAACAATAGG CTCATTTTCA
ACTGCAGCTT ATGCTGCTGC CTTAACTGAA CTGTTTCCAA CTAAAGTTAG ATATACGGCA
TTATCCTTCG ACTATCATGT AGGAGTTGCA GTTTTTGGAG GTACAACACC ATTTATTGCA
AGTTATTTAA TATACGCTAC AGGTTATAAG CTAGCTCCGG TATATTGGGG TATAGCAGGG
ATGGTAATAA CATTAATAGC CTATTTGTTA TATAAGGAGA CTAAGGGGAC AATATTTGAA
GGGCAGGAAA GAGTAAGGTG A
 
Protein sequence
MKDETKVSIA AMIGIAFEFY DFLIFGFVSS ILASLFFPST NKIVSLLDTL AVFATGFAGR 
PLGAIVFGHL GDKIGRKYTL IVTMSLMGLS SLFTGLLPSY AVLGILAPTL LTVLRLLQGF
SLGGEFGGGI TLSAEFAKPT NRAFYIGIAQ MAQGVGPLMA TGLIFIFSSI MSPPAFASTG
WRILFIIGAF IAVIGVIIRL KISESPVFKN VREMGQISKF PLAEAFRLYW KRILLGLGFI
IGGTTLTYAT SVFAASYLET VIGVPAKTVS LALTIGYIVE AICILAFSLL ADKIGRKPMM
ITTAVGLLIL VYPYFYLIST GQFSLILLAQ ILYSTIGSFS TAAYAAALTE LFPTKVRYTA
LSFDYHVGVA VFGGTTPFIA SYLIYATGYK LAPVYWGIAG MVITLIAYLL YKETKGTIFE
GQERVR
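
The gene and protein sequences above are printed in space-separated blocks of 10 characters, so they need cleaning before use. The following minimal sketch strips that formatting and translates DNA to protein. It uses the standard codon assignments, which translation table 11 shares (table 11 differs only in which codons may act as starts). The names `clean` and `translate` are hypothetical helpers, not part of any database tool.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and uppercase the result."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate DNA in frame 1, ignoring any trailing partial codon."""
    dna = clean(dna)
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First and last display blocks of the gene sequence above.
print(translate("ATGAAAGATG AAACTAAAGT TTCCATAGCT"))  # prints: MKDETKVSIA
print(translate("GGGCAGGAAA GAGTAAGGTG A"))           # prints: GQERVR*
```

The two outputs match the first ten and last six residues of the listed protein sequence; the trailing `*` is the TGA stop codon, which is not counted in the 426 aa protein length.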