Gene Ssol_0482 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSsol_0482 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSulfolobus solfataricus 98/2 
KingdomArchaea 
Replicon accessionCP001800 
Strand
Start bp428412 
End bp430493 
Gene Length2082 bp 
Protein Length693 aa 
Translation table11 
GC content38% 
IMG OID 
Productextracellular solute-binding protein family 5 
Protein accessionACX90763 
Protein GI261601160 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.181954 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGGAAGG AATTAGTATT AGAAGTAGGT GTAATATTCT CGATATCTGT AATGCTCTTT 
TCAATCAGTG GTATAATGAT AGCCAACTCA GCATCTTCAC CATTTCCGTC AACCCTTTAC
TTAGGATGGT ATAATTCTAA CGTCGAAGCT TATTCTTCAT TTGCTACTTA TAATCCCAAT
ATATTTGCAG GAGGTTTAGG TGGATCTTTT TACGGATTGC TCTATGCATA CTCTGCAATA
ATTAATGTTA GTAATTCTCA AGCGATTCCC GTGATAATAA CTTCATGGAA CTTCTCTCCA
GCAAGCTGGC AGCAGATTTG GCAGACAAAA CCGGTTAATG TGACGCTGAT TATACGCAAT
GATTCTGGTT GGAGTAATGG GCAACCTCTC ACTGCTTACG ATCTTATGGC CACATGCCTA
ATTTTAGATA TGTTTGGTGC TCCGCCATTT CCTAATTATA CTGTTATTAA TAATTACACG
CTAGTTGAGA CTTGGCCTCC AGATACGTTA TCACCGATAT TGTTTACAGA CACGCTAATT
AATACGGTAG GCTTAGGGGA AGTTGCAATT ATAATACCAT ACCAAGTTTG GAAGCCTATA
ATTAGTCAGA TAATGGGCAA TTGGACAACA ATACAAAATA CAAGCAATAT GAAACTTGCG
CAGCAGATTA TTAAGAATAT TAGGTCAGAA ATCCTTCACT TTAGACCAAA CCCTGCCACA
TATCCTTATA GTGGTCCGTT CTACCTATCA CAGTTATCAT CGAATGAAAT AGTACTAAAT
AAGAATCCAT ATTATTATGA CGCTAAATAT ATACCATTCA ATCAAGTAAT AATATATCAA
TACGGTCAGG CTGACTTAGC AGCAGCTGCA ATAACGGGTG GTCAAGTTTC TATGGACTGG
TCCGGTTTGA CAGGATTATC ACCAACGCAG CTAGAAAGTT TGCCGACCAC ACTTGAGGTA
ATTAATATAC CTCAACCATT TGGCTGGGGT ATAGCATTCA ATCTGAAGAA TCCATGGTTA
AGAGAGTATC AAGTAAGGGC GGCAATTGCA TATATTCTAA ATAGGACTGC CATAGCTTCA
GTGGGAGGTC CTTTAACAGC ACCGGTTACT ATTCCTAATG CAATACCCAA TTTATCGTAC
TATTCATTCA TGACATCTTC GCAATACTAC TCATCTTTAA ATCCGTATAA TGTTAATTTA
ACAAAGGCAG CACAGCTGCT TAAGAGTGTA GGGTTTTACC AGAAGTCTGG AGTTTGGTAT
ACTCCAAATG GTACACCGTT TACTTTAACA ATAGGTGCTG GAAGTCCACC AACGCAAGCA
CTAGCAATGA TGGAAGAAGT ACAAAAAGAA TTACAGCAAT TTGGAATTAA CGTACAATTA
CACATATATA CTGTAGTTAG TCAGTGGCAT CAAGCATGGC AAAATGGTAC CGGCTATGAT
TTATGGTTTG AGAATTGGGG TAGTTCATAC TCGCCGGGGA CTGCACCATG GAGTTTGGTA
CTTTCTTATT TTGGAGGATA TCCATGGAAT GTTACACAGT GGAATGAGAA TCTTACTTTA
CCTAATGGTA CTATAATTGA CTTCCATAAA TTACTTGAGG AAACCGAGTC TCCTAATACC
ACACAACAGT TAATACAAGC TAATCAAGAG TTATCATATT ATATGAACCA ATACTTACTT
CCAATCTTAC CTCTAGTTGA GATCGAGAAT TACGTCATTG TAAATCCCTC GCTACTTATT
GCAGCTCCAC CTGCTAATTC GTGGATATGG GAAGAGGCAC AATACGGTAT AGGTGGTACT
GCTATGGTAC AAGCTCTAAT AGATTATTGG TACGCCCCAT TATATGAGAG CACTATAATT
ACTACAACTA CGACTTCTAT TTCTACTACT ATAACTACAA CAACCACTTC CGCAACAACA
GCTACCACGT CTGTAACTAC GACTACGTCT GTAACTACGA CTTCTATTTC TACTACTACA
GTTACTGTGA CTTCTACTTC AACTATTCCA ATTATAATAG CAATAGTTAT AGTTATTATT
GTTATAATTG CTGCTGTGGC AATATTAATG AGAAGGAGAT AA
 
Protein sequence
MRKELVLEVG VIFSISVMLF SISGIMIANS ASSPFPSTLY LGWYNSNVEA YSSFATYNPN 
IFAGGLGGSF YGLLYAYSAI INVSNSQAIP VIITSWNFSP ASWQQIWQTK PVNVTLIIRN
DSGWSNGQPL TAYDLMATCL ILDMFGAPPF PNYTVINNYT LVETWPPDTL SPILFTDTLI
NTVGLGEVAI IIPYQVWKPI ISQIMGNWTT IQNTSNMKLA QQIIKNIRSE ILHFRPNPAT
YPYSGPFYLS QLSSNEIVLN KNPYYYDAKY IPFNQVIIYQ YGQADLAAAA ITGGQVSMDW
SGLTGLSPTQ LESLPTTLEV INIPQPFGWG IAFNLKNPWL REYQVRAAIA YILNRTAIAS
VGGPLTAPVT IPNAIPNLSY YSFMTSSQYY SSLNPYNVNL TKAAQLLKSV GFYQKSGVWY
TPNGTPFTLT IGAGSPPTQA LAMMEEVQKE LQQFGINVQL HIYTVVSQWH QAWQNGTGYD
LWFENWGSSY SPGTAPWSLV LSYFGGYPWN VTQWNENLTL PNGTIIDFHK LLEETESPNT
TQQLIQANQE LSYYMNQYLL PILPLVEIEN YVIVNPSLLI AAPPANSWIW EEAQYGIGGT
AMVQALIDYW YAPLYESTII TTTTTSISTT ITTTTTSATT ATTSVTTTTS VTTTSISTTT
VTVTSTSTIP IIIAIVIVII VIIAAVAILM RRR