Gene Ssol_0808 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSsol_0808 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSulfolobus solfataricus 98/2 
KingdomArchaea 
Replicon accessionCP001800 
Strand
Start bp756035 
End bp757897 
Gene Length1863 bp 
Protein Length620 aa 
Translation table11 
GC content38% 
IMG OID 
Productarabinose ABC transporter, arabinose binding protein 
Protein accessionACX91059 
Protein GI261601456 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCTAGGC GTAGATTATA CAAGGCGATA TCTAGAACAG CAATAATAAT AATTGTAGTA 
GTAATCATAA TTGCAGCAAT TGCTGGAGGT CTTGCAGCAT ATTATAGCTC TAGCAAACCA
CCCGCTACCA GTACGAGTTT AACCAGTACG AGTAGCAGTT TATCCGTGAC GACTTCTTCT
ACAACTTCTA CCTTATCTTC TATTACGACA ACGACGTCAA CTGCTTCATC ATATGTAGTA
GACTTTATAA ATCCGTGGGG TGCTGAAGAC CCTGTAGGAC TAAAATGGAT TGGAGGAAAT
TTCTCAATTT ACTACCCTGG CTATTCAGTG CAATTCACAT CATTACCAGG AGCTAGTGGT
GTAGAGGAGA GGTATGTTGT AATTAACGAT ATTGAGGCTG GTAAGCTACA GGGTATCTTT
TGGGCTCATG GAGGCCCAGA AGTACTTTCC TATGTGGAAC TATTACCTAG CCCCCACGAT
TTATACAATA TGACCCCACT GTTAGCCCAA GAGGGATTAT TCCAGAAGGG TGTCACGGAA
GCACTTATGG CTATATCTTA TAATGGTACA ATATTCGGCT CTCCAACAAA CGTGCACAGA
GCAGAGGAGT TATACTTCAA TCCTCAAATC CTTAAGAAAT ACAATTTACC AATTCCTACC
AATTTAAGTT TACTAATCTA TGATACTCAA CAATTAGAAG CACATGGAAT AAATCCGTGG
GCAATGTCTG GCGCTGAAGG TGGATATGAA CAATTACATT TGTGGTTCGC AATATTCCTA
TCTGTAGCTG CGCAATACTA TGGAGCTGCA GGAGCTGCTA AATTATCAAA TGAGTTAATG
TATGGAGTGC TTAACTTAAA TAACGTAACA GTTCAAAAGA TTATTAACGA AACTGATAAC
GTATTCTTGC AGTTTGTAGG TCAAAGCAGT GTGATTCCGA GCTGGCAAAG TCAATCTATA
TGGTCCGCTT TAGCACTAGT AATAAAGGGA CAAACTGTAT TCGAAGCTGG TGGCAATTGG
CTTGCTGAAT ACGCAGCTAT ATGGTATAAT ACCACTACTT ATCCTGCTAC ACAACCCTAT
CTAAACTGGT CCAATATTAC ACTAATGGCA ATGCCATTCC CTGGTACACA AGGTATTTAC
GTTATAGATA TGGACTCTGT TGCAATACCT ACTGTTAATA ATCCACAAGA GCAAGCTGCT
ATAAATTTTG CAAAATTCTG GGCTTCTTAT GAAGGACAGA AGATATGGAC ATACTACAAG
GGTGTGTCAA TATGGGCTAA TTCAACAGAT TACTATTCTA CTCCAATGCA GTGGTATGAT
TATCAATCTT TACTAAATAC GCCAGCACAG AACTTTACAT GGGCGTTCGC TGATGGGACC
CTATTTGATG ATGTGTTCTA CTTCCTAATA GCGCAAGAGT TGAATCTACA AGAGCAAGGT
TCATCGTATA TTCCCACATT TAACGCAGCT CTATTCAAAG CTGAAAATAT GACATTCCAC
GAATGGCAAA TAGCTGCGAA AGATGGTTTC GGCTTCGTGG GACAAAGAGG AAATCCATTT
GGTAATTATC TACCACCATG GGTTAATCCA AGCACTTATA CATATAATTC AAGTTATACT
CCATCATTCT TACTAACACC ACCTCACTAT TTATTGCCAT ATCTTAAGAA ACTTGGACAA
CAGCAATATG CTAATAAGGT TAATAGTGTA AATTACTATG CTATCAACGG TGTTAATTTA
TTACCATTTC CATTGCTAAT CGTATTAATG ATATATTTAG ATCAAACTAG ATATAAATAT
GTTATAAAGA CATTCTTAAA TAAGATAAAC TTTTTTCAAA TATATTTTTA TCTAAATTTT
TAA
 
Protein sequence
MSRRRLYKAI SRTAIIIIVV VIIIAAIAGG LAAYYSSSKP PATSTSLTST SSSLSVTTSS 
TTSTLSSITT TTSTASSYVV DFINPWGAED PVGLKWIGGN FSIYYPGYSV QFTSLPGASG
VEERYVVIND IEAGKLQGIF WAHGGPEVLS YVELLPSPHD LYNMTPLLAQ EGLFQKGVTE
ALMAISYNGT IFGSPTNVHR AEELYFNPQI LKKYNLPIPT NLSLLIYDTQ QLEAHGINPW
AMSGAEGGYE QLHLWFAIFL SVAAQYYGAA GAAKLSNELM YGVLNLNNVT VQKIINETDN
VFLQFVGQSS VIPSWQSQSI WSALALVIKG QTVFEAGGNW LAEYAAIWYN TTTYPATQPY
LNWSNITLMA MPFPGTQGIY VIDMDSVAIP TVNNPQEQAA INFAKFWASY EGQKIWTYYK
GVSIWANSTD YYSTPMQWYD YQSLLNTPAQ NFTWAFADGT LFDDVFYFLI AQELNLQEQG
SSYIPTFNAA LFKAENMTFH EWQIAAKDGF GFVGQRGNPF GNYLPPWVNP STYTYNSSYT
PSFLLTPPHY LLPYLKKLGQ QQYANKVNSV NYYAINGVNL LPFPLLIVLM IYLDQTRYKY
VIKTFLNKIN FFQIYFYLNF