Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ssol_0808 |
Symbol | |
ID | 0 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sulfolobus solfataricus 98/2 |
Kingdom | Archaea |
Replicon accession | CP001800 |
Strand | + |
Start bp | 756035 |
End bp | 757897 |
Gene Length | 1863 bp |
Protein Length | 620 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | |
Product | arabinose ABC transporter, arabinose binding protein |
Protein accession | ACX91059 |
Protein GI | 261601456 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCTAGGC GTAGATTATA CAAGGCGATA TCTAGAACAG CAATAATAAT AATTGTAGTA GTAATCATAA TTGCAGCAAT TGCTGGAGGT CTTGCAGCAT ATTATAGCTC TAGCAAACCA CCCGCTACCA GTACGAGTTT AACCAGTACG AGTAGCAGTT TATCCGTGAC GACTTCTTCT ACAACTTCTA CCTTATCTTC TATTACGACA ACGACGTCAA CTGCTTCATC ATATGTAGTA GACTTTATAA ATCCGTGGGG TGCTGAAGAC CCTGTAGGAC TAAAATGGAT TGGAGGAAAT TTCTCAATTT ACTACCCTGG CTATTCAGTG CAATTCACAT CATTACCAGG AGCTAGTGGT GTAGAGGAGA GGTATGTTGT AATTAACGAT ATTGAGGCTG GTAAGCTACA GGGTATCTTT TGGGCTCATG GAGGCCCAGA AGTACTTTCC TATGTGGAAC TATTACCTAG CCCCCACGAT TTATACAATA TGACCCCACT GTTAGCCCAA GAGGGATTAT TCCAGAAGGG TGTCACGGAA GCACTTATGG CTATATCTTA TAATGGTACA ATATTCGGCT CTCCAACAAA CGTGCACAGA GCAGAGGAGT TATACTTCAA TCCTCAAATC CTTAAGAAAT ACAATTTACC AATTCCTACC AATTTAAGTT TACTAATCTA TGATACTCAA CAATTAGAAG CACATGGAAT AAATCCGTGG GCAATGTCTG GCGCTGAAGG TGGATATGAA CAATTACATT TGTGGTTCGC AATATTCCTA TCTGTAGCTG CGCAATACTA TGGAGCTGCA GGAGCTGCTA AATTATCAAA TGAGTTAATG TATGGAGTGC TTAACTTAAA TAACGTAACA GTTCAAAAGA TTATTAACGA AACTGATAAC GTATTCTTGC AGTTTGTAGG TCAAAGCAGT GTGATTCCGA GCTGGCAAAG TCAATCTATA TGGTCCGCTT TAGCACTAGT AATAAAGGGA CAAACTGTAT TCGAAGCTGG TGGCAATTGG CTTGCTGAAT ACGCAGCTAT ATGGTATAAT ACCACTACTT ATCCTGCTAC ACAACCCTAT CTAAACTGGT CCAATATTAC ACTAATGGCA ATGCCATTCC CTGGTACACA AGGTATTTAC GTTATAGATA TGGACTCTGT TGCAATACCT ACTGTTAATA ATCCACAAGA GCAAGCTGCT ATAAATTTTG CAAAATTCTG GGCTTCTTAT GAAGGACAGA AGATATGGAC ATACTACAAG GGTGTGTCAA TATGGGCTAA TTCAACAGAT TACTATTCTA CTCCAATGCA GTGGTATGAT TATCAATCTT TACTAAATAC GCCAGCACAG AACTTTACAT GGGCGTTCGC TGATGGGACC CTATTTGATG ATGTGTTCTA CTTCCTAATA GCGCAAGAGT TGAATCTACA AGAGCAAGGT TCATCGTATA TTCCCACATT TAACGCAGCT CTATTCAAAG CTGAAAATAT GACATTCCAC GAATGGCAAA TAGCTGCGAA AGATGGTTTC GGCTTCGTGG GACAAAGAGG AAATCCATTT GGTAATTATC TACCACCATG GGTTAATCCA AGCACTTATA CATATAATTC AAGTTATACT CCATCATTCT TACTAACACC ACCTCACTAT TTATTGCCAT ATCTTAAGAA ACTTGGACAA CAGCAATATG CTAATAAGGT TAATAGTGTA AATTACTATG CTATCAACGG TGTTAATTTA TTACCATTTC CATTGCTAAT CGTATTAATG ATATATTTAG ATCAAACTAG ATATAAATAT GTTATAAAGA CATTCTTAAA TAAGATAAAC TTTTTTCAAA TATATTTTTA TCTAAATTTT TAA
|
Protein sequence | MSRRRLYKAI SRTAIIIIVV VIIIAAIAGG LAAYYSSSKP PATSTSLTST SSSLSVTTSS TTSTLSSITT TTSTASSYVV DFINPWGAED PVGLKWIGGN FSIYYPGYSV QFTSLPGASG VEERYVVIND IEAGKLQGIF WAHGGPEVLS YVELLPSPHD LYNMTPLLAQ EGLFQKGVTE ALMAISYNGT IFGSPTNVHR AEELYFNPQI LKKYNLPIPT NLSLLIYDTQ QLEAHGINPW AMSGAEGGYE QLHLWFAIFL SVAAQYYGAA GAAKLSNELM YGVLNLNNVT VQKIINETDN VFLQFVGQSS VIPSWQSQSI WSALALVIKG QTVFEAGGNW LAEYAAIWYN TTTYPATQPY LNWSNITLMA MPFPGTQGIY VIDMDSVAIP TVNNPQEQAA INFAKFWASY EGQKIWTYYK GVSIWANSTD YYSTPMQWYD YQSLLNTPAQ NFTWAFADGT LFDDVFYFLI AQELNLQEQG SSYIPTFNAA LFKAENMTFH EWQIAAKDGF GFVGQRGNPF GNYLPPWVNP STYTYNSSYT PSFLLTPPHY LLPYLKKLGQ QQYANKVNSV NYYAINGVNL LPFPLLIVLM IYLDQTRYKY VIKTFLNKIN FFQIYFYLNF
|
| |