Gene Information |
Locus tag | Ssol_2001 |
Symbol | |
ID | 0 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sulfolobus solfataricus 98/2 |
Kingdom | Archaea |
Replicon accession | CP001800 |
Strand | + |
Start bp | 1796460 |
End bp | 1797668 |
Gene Length | 1209 bp |
Protein Length | 402 aa |
Translation table | 11 |
GC content | 36% |
IMG OID | |
Product | extracellular solute-binding protein family 1 |
Protein accession | ACX92211 |
Protein GI | 261602608 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
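The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS length includes the terminal stop codon; neither convention is stated in the record, although both agree with the reported values. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming the last codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon (assumption).
    return cds_length_bp // 3 - 1


length = gene_length(1796460, 1797668)
print(length, protein_length(length))  # prints "1209 402"
```

Under these assumptions the result matches the Gene Length (1209 bp) and Protein Length (402 aa) fields.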
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGTTAGAGA AAAATTTATT ACCGGAAATA TTACTGGCTA TACATATGCC GTTAAATAAA GGGTTAACTA GGGTTAAAGC TATCGTGATA ATAATTGTTG TTATAATCGC AGTGATAGCT GGGGTTGTAG GATATTATTT AATTAATCAT CCTTCTAATT CTGTAACTAC TTCATCTTCA TCTACTACAA CTAGTTCTTC CCTATCTAGT ACTAGCATAT CTTCATCTAC TACTAATATT ACTTCATCTC AAGGTATTAC AGTCTTCGTA GCGGGTGCTT ATCTTGCAAT TCTCAACTAC CTAGCTGACC AATTTCAGAA CGCTACTGAA ATTCCAGTTC ATGTTGTAGG TAGTGGCTCC TTCGCATTAG CTTCACAAAT AGCTTCCCAG ACTCCAGTTC CAGCAAACGT TTTCATTCCA GTTGCCTATA TTCAAGCTGT TGAGTTAACT GGCAGTAGGA ATCCCGGTTG GGCTATAGCT TTTCTATCAG ATCAGATGAC AATAGTTTAC TCTAACTACA CTACCAAATC TCCTTATTGG TCCCAACTAT ACTCCAATTA CACCATGGCT ATGGAGACCA ACAATACTAA GTATTGGTAT AATTTCTTCT ACTTATTGAC CACCAGGTTC AGTCTGGGAA TTGCTAATCC TAACACTGAC CCAGAGGGAT TATATGCGTA TTTGATACTT CAAATGGCAA GTTATTTATA TGCTAATCAT AATATAAGCT ACTTTGTGCA TCTCGTTAAA GCGAATCCAA ATGTCAAAGT AGCCCCTAGT ACAGCTAACT ATGTAGCGCC CTTAAAGGCG GGTACTTTAG ACTTCACATT CTCTTATGTT TCCTATGCTG TATCTCAAGG ATTGGAATAT CTAAAACTAC CTCCTTGGTT AAGTTTTGGT TATTATCCGA ACGAGACGAC ATGGTACAGT CAATTTGCTT ATAATATAAG TGTAAATGGC CAAACATTAA CAATTCATGG AAATCCAGTT TACTTATACA TTACCATTCC ATTAAACGCT TCGAATATAC AAACTGCATA TCAGTTTATT GGCTTCGTAC TGGGTCATGA ATCTCAACTT ACCAGATTTA ATGTAATTCC AATACAACCA GCTTTATTGT ATAATGAAAC TAGTAATATT CCGCAGCCTA TATTGAACTT GTTAAAATCT GGTGAGTTGA AGTATGCGGG TAATTTCTCT GAAGTTTAA
|
Protein sequence | MLEKNLLPEI LLAIHMPLNK GLTRVKAIVI IIVVIIAVIA GVVGYYLINH PSNSVTTSSS STTTSSSLSS TSISSSTTNI TSSQGITVFV AGAYLAILNY LADQFQNATE IPVHVVGSGS FALASQIASQ TPVPANVFIP VAYIQAVELT GSRNPGWAIA FLSDQMTIVY SNYTTKSPYW SQLYSNYTMA METNNTKYWY NFFYLLTTRF SLGIANPNTD PEGLYAYLIL QMASYLYANH NISYFVHLVK ANPNVKVAPS TANYVAPLKA GTLDFTFSYV SYAVSQGLEY LKLPPWLSFG YYPNETTWYS QFAYNISVNG QTLTIHGNPV YLYITIPLNA SNIQTAYQFI GFVLGHESQL TRFNVIPIQP ALLYNETSNI PQPILNLLKS GELKYAGNFS EV
|
| |
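The sequences above are displayed in space-separated blocks of 10 characters, so the spaces need to be removed before any analysis. A minimal sketch of that cleanup, together with a GC-fraction helper, is shown here; the function names are hypothetical, and only the first two blocks of the gene sequence are used as sample input. Note that the gene begins with TTG, which is among the alternative initiation codons of translation table 11 and is read as methionine at the start position, in line with the protein sequence beginning with M.

```python
def clean_sequence(display: str) -> str:
    """Remove the block-separating whitespace and upper-case the sequence."""
    return "".join(display.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First two display blocks of the gene sequence from the record.
first_blocks = "TTGTTAGAGA AAAATTTATT"
seq = clean_sequence(first_blocks)
print(len(seq), seq[:3], gc_fraction(seq))
```

Applying `gc_fraction` to the full cleaned gene sequence would give the value to compare against the GC content field.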