Gene Information |
Locus tag | Ssol_0454 |
Symbol | |
ID | 0 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sulfolobus solfataricus 98/2 |
Kingdom | Archaea |
Replicon accession | CP001800 |
Strand | + |
Start bp | 406433 |
End bp | 407620 |
Gene Length | 1188 bp |
Protein Length | 395 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | |
Product | protein of unknown function DUF224 cysteine-rich region domain protein |
Protein accession | ACX90737 |
Protein GI | 261601134 |
COG category | |
COG ID | |
TIGRFAM ID | |
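
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is a minimal illustration, not part of the original record: the variable names are hypothetical, and it assumes 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length.

```python
# Values copied from the record above.
start_bp = 406433
end_bp = 407620

# Assumption: coordinates are 1-based and inclusive, so both end points count.
gene_length = end_bp - start_bp + 1

# An unspliced CDS is read in triplets; the final codon is assumed to be a stop.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length)      # 1188, matching "Gene Length"
print(protein_length)   # 395, matching "Protein Length"
```

Under these assumptions the Start bp, End bp, Gene Length and Protein Length fields are mutually consistent.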
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTATTCTT TAAATCCAAG TAATTCTACT TTTTTTGACT CTAATAAATT ATTATCAGAA TTTATAAGGC AAGCGAGCGT TTGTCACGGT TGCAGATTAT GTTTCAACTA TTGCGATTCT TTTCCCCTTA TGTTTACTTA TACTGATAAG AAAGGCCCCA AAAACTTAAC CTTAGATGAC TTGTTTAATG TAGCCTCTAA GTGCTTTCAC TGTAAGATGT GCTACGTCAA TTGTCCCTAT GTTCCTCCTC ACGAATTTAA CATGGACTTT CCAAGCCTAA TGGAATGGGC GTGGCTATAC TATAAGAAAA ATCGAGGATT AACTGTAAGG GATTTTATCT TTGAAATGCT AGATGGTGTG AAGTTTGCAA GGCCCTTAGC TAAAGTAATT ATGGAAAAGA ACAAGGAGTT ATTAGGTATT CACAAAGAAG CCCCCACGTT ACCAGTAGCG GAGAAAGGTT TAAGGGAAAG AGTTAAGCCC AAACGTATCG ATAGTCCCAA AGCAAGGGTT GCACTATTTC CCACTTGTTT AATTGAGAAT TTCTTCCCAG AAATTGGCGA GGATTTAGTA GAAATATACA ACGAATTAGG GATAGAAGTA ATTATTCCTA ATTTCGTTTG TTGTGGAGCT CCAATGTTGG ATTCTGGTGA CGTTGATAGG CTTAAGAAGA ATGCTGAGTA TAATATCAAA ATAATTGAGG ATTTAATAAA GGAAGGTTAT GATGTAGTTT CGCCTATACC TACTTGTACG TTAATGATTA AGGAGTACAA GAAGGTTCTT GATAGAGAAG TACCTAAGGT TTATGATGCA ATGGAGTATC TTTTAAAATT AAAGAATGAG GGCAAGATAG AGCTAAAGGG TAAGATTGAG AAGAGTGTGT ATTATCATCC TCCATGCCAC CTTAAGTTCT TACAATTAGG ATTACCTGGG GTTAGATTAT TAAGGTCAAT GGGAGCGAAA GTCGATATTT CCAATAATGG TTGTTCCGGT ATAGATGGGG GTTGGGGATT AAGAAATTAT GACACTGCTA AAAGAGTAGG AAGTAAAATG ATGGAAGCTT TTAAACAGAG TAAAGCTGAT CTTTTTTCAA CTGAATGCCC TCTGGCTGGG CTTCAGATAG AAAAATCTTC TGGTAGAAGG CCATTACATC CAATTCAATT GTTAAAGGAG GCGATGAAAA ATGGTTAA
|
Protein sequence | MYSLNPSNST FFDSNKLLSE FIRQASVCHG CRLCFNYCDS FPLMFTYTDK KGPKNLTLDD LFNVASKCFH CKMCYVNCPY VPPHEFNMDF PSLMEWAWLY YKKNRGLTVR DFIFEMLDGV KFARPLAKVI MEKNKELLGI HKEAPTLPVA EKGLRERVKP KRIDSPKARV ALFPTCLIEN FFPEIGEDLV EIYNELGIEV IIPNFVCCGA PMLDSGDVDR LKKNAEYNIK IIEDLIKEGY DVVSPIPTCT LMIKEYKKVL DREVPKVYDA MEYLLKLKNE GKIELKGKIE KSVYYHPPCH LKFLQLGLPG VRLLRSMGAK VDISNNGCSG IDGGWGLRNY DTAKRVGSKM MEAFKQSKAD LFSTECPLAG LQIEKSSGRR PLHPIQLLKE AMKNG
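
The sequences above are printed in space-separated blocks of 10 characters. The following is a hedged sketch of how such a field could be cleaned and translated; the helper names `clean`, `gc_fraction` and `translate` are hypothetical and not part of any database API. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the tables differ in permitted start codons). Only the first 30 and last 18 bases of the gene sequence are used here to keep the example short.

```python
# Codon table built in TCAG order; amino acid assignments are shared by
# translation table 11 and the standard genetic code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Remove the block spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate in frame 1, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 18 bases of the gene sequence above.
# The tail is in frame because the full gene length (1188) is a multiple of 3.
head = "ATGTATTCTT TAAATCCAAG TAATTCTACT"
tail = "GCGATGAAAA ATGGTTAA"

print(translate(head))  # MYSLNPSNST, the first 10 residues of the protein
print(translate(tail))  # AMKNG, the last 5 residues before the stop codon
```

Passing the full Gene sequence field to `translate` and `gc_fraction` in the same way would allow a comparison against the Protein sequence and GC content fields of the record.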