Gene Information |
Locus tag | Ssol_2100 |
Symbol | |
ID | 0 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sulfolobus solfataricus 98/2 |
Kingdom | Archaea |
Replicon accession | CP001800 |
Strand | + |
Start bp | 1878045 |
End bp | 1879358 |
Gene Length | 1314 bp |
Protein Length | 437 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | |
Product | protein of unknown function DUF224 cysteine-rich region domain protein |
Protein accession | ACX92306 |
Protein GI | 261602703 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTGAGTC AAGATGAGAA ACAAATACAA AGGGAAGTTC GAGAAGCCTT CCCAATGTCA GATGATGTTG ACTGGAATGA GGTTTATCAG AGAATAATTT ATAGGTATAG CACACCTCAT GGACTAGAAC ATGTTAAAGA GGAAATGTAT AAATTAGAAG ATAAGGGCGA AATTATAATA CACCATATTA AGCCCTACAA TAACCCAGTA GAAGCCCAAA CACTAAACGG ATCTCCAAAG AAAATACCCA CAACAAAATT ATGGCATCAT AAGAGTTGTG GACAGTGTGG CCACATACCC GGTTATCCAA CCTCTGTTTT CTGGGTAATG AATAAGTTAG AAATAGATTA CCTAGATGAA CCGCATCAAA CATCGTGTAC TGGATGGAAT TATCACGCGT CTGGTGCCTC CAACCCCGTA GCCTTGGCAG GAGTATATGT AAGGAACATG TGGAGAGCTT ATGAAACGGG TTACTTCCCA TTAATACATT GCGGAACATC ATTTGGTCAT TATAAAGAAG TTAGAAACAT GATAATATTA CACAAAGAGA TAAGAGACAA ACTTAGACCA ATCATGAGAA AACTGGATAT GGACATTGTA ATACCAGAAG AGGTAGTTCA TTATTCTGAA TGGTTATATG TAATGAGCAA GAAAGCTGCA CAGCAGAAGA AATACAATCT AGATAATATT AAGGCAGCTG TACATACTCC TTGTCATGTT TATAAGTTAG TTCCAGAGGA TACTGTTTAC GATCCCGAAG TATTCCAAGG TAGAAGACCA GCAGCCCCAT CTGGAACTGT ACAGAATTTC GGCGCTAAAC TAGTGGATTA CTCAACCTGG TGGGATTGCT GTGGATTCGG ATTTAGACAT ATCCTAACAG AGAGGGAATT CAGTAGAAGT TTCGCACTAT TTAAGAAGGT TATACCAGCA GTTGAGGAAG GGAATGCTGA TATCTTTGTA ACCTCAGATA CTGGGTGTGT TACTACTTTA GATAAGAGTC AGTGGGCTGG AAAGGCTCAT GGTTTCAATT ATAACTTACC AGTATTAGCT GATGCGCAAT TTGCAGCTTT AGCAATGGGC GCTGATCCAT ATATAATTGC TCAAATTCAC TGGCACGCGA CAGATGTAGA AGGTTTCTTA AGAAAGATAG GTGTTCCGGT TGATGATTAT AAAGAGAAGT TCGTACAATA TCTACAAGAT CTAAGAGAAG GTAAGACGGA ACCACAATAC TTATATCCAA AGCATAGAAA GATTGACTTC TACTTATCAC TTCCAGATAG AGTAAAATGG TACAAGAAGG AGGTTCCAAA GTAA
|
Protein sequence | MLSQDEKQIQ REVREAFPMS DDVDWNEVYQ RIIYRYSTPH GLEHVKEEMY KLEDKGEIII HHIKPYNNPV EAQTLNGSPK KIPTTKLWHH KSCGQCGHIP GYPTSVFWVM NKLEIDYLDE PHQTSCTGWN YHASGASNPV ALAGVYVRNM WRAYETGYFP LIHCGTSFGH YKEVRNMIIL HKEIRDKLRP IMRKLDMDIV IPEEVVHYSE WLYVMSKKAA QQKKYNLDNI KAAVHTPCHV YKLVPEDTVY DPEVFQGRRP AAPSGTVQNF GAKLVDYSTW WDCCGFGFRH ILTEREFSRS FALFKKVIPA VEEGNADIFV TSDTGCVTTL DKSQWAGKAH GFNYNLPVLA DAQFAALAMG ADPYIIAQIH WHATDVEGFL RKIGVPVDDY KEKFVQYLQD LREGKTEPQY LYPKHRKIDF YLSLPDRVKW YKKEVPK
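The length fields in this record can be cross-checked against the coordinates. The sketch below is a minimal illustration, not part of the source database. It assumes the Start bp and End bp values are 1-based and inclusive, and that the gene length includes the terminal stop codon (the gene sequence above ends in TAA, which is consistent with that). The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


# Values taken from the Start bp and End bp rows of this record.
length = gene_length_bp(1878045, 1879358)
print(length, protein_length_aa(length))  # prints "1314 437"
```

Under these assumptions the computed values agree with the Gene Length (1314 bp) and Protein Length (437 aa) rows.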