Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ssol_1450 |
Symbol | |
ID | 0 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sulfolobus solfataricus 98/2 |
Kingdom | Archaea |
Replicon accession | CP001800 |
Strand | + |
Start bp | 1333801 |
End bp | 1335546 |
Gene Length | 1746 bp |
Protein Length | 581 aa |
Translation table | 11 |
GC content | 34% |
IMG OID | |
Product | protein of unknown function DUF87 |
Protein accession | ACX91682 |
Protein GI | 261602079 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.0452751 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAAAGTA TTTTTGAAAC GGAAGAGGGA AAACTTAGAG AAGCTAAAAT TATTACTAGG CAAACTTCCG ATGGTAGGGG TACAATATCT TTTAGAAATT ATATAGTGGA ATTTCCATTC TCGCTGAAGG ATAAGTTGGG TATAGGCAAA TTGCTCGCTG TAAATACAAT AAAGGAAAAT AACTATCTCA TCTTAGAAGT TGCGGATATT ATTCCAATGC ATTATGGTAT GATAAATCTG GACTCTACAA TACCTAAGGA GATTAGAAAA GAGATTATGA AAAGGGTTAG TGAAAGTTGG TATTCAAATG ACGAGAAAGA AATATGGATA GACTCAATAA CTTATCCCTT AGGATATATT CTTGAAGTAA ATTCGGATAA TGTTCTATTT AAAAAAGGAT ATTTCCCACC TCTATTAGGT TCTTCAGTGA AGATTTTGAA TAAAAAGGCG TACGCATCAT TTGTCTGTGC CAAGAGTAAT ATAAGTTTAG GTAAAATTCT ACATGAGGAA CTTTCTCTAG ACGTAAATTT AGAAAAGGCA ATAAGGTATC ACCTAGGTAT TTTCGCTTTT ACGGGCTCTG GTAAATCAAA TTTAGCATCC TTGATTGCTA GAAAAGTACT AGATAATTTA CTTGACACTA AAGTTATAAT ATTTGACGTG TCAATGGAGT ATGCAATACT TCTCTTAGAT AAGTTGCTTG AAGTCCCATC CAGAGTTGTA AGTGTAGACA GAGTTCCCCC TAATCCAATC GATGCTAGTA GAAAATTTTT AAGGAGTCAC GTAATTCCTG ACGATATTGT AGATATTAGA GATAAGATAA AGAAAGGTGC GGAAATTCTG CATCAAAATG GGAAAATGAA ACAGTTATAC GTTCCGCCTG AAGGTCTGTC TTACTTAACC TATGCTGATT TAATAGATCT AATAAAAAAG CAGATAGAAG ATAAGTATAC TGCAATATCG CAGAAACCAC TGCTATATAC TTTTCTAAGT AAGTTAGATA ATTTCATGAG AGAAAGGAAA TTAACTGCAG ACGACATTAT AGATGACTCC ATTAATCAGT TATTAGATGA AATAGAAAAT TTAGGTAAGG ATGCACACCT AAAGGAAAAT TCGTCACTGT TCACATTTAT ATCTGGCATA AAGGCGTACA TCTCACTCGG TATAAGAGAA ACCGAAGAGT ATGATATAGA AAATTTAGCG ATTGAGATCT TAGATTCTTC TAAGGATTCA CCTAGGTTAT TTATTTTAGA ACTACCTAAC TTAGAAGAGG GAAGGCAAGT AGTTGCGACT ATAATCAATC AAATTTATAA TAGGAGAAAG AGAATGTATT CCGATAATCC CAAAATATTA TTTATAATAG ATGAAGCTCA AGAGTTTATA CCTTATGATA CTAAACAAAA AGACAAAAGT GAAGCTTCAA GTACTGCCAT AGAGAAGTTA CTTAGGCACG GGAGAAAGTA TCATTTACAT TCACTAATAA GTACCCAGAG GCTAGCGTAT CTAAACACTA ATGCTCTACA ACAGTTGCAC TCTTATTTCA TAAGCACACT CCCTAGGCCA TACGATAGAC AATTATTAGC TGAAACTTTT GGAATTAGCG ATATGTTATT AGATAAGACC TTAGAACTGG AACCGGGGCA ATGGTTGTTA GTAAGCTTTA AATCTGCACT TCCTCACGAC GTCCCCGTAT TCTTTTCCGC GGAGAACAAT CTAGATTTAT TAAAGGATAG AATAAATAAG TTATGA
|
Protein sequence | MESIFETEEG KLREAKIITR QTSDGRGTIS FRNYIVEFPF SLKDKLGIGK LLAVNTIKEN NYLILEVADI IPMHYGMINL DSTIPKEIRK EIMKRVSESW YSNDEKEIWI DSITYPLGYI LEVNSDNVLF KKGYFPPLLG SSVKILNKKA YASFVCAKSN ISLGKILHEE LSLDVNLEKA IRYHLGIFAF TGSGKSNLAS LIARKVLDNL LDTKVIIFDV SMEYAILLLD KLLEVPSRVV SVDRVPPNPI DASRKFLRSH VIPDDIVDIR DKIKKGAEIL HQNGKMKQLY VPPEGLSYLT YADLIDLIKK QIEDKYTAIS QKPLLYTFLS KLDNFMRERK LTADDIIDDS INQLLDEIEN LGKDAHLKEN SSLFTFISGI KAYISLGIRE TEEYDIENLA IEILDSSKDS PRLFILELPN LEEGRQVVAT IINQIYNRRK RMYSDNPKIL FIIDEAQEFI PYDTKQKDKS EASSTAIEKL LRHGRKYHLH SLISTQRLAY LNTNALQQLH SYFISTLPRP YDRQLLAETF GISDMLLDKT LELEPGQWLL VSFKSALPHD VPVFFSAENN LDLLKDRINK L
|
| |