Gene Information |
Locus tag | Ssol_2056 |
Symbol | |
ID | 0 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sulfolobus solfataricus 98/2 |
Kingdom | Archaea |
Replicon accession | CP001800 |
Strand | + |
Start bp | 1846309 |
End bp | 1847505 |
Gene Length | 1197 bp |
Protein Length | 398 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | |
Product | protein of unknown function DUF214 |
Protein accession | ACX92262 |
Protein GI | 261602659 |
COG category | |
COG ID | |
TIGRFAM ID | |
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.339925 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGATAC TGGATTTTAT ATCTTATGCT TTAAACTCAC TAAAAGAAAG AAAGGTTAGG GCAATACTTA CTATCCTTGG GATAGTTGTC GGACCTGCTA CAATAATTTC TATAAACTCC ATGGTATTGG GCTACTCACA CACAATAATT TCGCAAATAT CCAACTTTCT CTCACCCTAT GATATTATCG TGACCCCCAC TGGAAGGGGT CTGCCATTAT CCCAATACCT CATACTCCAA CTGGAGACTA TCCTTGGAGT AAAAATGGTT ATTCCATTCT ATTCATTTCC GGCGTTAATT AGAACACCTA ATGGCTATGA AGGGGCAACA GTATTTGCGG TTAATATAAA CCAGCTTAAG ATAGCAGCTC CAGCTATAAG CTTATCATCA GGTTACTTTC CAGCTGCGGA GGTTAGCTAT GAAGCTTCAA TAGGCTATCA GTTGGGAAAT CCGCAAGGTG GATATAGTCC AATAAGACCA AATCAAGTGA TACAAACAAT TATATTCTAT AACGGGAATA ATTTCACTAA GACATTTCTA GTAACTGGAG TTTTGAACGA ATATGGAAGT TTTCTCGGAG TTGATATAGA TAAGTCAATA ATAGTACCTT TATCTTTTGG TCAGTCAATC TCAAGTTCTT ATAGTGGAGC CATAATAATA GTGAGCTCTC TAGGAGAAGT GAATGAAGTT GTAAATGAAA TAAAACAAAA GTTTGGAAAT TCTTTAGATA TTGTAGTGGC GGAGGAATTT ATACAATTAA TAGATAATAC TTTACAATCT CTTAACGGAT TGCTAGTATC TGCAGGAGCT ACGTCATTCA TAGTTTCGTT TATGGGAGTA ACTACAACAA TGTTCACAAC AGTGGTGGAA AGAACTAAGG AGATAGGGAT ATTAAGAGCA TTAGGATTTA CTAGGTTTGA TGTACTCACA ATGTTTTTAG TTGAAGCTAG TGTGATGGGG TTCATAGGTA GTATAACAGG GCTCGCATTA GGTTCAGTAG TTGCATTAAT ATTAACACAA GAACATTTCG GATTGGGATT TAGTTTTCTA AAGGGTCTTT CAGTATCACC GGTCTATTCT CCTACCTTTA TGTTGTTAGT GCTAATATTT TCTACAATTC TAAGCGTCAT TGCAGCACTA GGACCTGCTT ACAATGCATC CAAACTAGAT CCAAATAAAG CTTTAAGATA CGAGTAG
|
Protein sequence | MKILDFISYA LNSLKERKVR AILTILGIVV GPATIISINS MVLGYSHTII SQISNFLSPY DIIVTPTGRG LPLSQYLILQ LETILGVKMV IPFYSFPALI RTPNGYEGAT VFAVNINQLK IAAPAISLSS GYFPAAEVSY EASIGYQLGN PQGGYSPIRP NQVIQTIIFY NGNNFTKTFL VTGVLNEYGS FLGVDIDKSI IVPLSFGQSI SSSYSGAIII VSSLGEVNEV VNEIKQKFGN SLDIVVAEEF IQLIDNTLQS LNGLLVSAGA TSFIVSFMGV TTTMFTTVVE RTKEIGILRA LGFTRFDVLT MFLVEASVMG FIGSITGLAL GSVVALILTQ EHFGLGFSFL KGLSVSPVYS PTFMLLVLIF STILSVIAAL GPAYNASKLD PNKALRYE
|
| |
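The coordinate, length, and GC fields in this record are related by simple arithmetic, and the following minimal sketch shows one way to cross-check them. It assumes that Start bp and End bp are 1-based and inclusive, and that the CDS is unspliced and ends with a stop codon (the gene sequence above ends in TAG). The function names `gene_length`, `protein_length`, and `gc_percent` are hypothetical helpers introduced here for illustration and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


def gc_percent(sequence: str) -> float:
    """GC content in percent; whitespace between 10-base blocks is ignored."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


if __name__ == "__main__":
    length = gene_length(1846309, 1847505)
    print(length, protein_length(length))  # prints: 1197 398
```

Passing the Gene sequence field to `gc_percent` gives a value that can be compared with the reported GC content, which the record rounds to a whole percent.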