Gene Information |
Locus tag | Ssol_2091 |
Symbol | |
ID | 0 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sulfolobus solfataricus 98/2 |
Kingdom | Archaea |
Replicon accession | CP001800 |
Strand | - |
Start bp | 1871839 |
End bp | 1873098 |
Gene Length | 1260 bp |
Protein Length | 419 aa |
Translation table | 11 |
GC content | 36% |
IMG OID | |
Product | protein of unknown function DUF1641 |
Protein accession | ACX92297 |
Protein GI | 261602694 |
COG category | |
COG ID | |
TIGRFAM ID | |
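
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are illustrative and are not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


# Values taken from the record for Ssol_2091.
length = gene_length_bp(1871839, 1873098)
print(length, protein_length_aa(length))  # prints "1260 419"
```

Both results agree with the Gene Length (1260 bp) and Protein Length (419 aa) rows.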
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTGTAT CAACAGTAGA TCCACTTGAG GAAATATTAA AACCGGAAAA CTTAAGTAGG TTATCCAAGA TAGTTGACGC ATTACCTACA ATAGAGAAAG TAACTGATAA AATAACAGAA ATGGATAAAA AAGGTCAAGT GGACTTCTTA CTCTCATTAT TTGATCAAAC AGTTTCCATA CTTGATGCTG TACAGAAGGC AGACCTAATA AACACACTCA TCTCATTCGG AATGGATCAA CTACCAAAAA TACAAGCAAT ATGGCCAATA CTAGAAAAAC TAACAAGCGA AAGGGCTTTG CAATTACTAT CGCAGATTGA TATAGACTCT ACTCTTACCG CGTTAGAGAA ATTATCGCCA ATAATTAAGA AGCTAACTGA CGAAAAAGTG TTGAAAGTAA TTGATCAGAT AGACTATGAT TCTTTAATAG ATAGTACTTC GAAGTTAGTA CCTATTTTAT CTAAACTTGC AAATGAGAGG ACTGTTAAGA CTATTGAAGC CCTAGATATT GATATGTTAC TTAACTTAGC GTCAAAGATG GCTCCTACGC TAAACAAATT TGCGTCATTA ATGGATCAGA TGTCTAGCAA AGGCCAAGTC GACATGTTAG TTAACTTAAT GGAGCAAGGA ATATCGCTAC TTGATGCTGT ACAGAAGGCA GACCTAATAA ACACACTCAT CTCATTCGGA ATGGATCAAC TACCAAAAAT ACAAGCAATA TGGCCAATAC TAGAAAAACT AACAAGCGAA AGAACCCTTA ACTTAATACA AAGTTTAGAT CTGGACTCAA TGTTCAACGC GTTAGAAGCA CTAACGCCAA TAATGAAACA GCTAACAAGC GATAAGGCAA TAAAGATCAT TCAACAATTC GATATTGCTT CTACACTTGG CGCCTTAGAG GCAGCAATGC CACTATTAAA GAAACTAACT GACGAAAAAA CTGTTAAGAT AATATCTCAA ATAGACGTTA ACTCGTTACT TATGCTTACA AACAAACTAG TCGAAATGCA GAAGTCTGGT AGTTTGGATA GACTAATGCA GTTATTGGAA ATAATTTCTG ATCCACAATT CGTTAACGGA CTGGTTACAG TCATGGATAA GTTTTCTAAG GCCTTTAAAG CTTGGGTTAA TGACATACCA AACGCAAGAC CAGTCGGAAC TATGGGCTTA TTGAGAGCTA CAAGCGATAA AGATGTCAGT TATGCTTTAG GATTAATGCT TGAACTAGCT AAAGAAGTTG GAAAGAGTTT TAAATCTTGA
|
Protein sequence | MSVSTVDPLE EILKPENLSR LSKIVDALPT IEKVTDKITE MDKKGQVDFL LSLFDQTVSI LDAVQKADLI NTLISFGMDQ LPKIQAIWPI LEKLTSERAL QLLSQIDIDS TLTALEKLSP IIKKLTDEKV LKVIDQIDYD SLIDSTSKLV PILSKLANER TVKTIEALDI DMLLNLASKM APTLNKFASL MDQMSSKGQV DMLVNLMEQG ISLLDAVQKA DLINTLISFG MDQLPKIQAI WPILEKLTSE RTLNLIQSLD LDSMFNALEA LTPIMKQLTS DKAIKIIQQF DIASTLGALE AAMPLLKKLT DEKTVKIISQ IDVNSLLMLT NKLVEMQKSG SLDRLMQLLE IISDPQFVNG LVTVMDKFSK AFKAWVNDIP NARPVGTMGL LRATSDKDVS YALGLMLELA KEVGKSFKS
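
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The following is a minimal sketch, not the database's own pipeline. It assumes the gene sequence is already given in coding orientation (it starts with ATG even though the gene lies on the minus strand), and it uses the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code. A non-ATG start codon would need separate handling; this record does not have one. The names `translate` and `gc_percent` are illustrative.

```python
BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(dna: str) -> str:
    """Remove the spaces that split the sequence into blocks of 10."""
    return "".join(dna.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(dna)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a non-empty sequence."""
    seq = clean(dna)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First three blocks of the gene sequence from this record.
print(translate("ATGAGTGTAT CAACAGTAGA TCCACTTGAG"))  # prints "MSVSTVDPLE"
```

The output matches the first block of the protein sequence. Passing the full gene sequence to `gc_percent` should give a value close to the GC content row, up to rounding.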