Gene Ssol_2091 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSsol_2091 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSulfolobus solfataricus 98/2 
KingdomArchaea 
Replicon accessionCP001800 
Strand
Start bp1871839 
End bp1873098 
Gene Length1260 bp 
Protein Length419 aa 
Translation table11 
GC content36% 
IMG OID 
Productprotein of unknown function DUF1641 
Protein accessionACX92297 
Protein GI261602694 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTGTAT CAACAGTAGA TCCACTTGAG GAAATATTAA AACCGGAAAA CTTAAGTAGG 
TTATCCAAGA TAGTTGACGC ATTACCTACA ATAGAGAAAG TAACTGATAA AATAACAGAA
ATGGATAAAA AAGGTCAAGT GGACTTCTTA CTCTCATTAT TTGATCAAAC AGTTTCCATA
CTTGATGCTG TACAGAAGGC AGACCTAATA AACACACTCA TCTCATTCGG AATGGATCAA
CTACCAAAAA TACAAGCAAT ATGGCCAATA CTAGAAAAAC TAACAAGCGA AAGGGCTTTG
CAATTACTAT CGCAGATTGA TATAGACTCT ACTCTTACCG CGTTAGAGAA ATTATCGCCA
ATAATTAAGA AGCTAACTGA CGAAAAAGTG TTGAAAGTAA TTGATCAGAT AGACTATGAT
TCTTTAATAG ATAGTACTTC GAAGTTAGTA CCTATTTTAT CTAAACTTGC AAATGAGAGG
ACTGTTAAGA CTATTGAAGC CCTAGATATT GATATGTTAC TTAACTTAGC GTCAAAGATG
GCTCCTACGC TAAACAAATT TGCGTCATTA ATGGATCAGA TGTCTAGCAA AGGCCAAGTC
GACATGTTAG TTAACTTAAT GGAGCAAGGA ATATCGCTAC TTGATGCTGT ACAGAAGGCA
GACCTAATAA ACACACTCAT CTCATTCGGA ATGGATCAAC TACCAAAAAT ACAAGCAATA
TGGCCAATAC TAGAAAAACT AACAAGCGAA AGAACCCTTA ACTTAATACA AAGTTTAGAT
CTGGACTCAA TGTTCAACGC GTTAGAAGCA CTAACGCCAA TAATGAAACA GCTAACAAGC
GATAAGGCAA TAAAGATCAT TCAACAATTC GATATTGCTT CTACACTTGG CGCCTTAGAG
GCAGCAATGC CACTATTAAA GAAACTAACT GACGAAAAAA CTGTTAAGAT AATATCTCAA
ATAGACGTTA ACTCGTTACT TATGCTTACA AACAAACTAG TCGAAATGCA GAAGTCTGGT
AGTTTGGATA GACTAATGCA GTTATTGGAA ATAATTTCTG ATCCACAATT CGTTAACGGA
CTGGTTACAG TCATGGATAA GTTTTCTAAG GCCTTTAAAG CTTGGGTTAA TGACATACCA
AACGCAAGAC CAGTCGGAAC TATGGGCTTA TTGAGAGCTA CAAGCGATAA AGATGTCAGT
TATGCTTTAG GATTAATGCT TGAACTAGCT AAAGAAGTTG GAAAGAGTTT TAAATCTTGA
 
Protein sequence
MSVSTVDPLE EILKPENLSR LSKIVDALPT IEKVTDKITE MDKKGQVDFL LSLFDQTVSI 
LDAVQKADLI NTLISFGMDQ LPKIQAIWPI LEKLTSERAL QLLSQIDIDS TLTALEKLSP
IIKKLTDEKV LKVIDQIDYD SLIDSTSKLV PILSKLANER TVKTIEALDI DMLLNLASKM
APTLNKFASL MDQMSSKGQV DMLVNLMEQG ISLLDAVQKA DLINTLISFG MDQLPKIQAI
WPILEKLTSE RTLNLIQSLD LDSMFNALEA LTPIMKQLTS DKAIKIIQQF DIASTLGALE
AAMPLLKKLT DEKTVKIISQ IDVNSLLMLT NKLVEMQKSG SLDRLMQLLE IISDPQFVNG
LVTVMDKFSK AFKAWVNDIP NARPVGTMGL LRATSDKDVS YALGLMLELA KEVGKSFKS