Gene Ssol_0070 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSsol_0070 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSulfolobus solfataricus 98/2 
KingdomArchaea 
Replicon accessionCP001800 
Strand
Start bp59940 
End bp61442 
Gene Length1503 bp 
Protein Length500 aa 
Translation table11 
GC content34% 
IMG OID 
Productprotein of unknown function DUF87 
Protein accessionACX90371 
Protein GI261600768 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATAATTG GTTATGTAAT TGGTCAAGCT ACAACACAAG AGGCTTTAAT ACTAGCTGAA 
AGGCCTGTTA GATTAGGAAC TTATGTTGTT CTAGAATATG ATAACGTTAA GGCCCTTGGA
CTAATAACAA ACGTGACTAG AGGTAGCCCT ATGCTAGATG ATAATATGAA TGATATAGAA
ATCGTTCAAA GATTAAAACA ATTCAACAAT AGCATACCCG TTTATACAAA GGCTAAAGTA
AAAATGTTAT GTGATATGAA TAATCACTTT TTAATGCCCG ATATACCCCC GTTCGCTGGA
ACCCCAGCTA GAGAGGCTGA AGATGAGGAG TTAAAAAGTA TTTATTCTCA AGATGGCCAA
ATTAGAATAG GAAGCTTAAT AGGTAAAAAT GTGGAGGTTA AATTAAATAT AAATTCCTTT
GCAAGGCATT TAGCTATTTT AGCAGCTACT GGTTCTGGGA AGTCAAATAC AGTAGCAGTT
CTTTCTCAAA GAATTTCTGA ACTTGGTGGA TCTGTTCTTA TATTCGATTA TCATGGAGAG
TACTATGATA GCGATATAAA GAATCTAAAT CGTATTGAAC CTAAACTTAA CCCTCTTTAT
ATGACCCCAA GGGAATTTTC TACGTTACTA GAAATAAGAG AGAATGCAAT TATACAGTAC
AGAATTTTAA GAAGAGCTTT CATAAAGGTA ACAAATGGTA TAAGAGAAAA GCTAAAAGAA
GGGCAAATAC CATTTTCTAC TCTAAATAGC CAATTTTACG AACTAATGAA AGACGAATTG
GAAACTCAAG GAAATAGTGA TAAAAAGAGT AGTGCAAAGG ATGAGGTACT GAATAAGTTT
GAAGAATTTA TGGATAGGTA TTCAAACGTC ATTGATCTTA CATCTTCAGA TATAATTGAG
AAAGTAAAGA GAGGTAAGGT AAACGTTGTA AGCCTAACAC AATTAGATGA AGACTCAATG
GATGCAGTAG TCTCACATTA TTTAAGAAGA ATCCTTGATT CTAGGAAAGA TTTTAAAAGA
AGCAAAAATA GTGGCCTTAA ATTCCCAATA ATAGCTGTAA TAGAAGAAGC TCACGTTTTC
TTGTCTAAAA ACGAGAATAC ATTAACCAAG TACTGGGCGT CCAGGATAGC AAGAGAGGGC
AGAAAATTTG GAGTTGGATT AACAATAGTA AGCCAAAGGC CTAAAGGTTT GGACGAAAAT
ATATTAAGTC AAATGACCAA TAAGATCATT TTAAAGATAA TTGAACCAAC TGATAAAAAA
TACATCTTAG AGTCAAGTGA TAATTTAAGT GAAGATTTGG CTGAGCAATT GTCCTCCTTA
GACGTTGGTG AGGCTATAAT TATAGGTAAA ATAGTAAAAT TACCTGCTGT TGTAAAGATA
GATATGTTTG AAGGAAAATT ACTTGGATCA GACCCTGACA TGATAGGGGA ATGGAAGAAA
GTCGAGGAAA GTGAAAAAAT AGCTAAAGGT TTTGCTGACT TTGGAACAGA AATTGGTGAT
TAA
 
Protein sequence
MIIGYVIGQA TTQEALILAE RPVRLGTYVV LEYDNVKALG LITNVTRGSP MLDDNMNDIE 
IVQRLKQFNN SIPVYTKAKV KMLCDMNNHF LMPDIPPFAG TPAREAEDEE LKSIYSQDGQ
IRIGSLIGKN VEVKLNINSF ARHLAILAAT GSGKSNTVAV LSQRISELGG SVLIFDYHGE
YYDSDIKNLN RIEPKLNPLY MTPREFSTLL EIRENAIIQY RILRRAFIKV TNGIREKLKE
GQIPFSTLNS QFYELMKDEL ETQGNSDKKS SAKDEVLNKF EEFMDRYSNV IDLTSSDIIE
KVKRGKVNVV SLTQLDEDSM DAVVSHYLRR ILDSRKDFKR SKNSGLKFPI IAVIEEAHVF
LSKNENTLTK YWASRIAREG RKFGVGLTIV SQRPKGLDEN ILSQMTNKII LKIIEPTDKK
YILESSDNLS EDLAEQLSSL DVGEAIIIGK IVKLPAVVKI DMFEGKLLGS DPDMIGEWKK
VEESEKIAKG FADFGTEIGD