Gene Ssol_0454 details

Gene Information

Locus tag: Ssol_0454
Symbol:
ID:
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sulfolobus solfataricus 98/2
Kingdom: Archaea
Replicon accession: CP001800
Strand:
Start bp: 406433
End bp: 407620
Gene Length: 1188 bp
Protein Length: 395 aa
Translation table: 11
GC content: 37%
IMG OID:
Product: protein of unknown function DUF224 cysteine-rich region domain protein
Protein accession: ACX90737
Protein GI: 261601134
COG category:
COG ID:
TIGRFAM ID:
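
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions and that the CDS ends with a single stop codon that is not counted in the protein length. Neither convention is stated in the record itself, and the function names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 406433
END_BP = 407620
GENE_LENGTH_BP = 1188
PROTEIN_LENGTH_AA = 395


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def coding_codons(gene_length_bp: int) -> int:
    """Amino-acid count for a CDS, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))   # 1188
print(coding_codons(GENE_LENGTH_BP))   # 395
```

Under these assumptions both results agree with the Gene Length and Protein Length fields.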


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTATTCTT TAAATCCAAG TAATTCTACT TTTTTTGACT CTAATAAATT ATTATCAGAA 
TTTATAAGGC AAGCGAGCGT TTGTCACGGT TGCAGATTAT GTTTCAACTA TTGCGATTCT
TTTCCCCTTA TGTTTACTTA TACTGATAAG AAAGGCCCCA AAAACTTAAC CTTAGATGAC
TTGTTTAATG TAGCCTCTAA GTGCTTTCAC TGTAAGATGT GCTACGTCAA TTGTCCCTAT
GTTCCTCCTC ACGAATTTAA CATGGACTTT CCAAGCCTAA TGGAATGGGC GTGGCTATAC
TATAAGAAAA ATCGAGGATT AACTGTAAGG GATTTTATCT TTGAAATGCT AGATGGTGTG
AAGTTTGCAA GGCCCTTAGC TAAAGTAATT ATGGAAAAGA ACAAGGAGTT ATTAGGTATT
CACAAAGAAG CCCCCACGTT ACCAGTAGCG GAGAAAGGTT TAAGGGAAAG AGTTAAGCCC
AAACGTATCG ATAGTCCCAA AGCAAGGGTT GCACTATTTC CCACTTGTTT AATTGAGAAT
TTCTTCCCAG AAATTGGCGA GGATTTAGTA GAAATATACA ACGAATTAGG GATAGAAGTA
ATTATTCCTA ATTTCGTTTG TTGTGGAGCT CCAATGTTGG ATTCTGGTGA CGTTGATAGG
CTTAAGAAGA ATGCTGAGTA TAATATCAAA ATAATTGAGG ATTTAATAAA GGAAGGTTAT
GATGTAGTTT CGCCTATACC TACTTGTACG TTAATGATTA AGGAGTACAA GAAGGTTCTT
GATAGAGAAG TACCTAAGGT TTATGATGCA ATGGAGTATC TTTTAAAATT AAAGAATGAG
GGCAAGATAG AGCTAAAGGG TAAGATTGAG AAGAGTGTGT ATTATCATCC TCCATGCCAC
CTTAAGTTCT TACAATTAGG ATTACCTGGG GTTAGATTAT TAAGGTCAAT GGGAGCGAAA
GTCGATATTT CCAATAATGG TTGTTCCGGT ATAGATGGGG GTTGGGGATT AAGAAATTAT
GACACTGCTA AAAGAGTAGG AAGTAAAATG ATGGAAGCTT TTAAACAGAG TAAAGCTGAT
CTTTTTTCAA CTGAATGCCC TCTGGCTGGG CTTCAGATAG AAAAATCTTC TGGTAGAAGG
CCATTACATC CAATTCAATT GTTAAAGGAG GCGATGAAAA ATGGTTAA
 
Protein sequence
MYSLNPSNST FFDSNKLLSE FIRQASVCHG CRLCFNYCDS FPLMFTYTDK KGPKNLTLDD 
LFNVASKCFH CKMCYVNCPY VPPHEFNMDF PSLMEWAWLY YKKNRGLTVR DFIFEMLDGV
KFARPLAKVI MEKNKELLGI HKEAPTLPVA EKGLRERVKP KRIDSPKARV ALFPTCLIEN
FFPEIGEDLV EIYNELGIEV IIPNFVCCGA PMLDSGDVDR LKKNAEYNIK IIEDLIKEGY
DVVSPIPTCT LMIKEYKKVL DREVPKVYDA MEYLLKLKNE GKIELKGKIE KSVYYHPPCH
LKFLQLGLPG VRLLRSMGAK VDISNNGCSG IDGGWGLRNY DTAKRVGSKM MEAFKQSKAD
LFSTECPLAG LQIEKSSGRR PLHPIQLLKE AMKNG
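
The sequences above are printed in space-separated blocks of ten, so they need cleaning before any downstream use. The following is a minimal sketch of that cleanup, with a GC-fraction helper and a codon-by-codon translation. It assumes the standard codon assignments, which translation table 11 shares with table 1 for elongation (the two tables differ in permitted start codons). The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical and not part of any database tool. The two example strings are copied from the gene sequence: the first three blocks, and the final line, which begins on a codon boundary because the 19 preceding lines hold 60 bases each.

```python
from itertools import product

# Standard codon table in NCBI "TCAG" order; table 11 uses the same
# codon-to-amino-acid assignments for elongation.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks used in the listing."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate complete codons from position 0; '*' marks a stop codon.

    Ambiguous bases such as N are not handled and raise KeyError.
    """
    s = clean(seq)
    usable = len(s) - len(s) % 3
    return "".join(CODON_TABLE[s[i:i + 3]] for i in range(0, usable, 3))


first_blocks = "ATGTATTCTT TAAATCCAAG TAATTCTACT"
last_line = "CCATTACATC CAATTCAATT GTTAAAGGAG GCGATGAAAA ATGGTTAA"

print(translate(first_blocks))  # MYSLNPSNST
print(translate(last_line))     # PLHPIQLLKEAMKNG*
```

Both outputs match the corresponding ends of the protein sequence listed above, with the trailing `*` marking the TAA stop codon.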