Gene Ssol_2591 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSsol_2591 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSulfolobus solfataricus 98/2 
KingdomArchaea 
Replicon accessionCP001800 
Strand
Start bp2377318 
End bp2378445 
Gene Length1128 bp 
Protein Length375 aa 
Translation table11 
GC content49% 
IMG OID 
Producttransposase IS4 family protein 
Protein accessionACX92700 
Protein GI261603097 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAATACA GACAAAATCG AGAGGATGAG AATTTCTGTA GAAAAGTTTG TATATACGAG 
GGCGAACTTG AACATATGAA GCATGAGAAT ACAACTAAAC CGGCGAAGTT CAACCGGGAC
TTCGCCAGGT CCGCCCTCAA GATAATTTAC TCGATCCTCA CTAAAATACT TTTCCCTGAG
GAACTCCTCA GTGCCTTGCT TAAGGCGAGT GGGAGCTACC TAAGCAGGTT GGGAAAAGAT
GGGAGAAGAG CGTTGAGAAA GTTGAACGCG GTTCAAGTTG AGGACGTGAG GGATGCGTTG
AAGAAAATGG GAAGGATGAC GTTAAGGGGA GTCAGGAACA GGAGGGTAGC AGTGGACTTC
CATGCCATAC CTCAATACCA CGCTGACAAG AGTTTCTTGA GTAGGATAAA GCCAACTAAG
GGGACGTCGT GGGGACTGGT TCAAGCTGCG ATCTTCCTCC TGGGGAGGAC GAGGAGCTTC
TTGGACGTGA TCCCAGTGAC CGTGAAGAAC GTAGCTGAAG GTTTCAAGGC GGTGATGGAG
GTAATCGTGA AGGAGTTGGA GGAGGACAAG CTGAGGCTCG TCATGGTCTT CGCGGACAGG
GAGTTCGCGG TGAACGAAGT GATTAGATAC CTCTTGGAGT TGGGCTTGGA CTTCGTCATA
TCTGCCAAGG CCCAGATGTA CAAGAAGTAC AAGGGGATGT TGCAAGATGT GGATGTGAGT
TTTGGCGGAG TTAGATATAC TGGATTTCTC TGCGTGAGAC ATGGGAGCGG AGCTTATCTC
ATTATCCTGA GGAAGGAAGA CGACAAGATT ATTGCCTTCC TCGTGAGGAG GGAGATGGAT
CTTTATGATG CCATAGTCCT TGCCGAGATG TATAGGGAGA GGTGGGGGAT TGAGAATGTT
TTTCGCTCTC TTGAGGAGTT CAGGATCAGG ACTAGGACTT GTGACGTGAG GAAGGAACTG
GTTCTCGTTC TGCTTTCCTA TCTTCTCTTG AATGTCTGGT TCCTGATCCG TTCTTGGAGG
AAGGTAAAGT TGTGGGAGTT CTCGGTCTCC CTCTCGAATC TCCTCGATCG GGAGGTAAGA
GTGGAACAAG AACGCGCGTT CCGTGAAGTG AAGACGTCAT TCCCCTAG
 
Protein sequence
MKYRQNREDE NFCRKVCIYE GELEHMKHEN TTKPAKFNRD FARSALKIIY SILTKILFPE 
ELLSALLKAS GSYLSRLGKD GRRALRKLNA VQVEDVRDAL KKMGRMTLRG VRNRRVAVDF
HAIPQYHADK SFLSRIKPTK GTSWGLVQAA IFLLGRTRSF LDVIPVTVKN VAEGFKAVME
VIVKELEEDK LRLVMVFADR EFAVNEVIRY LLELGLDFVI SAKAQMYKKY KGMLQDVDVS
FGGVRYTGFL CVRHGSGAYL IILRKEDDKI IAFLVRREMD LYDAIVLAEM YRERWGIENV
FRSLEEFRIR TRTCDVRKEL VLVLLSYLLL NVWFLIRSWR KVKLWEFSVS LSNLLDREVR
VEQERAFREV KTSFP