Gene Ssol_2242 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSsol_2242 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSulfolobus solfataricus 98/2 
KingdomArchaea 
Replicon accessionCP001800 
Strand
Start bp2021537 
End bp2022724 
Gene Length1188 bp 
Protein Length395 aa 
Translation table11 
GC content34% 
IMG OID 
Productconserved hypothetical protein 
Protein accessionACX92430 
Protein GI261602827 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGATGTGA AATATATTCT TGTAGTGATA ATAATAATAA CGTTTATAGT AAATATTGTA 
TCTCTATTTT ACTTAAACTC TCAAATAGCA AGTCTGTCAT CAAGCTATAA TACATTAGTC
AATAACTATA ATACTTTAAG GACCTACTAC CAAAACCTCA ATTCCAATTA TACTACACTT
TACTCATCAT ATTCTAATCT AGTTAACTCT TATAATTCGT TATCATCTCA ATATGCAAAG
CTATCTTCAG AATACAATAC GCTTATGGCA AAGTACGATA ACTTAACCGC AAAGTATAAC
ACTCTATCTC AAAATTACAC CATATTGTCA GGTCAGTTAG CCCTAACAAT GGGCACAATG
ACTGTTCAGT CATTCTATAT TTATTTAGCA CAAGTAAACA CTCAAGGCAT GGAAAGTTTA
TTAGTAGGAC CTCTAGCATC TTACTTCGAA ATAACGTCAC CACCTGGTAA TGGTACTATA
ATAGCTTCCC CAGCGAATTC TTCAGATGCA CTTCCTCTTA TTGGATCTAA GTTATCCCAA
TTCTTCAACT ATTTAAGTGT AAAAACTGAG GTTAAAGAAC TTGTAATTAC TCCCTTGGAA
AACTACGTAT TGGGAGAGGG TTTGGTTTCA TTTAATGACC AATATGCTAA TGGAACAATT
GTTACAAATT ACGCACTAAT CACAGTAGTT GCGCAAGAGA TTAACTTATC CACTTGGCAA
GTTGTATATG TAAAGATTAA CAACGCGCTC ACACAAAGTC AATATAACAC GTTAGTAACG
CTTTTCAACT TAATCCAAGC GTTAGAATCT AAGAATATAG GTCAATTACA ATCCATATTA
GTTGGTCCTT ATCAAAGTTA TGTGTATATA GCCCGAGGAC CTTACGCTGG TAATTATTCT
GGATTAGATG TACCTAATGT TTTCATAGAC ACTATAATAT CTAAGGATGT AAGTTCACTA
CAATTTGAAT TATATTACTT TAACATAACT CCATTAAACC CCACTACTAG CTTAGTTGAT
ATGTATGGCG TACTTAAGAT TACCCTCTCA AACGGTTCTA CTTACACTTC CTATACAGAT
CTCCGCACAA CAGTGGAACT AGAACCTAAC GGGGTACCTC AAGTAGTTGC ATTAAATATA
ATTAATGATC TAACTCAACA ACAAGTAGTT TCTGCATTGC CAAAATGA
 
Protein sequence
MDVKYILVVI IIITFIVNIV SLFYLNSQIA SLSSSYNTLV NNYNTLRTYY QNLNSNYTTL 
YSSYSNLVNS YNSLSSQYAK LSSEYNTLMA KYDNLTAKYN TLSQNYTILS GQLALTMGTM
TVQSFYIYLA QVNTQGMESL LVGPLASYFE ITSPPGNGTI IASPANSSDA LPLIGSKLSQ
FFNYLSVKTE VKELVITPLE NYVLGEGLVS FNDQYANGTI VTNYALITVV AQEINLSTWQ
VVYVKINNAL TQSQYNTLVT LFNLIQALES KNIGQLQSIL VGPYQSYVYI ARGPYAGNYS
GLDVPNVFID TIISKDVSSL QFELYYFNIT PLNPTTSLVD MYGVLKITLS NGSTYTSYTD
LRTTVELEPN GVPQVVALNI INDLTQQQVV SALPK