Gene Ssol_0023 details


Gene Information

Locus tag: Ssol_0023
Symbol:
ID:
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sulfolobus solfataricus 98/2
Kingdom: Archaea
Replicon accession: CP001800
Strand:
Start bp: 20490
End bp: 22421
Gene Length: 1932 bp
Protein Length: 643 aa
Translation table: 11
GC content: 36%
IMG OID:
Product: protein of unknown function DUF255
Protein accession: ACX90328
Protein GI: 261600725
COG category:
COG ID:
TIGRFAM ID:
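
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are inclusive, 1-based coordinates and that the CDS ends with a single stop codon that is not counted in the protein length. Neither convention is stated in the record, and the function names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 20490
END_BP = 22421
GENE_LENGTH_BP = 1932
PROTEIN_LENGTH_AA = 643


def span_length(start, end):
    """Length of a feature, assuming inclusive 1-based coordinates."""
    return end - start + 1


def coding_codons(gene_length_bp):
    """Number of amino-acid codons, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons hold for the values listed in this record.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 22421 - 20490 + 1 gives 1932 bp, and 1932 / 3 - 1 gives 643 aa, which matches the listed values.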


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAATGAGA GATTAATGAA TTCTCAAAGT GGATATTTAA GGAAGGCAGT AAATCATCCA 
ATAAATTGGT ACACTTGGTC TAATGATGTA TTTGAAATTG CGAAGAAGGA GAATAAGCCA
ATACTTATTG ATGTAGGCGC TGCTTGGTGC CATTGGTGCA ATGTTATGGA TGAGGATACC
TATTCAAATC CAGAAATAGC TAAGATAGTT AACGAGAATT TTATTCCCGT AAAGGTTGAT
CGTGATGAAA TGCCCGATTT GGACAGGTTA CTTCAAAACG CAGCTATGGC AATAACAGGT
GAATCTGGAT GGCCTTTAAC TATATTTATG ACACCAGAGG GAAAGGTCTT CTTTGGTGGC
ACTTACTTTC CTCCAGAGGA TAGATATGGA CGAATAGGCT TCAAGAGATT ACTATTGGAG
ATTTTGAGAA TATGGAGAGA AGATAGAAAC CAAATACTAT TATCTTCAAT AGACCCCAAT
ATGTTGATTC CTAAAAATAC TGGTGGAGTT CTAAATTCAA GCGTTTTAGA TGATGCGTTT
TCCTTTATAA CATCCTATTT TGATATCGAG TATGGTGGAT TAGGGTTTGG TGCCAAATTT
CCCCATCCTT TTGTCGACTT GTTATTCCTT AACTACAGTG CTACTAGAGG AGATGATTTA
GGTAAGAAAC TAGGTCTATT CACTTTAAGG AAAATGTACT ATGGTGGTAT TTTTGATCAA
GTTGGTGGAG GATTTCATAG ATATACCGTT GATAGAGAGT GGAAGTTACC TCATTTTGAG
AAACTACTTA TAGATAATGC TGAATTAATC TACGATTACT TTACTTATTA TATAGCGACT
AATGATATAG AAATTCTTGA CGCCTTACGA AATTCAGTAG ATTTCGTTTT AAGAGAATTA
TACATAGATG GGAAAGGCTT TGCTAATAGT TTAGATGCAG ATAGTGAAGG TATGGAAGGT
AAGTATTACA CGTGGACTGA GGATGAGCTA AAAGATTCGT TAGGAAATGA TTACGGTATC
GCTATAAGGG TATTTGACTT GAAGAATACA GTGGAGGTTG AAGGTAGGAA AGTTCTATTG
AGAACTATTG AACTTAGGGA GCTGTGTAAA GCAACTGGTT TACAAGTGAA TGATATGATA
AAGAAGATAA GTGAGATAAG AACTAAACTA TTGGCATATA GAAAGCAAAA TAGAAAAATG
CCATATAGAG ATGATAATAC GTACACTTAT TCTAACGCAA AGCTGGCAGA ATCCATGTTG
TATTCCTCAT TAATTTTAAA CAAGAGCATT GAGGAGGCTA AGCTTGTTCT GAGTAAGTTA
GGAAAAACTG TTAGTAGAAG ATTAGATGGC GGAAAGGCGG GATTACCAGA AGACTATGCT
GCAGCACTTT TAGCAACTAT TTCCGCATAC GAGGTATTAG GCGAAGAAAG GTATTATGAT
TTGGCAATTG AACTCGGTAG AAATATTGGA GTCATTAAGC CTAATAATTT CACTGACATG
CCTAATGAGT CTGCAAATTC AATGTACATT AGGGCGTTAT TCAAGTTGTC GCTATTATCT
GATGAGTTTA AAGTTGATGA GGAGAAAATA AAACCTTTAA TTCCCTCTTT AACTGCAGAA
AACTCTCAAT TTATTGCTGG CATAGTAAAT TCTATCTCTA GTTACATTAG TGGAATGGCT
CATGTTGTGG TGGTTGATGA AAAGGATGGA TTAGCTGAGA GGTTACATAA AGTTGCGTTA
CTTACCTATT ATCCCTTTAA ACTAGTTGAA AAGGTGGATG ATAGCAGAAC TGACTATGTT
AGCTCAGTAA TAAGGGCTAT GATTAAATAT AATGTGGGAA GGAGTAGAGC ATATGTGTGT
ATAGGCAATA CGTGCAGTAT GCCAGTAAGT GAAGAGGAAA AGATTAAACA ATTACTTAAA
ACTAAGCTTT GA
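
A GC content figure such as the one reported for this gene can be computed from a sequence printed in 10-base blocks like the one above. This is a minimal sketch, not the database's own method: the `gc_content` helper is hypothetical, and it assumes the percentage is simply the share of G and C bases over the whole sequence. Whether it reproduces the reported 36% for the full gene has not been verified here.

```python
def gc_content(seq):
    """Return the percentage of G and C bases in a nucleotide sequence.

    Whitespace (the 10-base block spacing and line breaks) is ignored.
    """
    cleaned = "".join(seq.split()).upper()
    if not cleaned:
        raise ValueError("empty sequence")
    gc = sum(1 for base in cleaned if base in "GC")
    return 100.0 * gc / len(cleaned)


if __name__ == "__main__":
    # First three blocks of the gene sequence, used only as sample input.
    sample = "ATGAATGAGA GATTAATGAA TTCTCAAAGT"
    print(f"{gc_content(sample):.1f}%")
```

To check the full gene, paste all sequence lines into one string; the spacing does not need to be removed first.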
 
Protein sequence
MNERLMNSQS GYLRKAVNHP INWYTWSNDV FEIAKKENKP ILIDVGAAWC HWCNVMDEDT 
YSNPEIAKIV NENFIPVKVD RDEMPDLDRL LQNAAMAITG ESGWPLTIFM TPEGKVFFGG
TYFPPEDRYG RIGFKRLLLE ILRIWREDRN QILLSSIDPN MLIPKNTGGV LNSSVLDDAF
SFITSYFDIE YGGLGFGAKF PHPFVDLLFL NYSATRGDDL GKKLGLFTLR KMYYGGIFDQ
VGGGFHRYTV DREWKLPHFE KLLIDNAELI YDYFTYYIAT NDIEILDALR NSVDFVLREL
YIDGKGFANS LDADSEGMEG KYYTWTEDEL KDSLGNDYGI AIRVFDLKNT VEVEGRKVLL
RTIELRELCK ATGLQVNDMI KKISEIRTKL LAYRKQNRKM PYRDDNTYTY SNAKLAESML
YSSLILNKSI EEAKLVLSKL GKTVSRRLDG GKAGLPEDYA AALLATISAY EVLGEERYYD
LAIELGRNIG VIKPNNFTDM PNESANSMYI RALFKLSLLS DEFKVDEEKI KPLIPSLTAE
NSQFIAGIVN SISSYISGMA HVVVVDEKDG LAERLHKVAL LTYYPFKLVE KVDDSRTDYV
SSVIRAMIKY NVGRSRAYVC IGNTCSMPVS EEEKIKQLLK TKL
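
The protein sequence follows from the gene sequence by codon translation. The sketch below is an illustrative assumption about how that could be done, with hypothetical names. It uses the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code (the two differ in permitted start codons). It reads from the first base in frame and stops at the first stop codon.

```python
BASES = "TCAG"
# Amino acids for all 64 codons, in TCAG order for each codon position.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(seq):
    """Translate a DNA sequence in frame 1, stopping at the first stop codon.

    Whitespace between sequence blocks is ignored. Any trailing bases that
    do not form a full codon are dropped.
    """
    cleaned = "".join(seq.split()).upper()
    protein = []
    for pos in range(0, len(cleaned) - 2, 3):
        amino_acid = CODON_TABLE[cleaned[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


if __name__ == "__main__":
    # First three blocks of the gene sequence in this record.
    print(translate("ATGAATGAGA GATTAATGAA TTCTCAAAGT"))  # MNERLMNSQS
```

The printed residues match the first ten residues of the protein sequence above. The sketch does not handle ambiguous bases such as N, nor alternative start codons.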