Gene Ssol_1450 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSsol_1450 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSulfolobus solfataricus 98/2 
KingdomArchaea 
Replicon accessionCP001800 
Strand
Start bp1333801 
End bp1335546 
Gene Length1746 bp 
Protein Length581 aa 
Translation table11 
GC content34% 
IMG OID 
Productprotein of unknown function DUF87 
Protein accessionACX91682 
Protein GI261602079 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0452751 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAAAGTA TTTTTGAAAC GGAAGAGGGA AAACTTAGAG AAGCTAAAAT TATTACTAGG 
CAAACTTCCG ATGGTAGGGG TACAATATCT TTTAGAAATT ATATAGTGGA ATTTCCATTC
TCGCTGAAGG ATAAGTTGGG TATAGGCAAA TTGCTCGCTG TAAATACAAT AAAGGAAAAT
AACTATCTCA TCTTAGAAGT TGCGGATATT ATTCCAATGC ATTATGGTAT GATAAATCTG
GACTCTACAA TACCTAAGGA GATTAGAAAA GAGATTATGA AAAGGGTTAG TGAAAGTTGG
TATTCAAATG ACGAGAAAGA AATATGGATA GACTCAATAA CTTATCCCTT AGGATATATT
CTTGAAGTAA ATTCGGATAA TGTTCTATTT AAAAAAGGAT ATTTCCCACC TCTATTAGGT
TCTTCAGTGA AGATTTTGAA TAAAAAGGCG TACGCATCAT TTGTCTGTGC CAAGAGTAAT
ATAAGTTTAG GTAAAATTCT ACATGAGGAA CTTTCTCTAG ACGTAAATTT AGAAAAGGCA
ATAAGGTATC ACCTAGGTAT TTTCGCTTTT ACGGGCTCTG GTAAATCAAA TTTAGCATCC
TTGATTGCTA GAAAAGTACT AGATAATTTA CTTGACACTA AAGTTATAAT ATTTGACGTG
TCAATGGAGT ATGCAATACT TCTCTTAGAT AAGTTGCTTG AAGTCCCATC CAGAGTTGTA
AGTGTAGACA GAGTTCCCCC TAATCCAATC GATGCTAGTA GAAAATTTTT AAGGAGTCAC
GTAATTCCTG ACGATATTGT AGATATTAGA GATAAGATAA AGAAAGGTGC GGAAATTCTG
CATCAAAATG GGAAAATGAA ACAGTTATAC GTTCCGCCTG AAGGTCTGTC TTACTTAACC
TATGCTGATT TAATAGATCT AATAAAAAAG CAGATAGAAG ATAAGTATAC TGCAATATCG
CAGAAACCAC TGCTATATAC TTTTCTAAGT AAGTTAGATA ATTTCATGAG AGAAAGGAAA
TTAACTGCAG ACGACATTAT AGATGACTCC ATTAATCAGT TATTAGATGA AATAGAAAAT
TTAGGTAAGG ATGCACACCT AAAGGAAAAT TCGTCACTGT TCACATTTAT ATCTGGCATA
AAGGCGTACA TCTCACTCGG TATAAGAGAA ACCGAAGAGT ATGATATAGA AAATTTAGCG
ATTGAGATCT TAGATTCTTC TAAGGATTCA CCTAGGTTAT TTATTTTAGA ACTACCTAAC
TTAGAAGAGG GAAGGCAAGT AGTTGCGACT ATAATCAATC AAATTTATAA TAGGAGAAAG
AGAATGTATT CCGATAATCC CAAAATATTA TTTATAATAG ATGAAGCTCA AGAGTTTATA
CCTTATGATA CTAAACAAAA AGACAAAAGT GAAGCTTCAA GTACTGCCAT AGAGAAGTTA
CTTAGGCACG GGAGAAAGTA TCATTTACAT TCACTAATAA GTACCCAGAG GCTAGCGTAT
CTAAACACTA ATGCTCTACA ACAGTTGCAC TCTTATTTCA TAAGCACACT CCCTAGGCCA
TACGATAGAC AATTATTAGC TGAAACTTTT GGAATTAGCG ATATGTTATT AGATAAGACC
TTAGAACTGG AACCGGGGCA ATGGTTGTTA GTAAGCTTTA AATCTGCACT TCCTCACGAC
GTCCCCGTAT TCTTTTCCGC GGAGAACAAT CTAGATTTAT TAAAGGATAG AATAAATAAG
TTATGA
 
Protein sequence
MESIFETEEG KLREAKIITR QTSDGRGTIS FRNYIVEFPF SLKDKLGIGK LLAVNTIKEN 
NYLILEVADI IPMHYGMINL DSTIPKEIRK EIMKRVSESW YSNDEKEIWI DSITYPLGYI
LEVNSDNVLF KKGYFPPLLG SSVKILNKKA YASFVCAKSN ISLGKILHEE LSLDVNLEKA
IRYHLGIFAF TGSGKSNLAS LIARKVLDNL LDTKVIIFDV SMEYAILLLD KLLEVPSRVV
SVDRVPPNPI DASRKFLRSH VIPDDIVDIR DKIKKGAEIL HQNGKMKQLY VPPEGLSYLT
YADLIDLIKK QIEDKYTAIS QKPLLYTFLS KLDNFMRERK LTADDIIDDS INQLLDEIEN
LGKDAHLKEN SSLFTFISGI KAYISLGIRE TEEYDIENLA IEILDSSKDS PRLFILELPN
LEEGRQVVAT IINQIYNRRK RMYSDNPKIL FIIDEAQEFI PYDTKQKDKS EASSTAIEKL
LRHGRKYHLH SLISTQRLAY LNTNALQQLH SYFISTLPRP YDRQLLAETF GISDMLLDKT
LELEPGQWLL VSFKSALPHD VPVFFSAENN LDLLKDRINK L