Gene Ssol_1177 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSsol_1177 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSulfolobus solfataricus 98/2 
KingdomArchaea 
Replicon accessionCP001800 
Strand
Start bp1095197 
End bp1097026 
Gene Length1830 bp 
Protein Length609 aa 
Translation table11 
GC content31% 
IMG OID 
Productprotein of unknown function DUF814 
Protein accessionACX91415 
Protein GI261601812 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTTCTC AGAATATAAA ATTACAGAGA AAAAACAGTA TGACTTACTT TGATTTAATA 
GCGTGGATTA CAGAAAATAA GAAAGCGATA GAGGGATGTA TAATAGATAA CGTTTTCTTG
ATACAAAATA CTCAAAATAC ATATATTTTA AAGTTACATT GCAGTGGAAG AGACCAAGAA
TTAATAATAG AACCAAGTAA ACGAATAAAT ATAACAAAAT ACAATTATCC AAAAATTTCT
TCAACAAAAA TAACCGAATT AAGAAGATTA ATTAGAGGAG ATATAATTAC GGATATGTAT
GTATTGAACA AGGAAAGAAT ATTAATCCTA AAACTCAAAA GGGATGATAA AAAGGTAATA
GTAGAACTAC TACCTAGAGG AGTTTTGGTA ATAGCAGATA AAGATGGTAA AATCTTGTTT
GCGAGTGAAT ATAAAGAGTT TAAAGATAGA CTCATAAGAA TAGGGGAGAT ATATAAACCT
CCACCTTCAA TTGAACCTAA TATAGATGAA ATAGAAAAAT TGATTAAGAA AGGAAACATA
GCGAAAGGTT TAGGAATCCC ACAAGAAGTA GCAAACTATC TTAGCTTACA AGACTCTACA
CCAGATATTA ACGTAATAAG GGAAAAGATA AGAAATTTAG AGATTTCAAT AATTAATGGA
GAGATAAAAC CATGTCTCGT AGAAGATACA ACTGTAGTAC CTTTTTATCT TGACGGATGC
AAAGAATATC AAAGATTTAA TGATGCAATA GACGATTATT TCTACACTAT AACTCAAAAA
GAGCTATCTG AAAAAACTTC CAAAAAAATC TCAGAAGAGA AGCAGAAGAT TATAGCCACA
ATTAAGCAAA TAGAGGATAG TATAAAGGAT TATGAAGACA AAGAAAATAA CTATAGACAA
CTAGGCAATT TTATACTTTC AAAGGCATAC GAAATAGACC AGTTGTTGTT AAATAATAGA
GCAAAAAGTA AAAAGGTAAA GCTTAATGTA GATGGAGTTG AAATTGAATT AGATACCTCA
CTCTCAGCTA CTAAAAACGC AATGAGATTT TTTGATGAAG CTAAGGAATA TAAGAGAAAA
ATAGAAAGAG CCCTTAAAAG TTTAGAAGAA CTAAAAGAAA AACTGGCTAA AATAGAGAAA
CAAGAAATAG AGAAACAAAA CGAGATAAAA CTAACGCTAA GGAAAAAGGA ATGGTATGAG
AAATATAGAT GGAGTATTTC AAGAAGCGGA TATTTAATAA TTTTAGGAAG AGATGCAAGT
CAAAATGAAA GTATAGTTAA AAAATACCTA AGGGACAAAG ATATATTCTT GCATGCGGAT
ATTATAGGCG CTCCAGCCAC AATCATCATA ACACAAGATA ATAAGACAAT CTCTGAAGAA
GATATCTATG ATGCAGCAGT TATGGCTGCG AGCTACTCAA AGGCTTGGAA AGTAGGTTTA
GCATCTGTTG ACATATTTTG GGTTTTAGGC AATCAAGTCT CTAAATCACC GCCAAGTGGA
GAATACTTGA ATAAAGGTTC ATTCATGATT TATGGAAAAA AGAATTTCAT AAAAAACGTC
AAACTACAAT TAGCAATAGG CCTTATACTA AGTGAAAACG GTGTATCAGT AATAGTGGGA
AGTGAGGAAA CCATTTCGGC TAAGACTAAA TACTATGTTG TCATAGCTCC AGGTGATGAT
GATAAAGAGA GAATAACCCA AAAAATTATA AAAGTGTTTA GTAGAGCTTT ACCAGAAATA
AACGGATTGA ACGCATTAAA AACAGAGATT GAAGATAAAA TTCCGGGAAA GAGCAAGATA
GTTAAGACAA GTATAACATA TAATAGTTAA
 
Protein sequence
MSSQNIKLQR KNSMTYFDLI AWITENKKAI EGCIIDNVFL IQNTQNTYIL KLHCSGRDQE 
LIIEPSKRIN ITKYNYPKIS STKITELRRL IRGDIITDMY VLNKERILIL KLKRDDKKVI
VELLPRGVLV IADKDGKILF ASEYKEFKDR LIRIGEIYKP PPSIEPNIDE IEKLIKKGNI
AKGLGIPQEV ANYLSLQDST PDINVIREKI RNLEISIING EIKPCLVEDT TVVPFYLDGC
KEYQRFNDAI DDYFYTITQK ELSEKTSKKI SEEKQKIIAT IKQIEDSIKD YEDKENNYRQ
LGNFILSKAY EIDQLLLNNR AKSKKVKLNV DGVEIELDTS LSATKNAMRF FDEAKEYKRK
IERALKSLEE LKEKLAKIEK QEIEKQNEIK LTLRKKEWYE KYRWSISRSG YLIILGRDAS
QNESIVKKYL RDKDIFLHAD IIGAPATIII TQDNKTISEE DIYDAAVMAA SYSKAWKVGL
ASVDIFWVLG NQVSKSPPSG EYLNKGSFMI YGKKNFIKNV KLQLAIGLIL SENGVSVIVG
SEETISAKTK YYVVIAPGDD DKERITQKII KVFSRALPEI NGLNALKTEI EDKIPGKSKI
VKTSITYNS