Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ssol_0411 |
Symbol | |
ID | 0 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sulfolobus solfataricus 98/2 |
Kingdom | Archaea |
Replicon accession | CP001800 |
Strand | - |
Start bp | 368118 |
End bp | 371078 |
Gene Length | 2961 bp |
Protein Length | 986 aa |
Translation table | 11 |
GC content | 32% |
IMG OID | |
Product | conserved hypothetical protein |
Protein accession | ACX90694 |
Protein GI | 261601091 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATTTTAA TATATAGTAC TACTGAAGAA AGTTCAGTTG GAAAAATGCT TAAAAGGTTT TGTATAAAGA CAAAAAATAT GGAGTGGAAA GTAATTATTA TAACAATCCT TTTAATTATA TCTCTTATTC CATTAACCCC AGTATCTCAT AGTAATACTA AACCACTACC ATTATCCACG TTAAATTCTA ATGACCCAGG GATAACTTTC TACGACGAGC AGCTATTCAT GGCGTTAAAT GCATATCCAG TTAGTACAAA TTTAGTAATT TATGTAAAAG TAATTGCACA AACTAGTGAT TCTGGTTACG GTCCAGCGTA TTTGTTAAAT GCAATAACCA ATAATGACTG GTGGTATCAA GTTGGTGTAG CCTATAATTG GCCTTATACA AATGGTTCAT TTGACCCAGG ATTTCATATG ATCTACATGG TCTGGAATAG TAGCGGGGAT CCGGTAATAG GACCAGTTCT GCTAAACTTC AATGGAATTG TGAATAGCGG GGATGAGATG ATGCTAGAAA TCGAAATATC AAATGGCAAT ATAATACTTT CCGCTTATGA TAATAGCACT GGTGCCAAAG CAATTGCAAT TGTTAACGCT AACGGTGCAT CCAGTATCGT TACGGATCTT AACTTTTTAC ATGGTTTCAT AACTGGATTA ATGACAGAAT GGTATCATCT TAAGGCATAT TTTGGAGGAG AACTGGGAGT TATATATAGA TTACAATATT TAACAACTAG CTTCACTTTA GGTATTAATG AAATATATTT AACCTCCCCT CCACAAGGTA TAGCGACTGA CGAGATTGCA TATTCGGTTT CAAATAACTA CACCTTCTAT AATTTGTCCT ATATAGGAGC ATTTACAGCC TCAAATGGAT ATTACTTTAT AACTGGTAAT TTATATCCAA TATTCTTAAA TTATAAGGTA ATTGGAGGAA GCTTTAATAT ACCTATAAAC GTTTCATATT TTTCCGACGG TCTTAAAGAG ATCGCAAGAT TGCCTTCCTT GATTTTCGTG AATGCTAATT CTAATATAAG TTTACCATCT ATTGTCTTAA ACGGGTTAAG CAGGATTGTT AGCTTAAATT CTACTCCAAT TTTAGCAAAC AGGAGTGGAA ACATAACAGT ATATTATCAA CTTCAGTATT TCATTAATAT AAACATTCCA GTAAACGCAA GTATCAATGG AATTTATACA ACTCTGAGTA GTGGATGGTA TAACGCCTCT ATAAAAGTGG TAATTAGCCC GTTTACATAT TATCCCAATG ATTCAAGAAT AATGGTATTT TCATACCCAA CTAATGAATT TACATTAATT ACCCCCACTA ACCTCACAAT AACTTATATT TTACAATACT ACGTGCACGT TAACTCATTA ATTCCACTTT TCGGATCAAT AAATGGGACA AACGCATCAA TTTCCTCTGG GTGGTACAAT AAGAACACGG TTATACAAAT TTATAATATA ACGTTTTATC AAAGCAATGA AACTAGAGCG GTAATTACTA AGATCTTACC CGCAAATAAA ATCCTCGTAA ATATGTCTTA TACAATAACT GCAAATGAGT TAATACAATA TTATATTATT GTAAAATCAC AGATACCCAT TTATGCACTT ATTAATGGAA CTAATAGTAC ATTAACCAGT AATTGGTATA ATATAGGTAC GAATATTAAT ATAGAAAACA TAACATATTA TGGACAAAAT AACGAATATA GGTATGTAAT ATCTAATATT TTACCTTCCA GAAACATTAC GGTTGAAAAT CCAATCACAA TATCAATCAT AACGGTAAAG CAATACCCTC TGATTGTCTA TTCCAAGATA CCAGTTTACG CACTAGTCAA TGGTACTAAC GAGACGTTAC AAAAATACTC TTGGTTCAAC GCTGGAAGTA GAATACAGAT AGAGAACATC ACATACTATA TAAATAGTAC TGCAAGATTT CTCATGGAAA AAGTTTTACC ATATTCAAAC TTTACATTAC TCCAACCAAT TAATATAACT ATAATTACAC TACCTCAATA TTTCTTAAAC GTTAGTAGTA ACTACCCAGT CTATGTACTC TTCAACGGTA AAAACACCAC CTTAAGTAAC GGATGGTATA ATAATGATAC TGAAATTGAA TTATATACAA TTTGGTATAT TAATCAAACT GAAAGACAAA ACTTAGTCAA TATCAGTCTA AATGGAAAGC CAACTTCCAA TAATGTTATA ATAGTTAATG GGCCAGTCTC GTTAAAACTG CACTATGTCC TACAATACTA TATAAACTTA ATTTCCAACA TACCAATAAA AGCCTTAATC AATTCAACCT TAGTTACTTT TAGTCCAGGA TGGTATAATG CGGTAACTCC TATTTCGTTC ATAAACATGA CGTATTATAT CTCCAACAAT ACAAGATATA TAATATTATC CATATTACCA TTTAACTTTA CAGTAAATAG AAGCTTAACT GTGAAAGTAA CGACGTTAAA GGAGTATCTG GTTACTGTGA ATGAGCCAAT ACTGATTACA ATAGCAAATA GGACCCTAAA TACTAGTGAA ATTTGGGTAC CAGCTGGGCA AACTTTATTA ATACCTAAAT ACGTAAATAT AAGCAATAAT GAAAGAATAT TCTATAATAC TTCTTCGTAT TTAATAAATA TAACCCAACC AACTAGTATT AATGTTAAAC CAATAATAGA ATATTTAGTA ACAATAGATG GAAATTCAGA ATGGTTACCA AGAGGAAGCG TAATAACGTT AACCCAATCC GTGCCCATAT ATGAACAAGG CAAGTGGGAG GGCTCCTATA ACGTTAGTAA TGGTGTAGCA ATAACCGTAA ATCAACCAAT AACTGAAACT TTCGTAAAGA ACATAAATGG GAGTTTTGTA GGGAGTGTCA TAATACTAAT AGCAATTATA ATCATAGCAA TATTGTTCTT AACGATAAGA AGGTCCAGAC CAAAGTTTTA A
|
Protein sequence | MILIYSTTEE SSVGKMLKRF CIKTKNMEWK VIIITILLII SLIPLTPVSH SNTKPLPLST LNSNDPGITF YDEQLFMALN AYPVSTNLVI YVKVIAQTSD SGYGPAYLLN AITNNDWWYQ VGVAYNWPYT NGSFDPGFHM IYMVWNSSGD PVIGPVLLNF NGIVNSGDEM MLEIEISNGN IILSAYDNST GAKAIAIVNA NGASSIVTDL NFLHGFITGL MTEWYHLKAY FGGELGVIYR LQYLTTSFTL GINEIYLTSP PQGIATDEIA YSVSNNYTFY NLSYIGAFTA SNGYYFITGN LYPIFLNYKV IGGSFNIPIN VSYFSDGLKE IARLPSLIFV NANSNISLPS IVLNGLSRIV SLNSTPILAN RSGNITVYYQ LQYFININIP VNASINGIYT TLSSGWYNAS IKVVISPFTY YPNDSRIMVF SYPTNEFTLI TPTNLTITYI LQYYVHVNSL IPLFGSINGT NASISSGWYN KNTVIQIYNI TFYQSNETRA VITKILPANK ILVNMSYTIT ANELIQYYII VKSQIPIYAL INGTNSTLTS NWYNIGTNIN IENITYYGQN NEYRYVISNI LPSRNITVEN PITISIITVK QYPLIVYSKI PVYALVNGTN ETLQKYSWFN AGSRIQIENI TYYINSTARF LMEKVLPYSN FTLLQPINIT IITLPQYFLN VSSNYPVYVL FNGKNTTLSN GWYNNDTEIE LYTIWYINQT ERQNLVNISL NGKPTSNNVI IVNGPVSLKL HYVLQYYINL ISNIPIKALI NSTLVTFSPG WYNAVTPISF INMTYYISNN TRYIILSILP FNFTVNRSLT VKVTTLKEYL VTVNEPILIT IANRTLNTSE IWVPAGQTLL IPKYVNISNN ERIFYNTSSY LINITQPTSI NVKPIIEYLV TIDGNSEWLP RGSVITLTQS VPIYEQGKWE GSYNVSNGVA ITVNQPITET FVKNINGSFV GSVIILIAII IIAILFLTIR RSRPKF
|
| |