Gene Ssol_0411 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSsol_0411 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSulfolobus solfataricus 98/2 
KingdomArchaea 
Replicon accessionCP001800 
Strand
Start bp368118 
End bp371078 
Gene Length2961 bp 
Protein Length986 aa 
Translation table11 
GC content32% 
IMG OID 
Productconserved hypothetical protein 
Protein accessionACX90694 
Protein GI261601091 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATTTTAA TATATAGTAC TACTGAAGAA AGTTCAGTTG GAAAAATGCT TAAAAGGTTT 
TGTATAAAGA CAAAAAATAT GGAGTGGAAA GTAATTATTA TAACAATCCT TTTAATTATA
TCTCTTATTC CATTAACCCC AGTATCTCAT AGTAATACTA AACCACTACC ATTATCCACG
TTAAATTCTA ATGACCCAGG GATAACTTTC TACGACGAGC AGCTATTCAT GGCGTTAAAT
GCATATCCAG TTAGTACAAA TTTAGTAATT TATGTAAAAG TAATTGCACA AACTAGTGAT
TCTGGTTACG GTCCAGCGTA TTTGTTAAAT GCAATAACCA ATAATGACTG GTGGTATCAA
GTTGGTGTAG CCTATAATTG GCCTTATACA AATGGTTCAT TTGACCCAGG ATTTCATATG
ATCTACATGG TCTGGAATAG TAGCGGGGAT CCGGTAATAG GACCAGTTCT GCTAAACTTC
AATGGAATTG TGAATAGCGG GGATGAGATG ATGCTAGAAA TCGAAATATC AAATGGCAAT
ATAATACTTT CCGCTTATGA TAATAGCACT GGTGCCAAAG CAATTGCAAT TGTTAACGCT
AACGGTGCAT CCAGTATCGT TACGGATCTT AACTTTTTAC ATGGTTTCAT AACTGGATTA
ATGACAGAAT GGTATCATCT TAAGGCATAT TTTGGAGGAG AACTGGGAGT TATATATAGA
TTACAATATT TAACAACTAG CTTCACTTTA GGTATTAATG AAATATATTT AACCTCCCCT
CCACAAGGTA TAGCGACTGA CGAGATTGCA TATTCGGTTT CAAATAACTA CACCTTCTAT
AATTTGTCCT ATATAGGAGC ATTTACAGCC TCAAATGGAT ATTACTTTAT AACTGGTAAT
TTATATCCAA TATTCTTAAA TTATAAGGTA ATTGGAGGAA GCTTTAATAT ACCTATAAAC
GTTTCATATT TTTCCGACGG TCTTAAAGAG ATCGCAAGAT TGCCTTCCTT GATTTTCGTG
AATGCTAATT CTAATATAAG TTTACCATCT ATTGTCTTAA ACGGGTTAAG CAGGATTGTT
AGCTTAAATT CTACTCCAAT TTTAGCAAAC AGGAGTGGAA ACATAACAGT ATATTATCAA
CTTCAGTATT TCATTAATAT AAACATTCCA GTAAACGCAA GTATCAATGG AATTTATACA
ACTCTGAGTA GTGGATGGTA TAACGCCTCT ATAAAAGTGG TAATTAGCCC GTTTACATAT
TATCCCAATG ATTCAAGAAT AATGGTATTT TCATACCCAA CTAATGAATT TACATTAATT
ACCCCCACTA ACCTCACAAT AACTTATATT TTACAATACT ACGTGCACGT TAACTCATTA
ATTCCACTTT TCGGATCAAT AAATGGGACA AACGCATCAA TTTCCTCTGG GTGGTACAAT
AAGAACACGG TTATACAAAT TTATAATATA ACGTTTTATC AAAGCAATGA AACTAGAGCG
GTAATTACTA AGATCTTACC CGCAAATAAA ATCCTCGTAA ATATGTCTTA TACAATAACT
GCAAATGAGT TAATACAATA TTATATTATT GTAAAATCAC AGATACCCAT TTATGCACTT
ATTAATGGAA CTAATAGTAC ATTAACCAGT AATTGGTATA ATATAGGTAC GAATATTAAT
ATAGAAAACA TAACATATTA TGGACAAAAT AACGAATATA GGTATGTAAT ATCTAATATT
TTACCTTCCA GAAACATTAC GGTTGAAAAT CCAATCACAA TATCAATCAT AACGGTAAAG
CAATACCCTC TGATTGTCTA TTCCAAGATA CCAGTTTACG CACTAGTCAA TGGTACTAAC
GAGACGTTAC AAAAATACTC TTGGTTCAAC GCTGGAAGTA GAATACAGAT AGAGAACATC
ACATACTATA TAAATAGTAC TGCAAGATTT CTCATGGAAA AAGTTTTACC ATATTCAAAC
TTTACATTAC TCCAACCAAT TAATATAACT ATAATTACAC TACCTCAATA TTTCTTAAAC
GTTAGTAGTA ACTACCCAGT CTATGTACTC TTCAACGGTA AAAACACCAC CTTAAGTAAC
GGATGGTATA ATAATGATAC TGAAATTGAA TTATATACAA TTTGGTATAT TAATCAAACT
GAAAGACAAA ACTTAGTCAA TATCAGTCTA AATGGAAAGC CAACTTCCAA TAATGTTATA
ATAGTTAATG GGCCAGTCTC GTTAAAACTG CACTATGTCC TACAATACTA TATAAACTTA
ATTTCCAACA TACCAATAAA AGCCTTAATC AATTCAACCT TAGTTACTTT TAGTCCAGGA
TGGTATAATG CGGTAACTCC TATTTCGTTC ATAAACATGA CGTATTATAT CTCCAACAAT
ACAAGATATA TAATATTATC CATATTACCA TTTAACTTTA CAGTAAATAG AAGCTTAACT
GTGAAAGTAA CGACGTTAAA GGAGTATCTG GTTACTGTGA ATGAGCCAAT ACTGATTACA
ATAGCAAATA GGACCCTAAA TACTAGTGAA ATTTGGGTAC CAGCTGGGCA AACTTTATTA
ATACCTAAAT ACGTAAATAT AAGCAATAAT GAAAGAATAT TCTATAATAC TTCTTCGTAT
TTAATAAATA TAACCCAACC AACTAGTATT AATGTTAAAC CAATAATAGA ATATTTAGTA
ACAATAGATG GAAATTCAGA ATGGTTACCA AGAGGAAGCG TAATAACGTT AACCCAATCC
GTGCCCATAT ATGAACAAGG CAAGTGGGAG GGCTCCTATA ACGTTAGTAA TGGTGTAGCA
ATAACCGTAA ATCAACCAAT AACTGAAACT TTCGTAAAGA ACATAAATGG GAGTTTTGTA
GGGAGTGTCA TAATACTAAT AGCAATTATA ATCATAGCAA TATTGTTCTT AACGATAAGA
AGGTCCAGAC CAAAGTTTTA A
 
Protein sequence
MILIYSTTEE SSVGKMLKRF CIKTKNMEWK VIIITILLII SLIPLTPVSH SNTKPLPLST 
LNSNDPGITF YDEQLFMALN AYPVSTNLVI YVKVIAQTSD SGYGPAYLLN AITNNDWWYQ
VGVAYNWPYT NGSFDPGFHM IYMVWNSSGD PVIGPVLLNF NGIVNSGDEM MLEIEISNGN
IILSAYDNST GAKAIAIVNA NGASSIVTDL NFLHGFITGL MTEWYHLKAY FGGELGVIYR
LQYLTTSFTL GINEIYLTSP PQGIATDEIA YSVSNNYTFY NLSYIGAFTA SNGYYFITGN
LYPIFLNYKV IGGSFNIPIN VSYFSDGLKE IARLPSLIFV NANSNISLPS IVLNGLSRIV
SLNSTPILAN RSGNITVYYQ LQYFININIP VNASINGIYT TLSSGWYNAS IKVVISPFTY
YPNDSRIMVF SYPTNEFTLI TPTNLTITYI LQYYVHVNSL IPLFGSINGT NASISSGWYN
KNTVIQIYNI TFYQSNETRA VITKILPANK ILVNMSYTIT ANELIQYYII VKSQIPIYAL
INGTNSTLTS NWYNIGTNIN IENITYYGQN NEYRYVISNI LPSRNITVEN PITISIITVK
QYPLIVYSKI PVYALVNGTN ETLQKYSWFN AGSRIQIENI TYYINSTARF LMEKVLPYSN
FTLLQPINIT IITLPQYFLN VSSNYPVYVL FNGKNTTLSN GWYNNDTEIE LYTIWYINQT
ERQNLVNISL NGKPTSNNVI IVNGPVSLKL HYVLQYYINL ISNIPIKALI NSTLVTFSPG
WYNAVTPISF INMTYYISNN TRYIILSILP FNFTVNRSLT VKVTTLKEYL VTVNEPILIT
IANRTLNTSE IWVPAGQTLL IPKYVNISNN ERIFYNTSSY LINITQPTSI NVKPIIEYLV
TIDGNSEWLP RGSVITLTQS VPIYEQGKWE GSYNVSNGVA ITVNQPITET FVKNINGSFV
GSVIILIAII IIAILFLTIR RSRPKF