Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ssol_0993 |
Symbol | |
ID | 0 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sulfolobus solfataricus 98/2 |
Kingdom | Archaea |
Replicon accession | CP001800 |
Strand | + |
Start bp | 933781 |
End bp | 936690 |
Gene Length | 2910 bp |
Protein Length | 969 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | |
Product | conserved hypothetical protein |
Protein accession | ACX91237 |
Protein GI | 261601634 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.365998 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAAAAAGA GTTTGATAAT ATTACTTTTT GTAATATTAT CTCCTATAAC ATATTTAACA TTACCACTAT CCTCACAATC CACTCCTATT CAAGGATATG CGACAAGTAG TGAGTTGATA ACACCCGGCG AAATTGAGGT ACCAATAACC TTTCATTTGA TTAACTTAGG TCAAACCTTA ACTGAGGTTA CAATAACCCC TGCAGATACT TATCCGTTTT ACCTTTACCC CTATAATAAT GGAACTGAAC TTACACACAT ACCCTTGTGG AATCAAGGAC AGACAGTAAA TGTTACTTAC TTGTTTGATA TAGCTAGTAC TGCTAAGACT GGTACATATA CTGATGTAGT TGTAGTTCAA GGTATCACTA CCTCTGGTAC CCAAGTAACT TATGATGTTT TAGTACCAGT AGTAATAGCT GGCTATGTTA ATTTCTCCGC ATCATCAGTA TGGGGAACAA CATCAAACCC AATGGTAGTA GGACCAGGAG AAAACAACAT CCCACTAACA ATAATACTAC AAAACTTAGG TAATTCGTTA GTCACCAATA TAACATTAGA ACTAAACTCA CAGTTTCCAG TTGGATTCCT ACAAAATAAC GCTACAATTT CCGCAATACC AGCAGGATAT TATGGAGAGG TAACTGTAAT GACTTCAGTA TATCCTAATG CAACTGAAGG TTTGTATTAC ATCAAGCTAA ATGTAATATA CTATCATAAT GCGACTACTA CAGTATTAGT ACCTATTGAT ATAGGATCAT CTAATCAAGT GTCACTAGAG GACGCATGGG GTACACCATC AGACCCGATG GTAGCTGCGC CTGGTGAAAC TTTACTTCCC CTCACTATTT ACGTAAAGAA TCTTGGCGAG AACCTACTAT CCAATGTCAT GTTAATATTG CAATCTCACT ATCCAATTCA ATTCCTCCAA AATTACACAA TGATAGGTTT CGTACCAGCT GGAGGTTATA ATTACGTTAC AGTAGTAGCA AACGTCTATA AGAACGTAAC ACCTGGAGTA TATTACGTAC CAATAACCTT AGTGGCATAC GATGGAGGGT TCATGCAGAC TTTTGAAATG CCAGTCTACG TTTTAGGCTA TGTTAATTTC TCCGCATCAT CAGTATGGGG AACAACATCA AACCCAATGG TAGTAGGACC AGGAGAAAAC AACATCCCAC TAACAATAAT ACTACAAAAC TCTGGGATAG TTACTGTAAC TAACGCAACC TTGTTCCTAC AATCACAGTA TCCGGTACAG TTTCTACAGA ACAATGTAAC CCTTGGGAAC GTCCCAGCCG GCTATCCTAT ACCAGTAACA GTACTAGCAA ACGTTTATCC TAACGTAACA AATACCGGTG TTTACTACAT AACCGCAAAA GTGATGTATT ACGATGGTGT GATACAATAT GTAAAGGTCC CAATATATAT AGAGTCACTA AATCAAGTTT CGGTAGAGGG TATATGGGGT TCTTTATCTA ATCCAATACT GGTAGCTCCG GGTGAGAACA ATGTACCACT AACGCTAGTG ATAAAGAATC TTGGTGAAAA CCTACTATCA AACGTAAGCC TAATATTGCA ATCTCATTAT CCGATACAAT TCCTCCAGCA AAACGCCTCC GTTGGTTTCG TTCCCGCTGG TAGCTACAAT TACGTAACAG TGACGGCCAA CGTGTTTCCA AATGCAACTC CAGGAGTATA CTACATACCA GCAACCCTAG TGGCTTATGG TGGATTTAAG GAAAACATAA TGATAACTGT TGATATCCTA GGATATGTTA CAATACAAGC TCAAAGCTTG TGGGGAGAAG TAACATCCCC AATAACGGTC TCCTCAGGTG AGACTGATGT ACCTTTAACG GTTTTACTCA AAAACACTGG TGACGTTAAT ATTCTTAATG CTACCCTAGT TTTCCAGAAC GTAGAGTATC CATTAATCTT CCATCAGGCC ACTGCACAAA TCGGTATTGT ACCAGCTGGT CAAGAGAATT ATGCAACAGT TACTGTAAGT GTGTTTCCAA ATGCAACTCC AGGAGTATAC TACATACCAG CAACTCTATA TTACTTCAAT CATCAAACCA CAATAACAGT TCCAATAACA ATTTATTCAC CTAATATTTC AGTAAATTTA GTAACTATTC CACCTCAAGT ATTTCCTAGC TATTATGATG TTAGATTATT GGTAATTTTA ACAAACTTTG GAGGTGGTAT AGCGGAAAAC GCTAACGTGA GCATTCAATC ACCATTCCAA GTTATATCTT CTAATCCACT ACACTTAGGA GCACTACCAG TGGGAGTCCC CATCAATGCC ACATTTCTTA TTAATGTTCC AAATAATACA GTGCCTAAAA TCTACATTGT AAACATTACT ATTAACTATG ATGGTGGAAA AGAAACGTAT CAATATCCAT TACAGATATA TCCCAAGGCT AATCTGATAG TAGTAGGAGT TTCTTATCCT TCACTAAGTG CAGGAGACAG CAACGTGCCA ATTACAATAA CTCTTAAAAA CGCTGGAAAT TCGACAGCTA AGAACGTTAT AGTTAGGCTA GGAACTTCCA ATCTAATATA TCCTCATGTG AGTTCATCAA ACCCATTGCA AGCATTAACA GCATCTGAGG TATTCGCTGG AGATATTGCA CCCGGACAAG CAATAAATGT AACTTTTGTA GTCGATGTAA GTGGTGGAGC GTCTGCTGGA ACTTACCCAT TAGCAATTGC TTTGATATGG AATCAAACTG GTGCACTATT TCCATTTGAG CAGTCTGATA CATTCTACGT AACAATATCT CCTCCATTCT ATGAACAGTT CTTTAAATCA CCTATTGGCA TAATAACAAT TATAGTAATA ATAGTTATAA TTATAGTGAT TGCTGCCGTA CTCCTAAGAG TAAGAAATAA AAGAAGATAA
|
Protein sequence | MKKSLIILLF VILSPITYLT LPLSSQSTPI QGYATSSELI TPGEIEVPIT FHLINLGQTL TEVTITPADT YPFYLYPYNN GTELTHIPLW NQGQTVNVTY LFDIASTAKT GTYTDVVVVQ GITTSGTQVT YDVLVPVVIA GYVNFSASSV WGTTSNPMVV GPGENNIPLT IILQNLGNSL VTNITLELNS QFPVGFLQNN ATISAIPAGY YGEVTVMTSV YPNATEGLYY IKLNVIYYHN ATTTVLVPID IGSSNQVSLE DAWGTPSDPM VAAPGETLLP LTIYVKNLGE NLLSNVMLIL QSHYPIQFLQ NYTMIGFVPA GGYNYVTVVA NVYKNVTPGV YYVPITLVAY DGGFMQTFEM PVYVLGYVNF SASSVWGTTS NPMVVGPGEN NIPLTIILQN SGIVTVTNAT LFLQSQYPVQ FLQNNVTLGN VPAGYPIPVT VLANVYPNVT NTGVYYITAK VMYYDGVIQY VKVPIYIESL NQVSVEGIWG SLSNPILVAP GENNVPLTLV IKNLGENLLS NVSLILQSHY PIQFLQQNAS VGFVPAGSYN YVTVTANVFP NATPGVYYIP ATLVAYGGFK ENIMITVDIL GYVTIQAQSL WGEVTSPITV SSGETDVPLT VLLKNTGDVN ILNATLVFQN VEYPLIFHQA TAQIGIVPAG QENYATVTVS VFPNATPGVY YIPATLYYFN HQTTITVPIT IYSPNISVNL VTIPPQVFPS YYDVRLLVIL TNFGGGIAEN ANVSIQSPFQ VISSNPLHLG ALPVGVPINA TFLINVPNNT VPKIYIVNIT INYDGGKETY QYPLQIYPKA NLIVVGVSYP SLSAGDSNVP ITITLKNAGN STAKNVIVRL GTSNLIYPHV SSSNPLQALT ASEVFAGDIA PGQAINVTFV VDVSGGASAG TYPLAIALIW NQTGALFPFE QSDTFYVTIS PPFYEQFFKS PIGIITIIVI IVIIIVIAAV LLRVRNKRR
|
| |