Gene Information |
Locus tag | Ssol_0196 |
Symbol | |
ID | 0 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sulfolobus solfataricus 98/2 |
Kingdom | Archaea |
Replicon accession | CP001800 |
Strand | + |
Start bp | 168618 |
End bp | 170351 |
Gene Length | 1734 bp |
Protein Length | 577 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | |
Product | type II secretion system protein E |
Protein accession | ACX90492 |
Protein GI | 261600889 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.191118 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTAAAA TTTTCAAGTT CCCCTTACAG ATCGGTAGAG GTAGCGTTAA GCAATTACCT ATCACAGATC TTCCTATAAC GCTTTACCCA GTTACACCTT TGCCCGAAGA GGTTACAACG ATTGTCGCGG ATTATGAGGT TAATATCCTA AATTTAGTCC CGGAGGATAT TAAGTCAAAC CTAACTCGAA ATAATATTGA ACTGATATTA CCAAATCCTC ATGTCTTCAT TACTTTTGAT GAGAGAAAAG GTATCTACAA ATATGTTTTA TTAGAACCAC CGGTTAATGA AATGATCTAT AATATCTACA ATATATTTAT AGAGGAAGTG GAGAGAGAAC TGCTTTCTAA GAATCCCTCT TTAGATCTTG CAAAAATTAT ATTCGAACTG GATAAGAAAA GGTCAGGTCT TAAAATTATC CAAGAGAAGA GAGGAGATAT CTACGTTTTG AGTACAAATG CTAGAGTTAC TTTGTACTAT TTATTAAGAA ACATGTTCGG ATACAACGTA TTAACCCCAC TTGTAGCTGA TAAAAATATA GAAGATATTT CGGTTCCTGG TCTAAATAAT CCAGTCTATG TATATCATAG AAGTTATGAA TATATTCCAA CTAATATTAT ATTTACTAAG AACATGCAAG TATCTCCACA ACTTAATATA ATGATAGATG GTGAGGAACT GCTAGATCAA TTGGTTCTAA GAATGCTTTC TACTACAGGT AAGTCAATTT CTGTTGCTGA ACCAATACAA GACGGTATGT TACCAAATGG TGATAGGGTT GCCGCAACAT TTAGGCGCGA GGTATCAGCC AGTGGTTCTT CAGTAGTAAT AAGAAGATTT AGCGAAAGGC CTATCACAAT ACTAGGTTTA ATTAATTCTG GTACCCTATC TCCAGAACTA GCAGCATATC TATGGTATGG AATGGATCTG AGAATGAGTG TCATGTCAAT AGGAGTTACC GGGGCCGGAA AGACCACTTT ACTTAATGCA GTTCTAAATC TAGTAAAAGA AAGCATGAAG ATCGTCTCCA TAGAAGATAT TCCAGAAATT AGATTAGCCC ATACTAATTG GGTTCAGCTA TACGCTAGGC CAGCATATGC AGGAGTAGGT AAAGAGATTT CATTAATGGA TCTGCTAAAA TTATCCCTCA GATACAGGCC AGATATAATA GTTGTAGGTG AGATAAGAGG GCAAGAGGCT TACGTATTAT TCCAAGCGAC ATCAACTGGA CATGGAGGTG CTACGACATT CCACGCGTAT AATACCGACT CTGCAATAAA GAGGCTCATG AATGAGCCCC TAAATATTCC ACAAGAATGG ATACCTATGA TGAACATAAT AATGACAATT AGGAGGTTAC CAGTATATAT AGGAGAAAAG ATAGTCCTAA GAAGACGTGT TGTAGCAGTT GATGAAATAG TTAGTTGGAA CGACTATAGA AGGGTCTCGA GCTGGGATCC AAAAAGTGAT GCGTTTACAA TTAATCTAGA TGCTGCCAGA GTGTTAAAAA ATAGAATAGA GGAAGCTGGT CTTAATCTAG ATGACGTGAA AAGAGAAATG GAGAGAAGAG CATTATTCCT AAAGTTGTTA GCGTCTTCCA GAGAGATAAT ACAAAATGAG GAGAGTTATA AGCTTGTGAA GAGCTATATA ATAAAATACA GCTTAAAACC CGAAGAAGCT CTAAAAGAGG CTCAAGCAAT GGCTAGGACA AAAACTATAG AGTTAAAAGA ATAA
|
Protein sequence | MSKIFKFPLQ IGRGSVKQLP ITDLPITLYP VTPLPEEVTT IVADYEVNIL NLVPEDIKSN LTRNNIELIL PNPHVFITFD ERKGIYKYVL LEPPVNEMIY NIYNIFIEEV ERELLSKNPS LDLAKIIFEL DKKRSGLKII QEKRGDIYVL STNARVTLYY LLRNMFGYNV LTPLVADKNI EDISVPGLNN PVYVYHRSYE YIPTNIIFTK NMQVSPQLNI MIDGEELLDQ LVLRMLSTTG KSISVAEPIQ DGMLPNGDRV AATFRREVSA SGSSVVIRRF SERPITILGL INSGTLSPEL AAYLWYGMDL RMSVMSIGVT GAGKTTLLNA VLNLVKESMK IVSIEDIPEI RLAHTNWVQL YARPAYAGVG KEISLMDLLK LSLRYRPDII VVGEIRGQEA YVLFQATSTG HGGATTFHAY NTDSAIKRLM NEPLNIPQEW IPMMNIIMTI RRLPVYIGEK IVLRRRVVAV DEIVSWNDYR RVSSWDPKSD AFTINLDAAR VLKNRIEEAG LNLDDVKREM ERRALFLKLL ASSREIIQNE ESYKLVKSYI IKYSLKPEEA LKEAQAMART KTIELKE
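The coordinate and length fields in the record can be cross-checked with simple arithmetic. The sketch below is illustrative only and is not part of the database entry. It assumes 1-based inclusive coordinates and a CDS whose final codon is a stop codon (the gene sequence above ends in TAA). The function names `span_length`, `protein_length`, and `gc_fraction` are hypothetical helpers introduced for this example.

```python
# Values copied from the Gene Information table above.
START_BP = 168618
END_BP = 170351
GENE_LENGTH_BP = 1734
PROTEIN_LENGTH_AA = 577


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_length_bp: int) -> int:
    """Residues encoded by a CDS, assuming the last codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases; whitespace between blocks is ignored."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


# Both checks pass for the values listed in this record.
assert span_length(START_BP, END_BP) == GENE_LENGTH_BP
assert protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
```

Passing the full "Gene sequence" string to `gc_fraction` gives a value that can be compared against the listed GC content of 37%. The space-separated blocks of ten bases can be pasted in as they are.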