Gene Information |
Locus tag | Ssol_2044 |
Symbol | |
ID | 0 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sulfolobus solfataricus 98/2 |
Kingdom | Archaea |
Replicon accession | CP001800 |
Strand | + |
Start bp | 1833289 |
End bp | 1835232 |
Gene Length | 1944 bp |
Protein Length | 647 aa |
Translation table | 11 |
GC content | 36% |
IMG OID | |
Product | amino acid permease-associated region |
Protein accession | ACX92252 |
Protein GI | 261602649 |
COG category | |
COG ID | |
TIGRFAM ID | |
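
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 1833289
end_bp = 1835232

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# A CDS is read in triplets; the last codon is assumed to be a stop codon.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)      # 1944
print(protein_length)   # 647
```

Both results agree with the listed Gene Length (1944 bp) and Protein Length (647 aa).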
| |
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGTGAAA GAGAAGCCAA AGGAGCAAGA GATTTAGGTA TTTCTTCAGA TAAACAATTA AGGAGAAGCT TAGGAAAATT TGAGCTATTA TACCTCTCAT TAGGAGGAAT TATAGGATCT GGATGGTTAT TTGCATCTTT AAGTACAGCA GCTTATGCTG GCGGTGCAGC TATATTGAGC TGGATAATTG CTGGAATTTT AGTAATGTTT GTGGGTTTAG CTTATGCTGA AATAGGTGCA GCAATTCCAA AAAGTGGTGG AATAACAAGA TATCCGCATT ATACTCACGG AGGGCTAGTA GGGTATATAA TTACTTGGGC TAATTTCCTT TCCGCTGCAT CAGTGCCAGC TATTGAGGCA GCAGCAGCAA TAGAATATAT AGGATCTTAT TACCCTCAGC TAATAACTTC TGGAACTTTT GATGGAACTA CCGTAACAAT TTTGACACCA TTAGGCATAG GATTAGCTGG TCTATTGTTA ATTTTCTTCT TCTTTTTAAA TTACTTCGGA GTTAACATTT TAGGAAAAGT GACTCATGGT GCAGGTTGGT GGAAATTATT GGTGCCAACA ATAACTTTTT TAGCATTATT AGCGTTGGAC CTGCACTCAG CTAACTTTAC ATTAGGTGGA GGTTTCTTCC CATCTGCACA ATATGTAAAA GGAGGTTCCT CTGGAATTTA TGGTTTTAGT GCAGTTCTCT TTGCTATTCC TTCTACCGGA GTAATTTTCG CTTATCTAGG ATTTAGACAA GCAGTAGAAT ATGGGGGTGA AGGTAAAAAT CCCAGTAAAG ATATACCATT TGCGGTGTTA GGTTCATTAC TTATAGCTAT TGCGTTATTT ACGTTACTTC AAGTTAGCTT TATAGGAGGA ATAGATTGGA GCAAATTATA TCTTGTTAAT AAGACTACTG GTATATTAAT CCCCGTTGTA CCAGGGAATT GGTCAGCATT AAGTACAGCA GTCACAGCAT CTAACGTCTC AATATCTTCT GGCCCATTCT TAGTTCTAAC TCAGATTGCC CCAGTCTCTG GTCTAGCTGC AGCGTTTTTC ACGGCTTTAG CGGTTTTATT AACTATTGAT TCTGTAGTTT CACCATCCGG AACTGGATGG ATTTATACTG GAACTTCAAC TAGAACACTA TACGCATTTG CTAGTAATGG TTATTTACCA GAAATTTTCT TAAAGATTGG AAAGACTAAA ATACCAACTT ATTCACTAAT TGCAACATTA ATAGTAGGTT TCATATTCTT ACTACCATTT CCATCGTGGT ATGCTTTAGT AGGTTTCATA TCTTCAGCGA CAGTATTAAC TTACATTATG GGCGGAATTG GATTAGCAGT CTTAAGAAAA CACGCTAAGG AATTAAATAG ACCATTTAGA GTACCCGCAT CAGTAATAAT TGCACCAATA GCAACACTAG CAGCTGGTTT AATAGTTTAC TGGTCAAGTT TTGCTATTCT CTTTTACGTA TTTACTGGAA TATTCTTAGG TCTTCCATTA TTCTTTATAT TCTATTCCAA TAGAATATTA GGGATAAATA AGGCATATTC AATAGTTGTT GGCGTTATTA ACTTAGTAAT AGATCTAGTA ATGGCATTCT TGTTATTTGA TGAGACTAGC GGACTTGGTG CTGCTAACAA TCTCTTCTTC GGAATTTATA TAGTAGTTAT TGCAGCTATG CTAATTGGCG ATATGTTATT CTTGCTGAAA ACTGTACCTC CAGATGTTAA GAGAGAAATT AATGCCGGAT GGTGGTTAAT TTACTTTATC TTAGCAATTT ACATAATATC ATATTTTGGA 
GGATTTGGGC TATATACTAT AATACCATTT CCATATGATA CTATAGTTGC TGCTATTGTA ATACTCATTG GATACTTCTG GGCTATAAGA AGTGGTTTCA GAACTCAAGC AATTCAAGAT ATAATTCAGG CAACTAGAGA ATAA
|
Protein sequence | MGEREAKGAR DLGISSDKQL RRSLGKFELL YLSLGGIIGS GWLFASLSTA AYAGGAAILS WIIAGILVMF VGLAYAEIGA AIPKSGGITR YPHYTHGGLV GYIITWANFL SAASVPAIEA AAAIEYIGSY YPQLITSGTF DGTTVTILTP LGIGLAGLLL IFFFFLNYFG VNILGKVTHG AGWWKLLVPT ITFLALLALD LHSANFTLGG GFFPSAQYVK GGSSGIYGFS AVLFAIPSTG VIFAYLGFRQ AVEYGGEGKN PSKDIPFAVL GSLLIAIALF TLLQVSFIGG IDWSKLYLVN KTTGILIPVV PGNWSALSTA VTASNVSISS GPFLVLTQIA PVSGLAAAFF TALAVLLTID SVVSPSGTGW IYTGTSTRTL YAFASNGYLP EIFLKIGKTK IPTYSLIATL IVGFIFLLPF PSWYALVGFI SSATVLTYIM GGIGLAVLRK HAKELNRPFR VPASVIIAPI ATLAAGLIVY WSSFAILFYV FTGIFLGLPL FFIFYSNRIL GINKAYSIVV GVINLVIDLV MAFLLFDETS GLGAANNLFF GIYIVVIAAM LIGDMLFLLK TVPPDVKREI NAGWWLIYFI LAIYIISYFG GFGLYTIIPF PYDTIVAAIV ILIGYFWAIR SGFRTQAIQD IIQATRE
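
The relationship between the two sequence fields can be illustrated with a short translation routine. This is a sketch rather than the tool that produced the record: it assumes translation table 11 assigns the same amino acids to codons as the standard genetic code (the tables differ in permitted start codons), and the helper names `translate` and `gc_percent` are hypothetical.

```python
# Standard codon table in NCBI order (bases T, C, A, G at each position).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def clean(dna: str) -> str:
    """Remove the spacing used in the record and normalise to upper case."""
    return "".join(dna.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        index = (16 * BASES.index(codon[0])
                 + 4 * BASES.index(codon[1])
                 + BASES.index(codon[2]))
        amino_acid = AMINO_ACIDS[index]
        if amino_acid == "*":  # stop codon
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First 30 bases and last 12 bases of the gene sequence in this record.
print(translate("ATGGGTGAAA GAGAAGCCAA AGGAGCAAGA"))  # MGEREAKGAR
print(translate("ACTAGAGAATAA"))                      # TRE
```

The two printed fragments match the start (MGEREAKGAR) and end (TRE) of the listed protein sequence. Applying `gc_percent` to the full gene sequence gives a value that can be compared with the listed GC content of 36%.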