Gene Information |
Locus tag | Ssol_0490 |
Symbol | |
ID | 0 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sulfolobus solfataricus 98/2 |
Kingdom | Archaea |
Replicon accession | CP001800 |
Strand | + |
Start bp | 438018 |
End bp | 440372 |
Gene Length | 2355 bp |
Protein Length | 784 aa |
Translation table | 11 |
GC content | 35% |
IMG OID | |
Product | Peptidase M1 membrane alanine aminopeptidase |
Protein accession | ACX90769 |
Protein GI | 261601166 |
COG category | |
COG ID | |
TIGRFAM ID | |
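The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS length includes the terminal stop codon; the record does not state either convention, although its numbers are consistent with both. The function names are hypothetical helpers, not part of any database API.

```python
# Consistency check for the coordinate and length fields of this record.
# Assumptions (not stated in the record): coordinates are 1-based and
# inclusive, and the CDS length counts the stop codon.

def span_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of an inclusive, 1-based coordinate span."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Values copied from the record above.
START_BP, END_BP = 438018, 440372
GENE_LENGTH_BP = 2355
PROTEIN_LENGTH_AA = 784

print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)                 # True
print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)    # True
```

Under these assumptions, 440372 - 438018 + 1 = 2355 bp, and 2355 / 3 = 785 codons, one of which is the stop codon, leaving 784 residues.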
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.559596 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCTAACA TCGAGAAATA CGAAATTTTT CTGGATTTTA ATGGGAATGA ATATGAGGGT GTAGAGAAAA TATACCTTAA CTCAGAAGAA GAAAAATTAG AGCTTGATAG TGTGAATCTC GAAATAAGAA GTGTGAAATC TGATGGTAAG GATACAAAAT TTGAATTGAA AGGTGAGAAA TTAGTAATCT ACGGAAAGAT TGAAAGAGAA CTTGAGATAA AGTTTAAAGG AAAAGCCTCT CGGGATTCAA TTTTAGGGAT TTACGTTGCT CCTTATGATG GTAAAGGAAT GATTACAACG CAATTTGAGG CAGTATACGC TAGGAGATTT ATCCCATGTT TTGATCATCC TGCAATGAAA GCTAGATTTA AACTAAGCGT TAGGGTACAA AAGGGACTAA AGGTAATATC TAATATGCCA GTTGAGAGAA TAGAAGAGGA TGTGGACGGT AAGGTAATTT ATCGTTTTCA AGAAACTCCT AAAATGTCTA CGTATCTATT ATACTTGGGA ATAGATGAAT TTGAGGAGAT TTCTGATAAC TCTAAACAAC CTACAGTAAT ATTAGCCACA GTACCCGGAA AATCAAAAAG AGGACTATTC GCAATTAACG TCGCCAGAAA GGTCATTGAG TTTTATGAAA AATACTTTGA AATACCTTAT CAGTTACCAA AAGTTCATTT AATACAAGTT CCCGAGTTTG CTGCTGGCGC TATGGAGAAT TGGGGTGCTA TTACCTTTAG GGAGACTGCT TTGCTGGCTG ATGATTCTTC TTCAATTTCT CAAAAGTTTA GAGTTGCTGA GGTTGTTGCT CATGAGTTGG CTCATCAGTG GTTTGGTAAT TTGGTTACTT TGAAGTGGTG GGATGATTTG TGGTTAAACG AGAGCTTCGC AACATTCATG AGCTATAAGA GTATAAAGCA TTTATTTCCC CAATGGGATA GTGAAGGTCA TCTTATTTAT GACGAATCTA TAGGTGCTTT AGAGGATGAC TCTCTTTCTA CTACACATCC AATAGAGGCA CATGTAAAAG ATCCCCATGA AATTGAACAA ATGTTTGATA ATATTAGTTA TGGTAAGGGG GCTAGTATTT TAAAGATGAT TGAGGCTTAT GTTGGTGAGG AGAATTTCAG AAGGGGGGTT GTCAATTACT TAAATTCTTT CAAATTCGGA AATGCAGAGG GTAAGGATTT GTGGAATTCT ATTTCTAACG CAGCTGGGCA GAGTATTGGA GAAATTATGG CTGATTGGAT TACAAAGCCT GGTTACCCTG TAATTTTTGT CAACGCATAC GGTAATTCTA TCAGGTTTTC TCAAAAAAGA TTTACACTTC TTGATAGCGG TTTAAATGAG GTTTACAAGG TTCCAATTAC ATATGAGATT AATGATAAAT TTGGCACTCT TCTTCTGGAC AAGGAATCAG CTGAAATAAG GTTAGATGAA GGTTTGAAGA GTATTAAGGT TAATATAAAT AGGACTGGAT TTTATAGGGT CCTTTATGAT TCTCTTAATC TAGCTTTCTC ATCAAAGCTT AATGCTTATG AAGAGTTGGG ATTGGTTAAC GACTATTGGA ATTTCCTATT GGCTGATCTA ATAGATGCGA AGACGTACTT TGGCGTAATT GGTAGGTTTG TATATACTTC TAACTCTTTT GTATCAAGAG AGATAACCTC TCAACTCTTA ACATTATATT ATCTATTTAA GAAAAATTAC GGGAAAGATT TCCTAGTTAA TCAAGTTAAG ATATTTAGAA AGGCTAATGA CGACCTAGGC AAATTAGCGT ATTCAACTGT TATCAGTGCC 
TTAGCTAGAA TGGATGAAGA GTTTGCATTA GGATTATCAA CTTTATTTGA TCAATATGAA AATATAGACA GTAATATTAA AGAAGCTGTT GCAATAGCTT ACGCAGTAAC TAATAACGAC TTCAATACTC TTCTAGAAAA GTACAAGAGG TATACAATAG ATGAGGAGAA GAATAGAATA TTAAGTGCAA TTTCATCACT TCGTGATCCA TCAATTGTAG TCAAGGTTTT CTCACTAATA TTTGAAAGGA ATATAAAGGC TCAAGATACT AGATTCGTTA TATCTTCACT GCTACACAAT CCTCATATAA GGGAGGAAGT ATGTAGTTAT CTTATGAACA ATTTTGAGGA AGTTAAGAAA TTCGTAAATA CAGTTTATGG AGGTCCTTGG GGTTTAGGCT CTATAGTTAG GAGTATGTCA TTCTGCGGTG TAGATAAGCC TAAAGACATT ATTGACTTTC TTGAAAAGGT GAAGTTCAAG GAGATCGAAA GACCTATTAA AGAGTCTGAA GAGAGGATAA AAGTATATTC TCGATTAAAA CAAAACCTAC CATAA
|
Protein sequence | MPNIEKYEIF LDFNGNEYEG VEKIYLNSEE EKLELDSVNL EIRSVKSDGK DTKFELKGEK LVIYGKIERE LEIKFKGKAS RDSILGIYVA PYDGKGMITT QFEAVYARRF IPCFDHPAMK ARFKLSVRVQ KGLKVISNMP VERIEEDVDG KVIYRFQETP KMSTYLLYLG IDEFEEISDN SKQPTVILAT VPGKSKRGLF AINVARKVIE FYEKYFEIPY QLPKVHLIQV PEFAAGAMEN WGAITFRETA LLADDSSSIS QKFRVAEVVA HELAHQWFGN LVTLKWWDDL WLNESFATFM SYKSIKHLFP QWDSEGHLIY DESIGALEDD SLSTTHPIEA HVKDPHEIEQ MFDNISYGKG ASILKMIEAY VGEENFRRGV VNYLNSFKFG NAEGKDLWNS ISNAAGQSIG EIMADWITKP GYPVIFVNAY GNSIRFSQKR FTLLDSGLNE VYKVPITYEI NDKFGTLLLD KESAEIRLDE GLKSIKVNIN RTGFYRVLYD SLNLAFSSKL NAYEELGLVN DYWNFLLADL IDAKTYFGVI GRFVYTSNSF VSREITSQLL TLYYLFKKNY GKDFLVNQVK IFRKANDDLG KLAYSTVISA LARMDEEFAL GLSTLFDQYE NIDSNIKEAV AIAYAVTNND FNTLLEKYKR YTIDEEKNRI LSAISSLRDP SIVVKVFSLI FERNIKAQDT RFVISSLLHN PHIREEVCSY LMNNFEEVKK FVNTVYGGPW GLGSIVRSMS FCGVDKPKDI IDFLEKVKFK EIERPIKESE ERIKVYSRLK QNLP
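The relationship between the gene and protein sequences can be illustrated with a short translation sketch. It uses only the first 30 and last 15 bases of the gene sequence above, with the grouping spaces removed, and builds the codon table by hand. Translation table 11 assigns the same amino acids to codons as the standard code and differs only in its permitted start codons, so the standard amino-acid string is used here. The function names are hypothetical; a library such as Biopython could be used instead.

```python
# Minimal sketch: translate fragments of the gene sequence and compute GC fraction.
# The codon table is the standard amino-acid assignment, which table 11 shares.

BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO)
)


def clean(seq: str) -> str:
    """Remove the grouping spaces and line breaks used in the record."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 0, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return sum(base in "GC" for base in dna) / len(dna)


# Fragments copied from the gene sequence in this record.
first_30 = "ATGCCTAACA TCGAGAAATA CGAAATTTTT"
last_15 = "CAAAACCTAC CATAA"

print(translate(first_30))   # MPNIEKYEIF
print(translate(last_15))    # QNLP
print(gc_fraction(first_30))
```

The two translated fragments match the start (`MPNIEKYEIF`) and end (`QNLP`) of the protein sequence listed above. Applying `gc_fraction` to the full gene sequence would give the value to compare against the reported GC content.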
|
| |