Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_31095 |
Symbol | ALS2 |
ID | 4837691 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009043 |
Strand | - |
Start bp | 1750762 |
End bp | 1755120 |
Gene Length | 4359 bp |
Protein Length | 1452 aa |
Translation table | 12 |
GC content | 44% |
IMG OID | 640389006 |
Product | agglutinin-like protein 2 |
Protein accession | XP_001383953 |
Protein GI | 150864934 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGATCT CACTCAAACT AGTGATAAGA TCTTTACTTT TTGTCGCATC GGCTTTGGCC GCTGACGATC TAGTTATAAG CCAAAATACA ATTGTTAGCA CTGGTGATAC GTTAATCAGA CAAAACTTCA TTGTAAACAG TGGGGTTTTC TATTCAATTG ATTATGGTAT GACCCACAAC TTTTATAATG ATATTACAAT TAACGGCAAA CTCTATATTA CCAACAAGAT TGTTAGGACA GGTATGACTT GCGATGTAAT TGGTACTACC GCGAATATTG TCAACAACGG GTGGATAGTT TTAGATGATA CCAATGCTAC CTCGGCACCA ACTTATGATT GGTATGGTGG TTCCTTCGAG AACAACGGTA TGGTGTGGTT TGCTGGAATT GGTAATACCG GTGGGTCAAC ATTTGCAATC CAACCAAAGG GTTCTTTTAT TAATACTGGT ACCATCATTT TATATCAAAC CGTTAGAAGA TCTGGTGGAA CTTCTCACCT TGGTTTGGAT GGTAAAACCA TTACTAATGA CGGAACTGTT TGTATCTATC AAAACATTTT TTTCCAAGGT TCCACAGTGG AAGGTAATGG ATGTTTCGAT GTCGGCCTTG ACTCAAACTT CTGGGCCACT AATGTTAATT CAAGACCTAT GGAAGAAGGA CAACTTATTT ACTTGAGTAC TAGTACTTCA AGCTTAAGAA TTGATACGTA TCCTCCCAAT ATCCCTATCC ACATAGCCGG ATGGGGTAAT AACAATGTTA TTGGACTTAG CACTGCTATC AACAGCTTTG ATTACGATGG AAACAACTTG AGCATTAGAA GTGGAAGTTA TACCTATAAA CTTGTTATTG GTCCAGGTTA TGATCCCTCA TTAATAAGCA TTGGTTCCGC AGCATACGGT AGCGGTGTTG GTACAATATC AAGGGCCGGT ATCTTATACT CTGGTCCACC ACCAGACGCA AGCAGACCCT CTCAATGTAA TGAGTGTCCT TCCCCTCCTC GTGCTCCGAT ATCAACTGAA ACTCCTAACC CAACCACCAC TGTTACTTCT ACGTGGACTG GTACTTTCAC CACCACGGAA ACATATACTG ATACTCAGGG TGGCACTGAC ACAATCGTCA TTGTGGTTCC AACAGAGTAT TCCTCTACCG AAGAATCCAC TACTGAAGAA CCCACTACCG AAGAATCATC TACTGAAGAA TCGACTACCG AAGAACCCAC TACTGAAGAA TCATCGACTG AAGTCTTGCC AAACGAAACT ACCACTATTA CTTCTACGTG GACTGGTACT TTCACCACCA CGGAAACATA TACTGATACT CAGGGTGGTA CTGACACAAT CGTCATTGTG GTTCCAACAG AGTATTCCTC TACCGAAGAA TCCACTACTG AAGAACCCAC TACCGAAGAA TCATCGACTG AAGTCTTGCC AAACGAAACT ACCACTATTA CTTCTACGTG GACTGGTACT TTCACCACCA CGGAAACATA TACTGATACT CAGGGTGGCA CTGACACAAT CGTCATTGTG GTTCCAACAG AGTATTCCTC TACCGAAGAA TCCACTACTG AAGAACCCAC TACCGAAGAA TCATCTACTG AAGAATCGAC TACCGAAGAA CCCACTACCG AAGAATCATC GACTGAAGTC TTGCCAAACG AAACTACCAC TATTACTTCG ACGTGGACTG GTACTTTCAC CACCACAGAA ACATATACTG ATACTCAGGG TGGCACTGAC ACAATCGTCA TTGTGGTTCC AACAGAGTAT TCCTCTACCG AAGAATCCAC TACTGAAGAA CCCACTACCG AAGAATCATC TACTGAAGAA TCGACTACCG AAGAACCCAC TACTGAAGAA TCATCGACTG AAGTCTTGCC AAACGAAACT ACCACTATTA CTTCGACGTG GACTGGTACT TTCACCACCA CAGAAACATA TACTGATACT CAGGGTGGCA CTGACACAAT CGTCATTGTG GTTCCAACAG AGTATTCCTC TACCGAAGAA TCCACTACTG AAGAACCCAC TACCGAAGAA TCCACTACTG AAGAACCCAC TACTGAAGAA TCATCGACTG AAGTCTTGCC AAACGAAACT ACCACTATTA CTTCTACGTG GACTGGTACT TTCACCACCA CGGAAACATA TACTGATACT CAGGGTGGCA CTGACACAAT CGTCATTGTG GTTCCAACAG AGTATTCCTC TACCGAAGAA CCCACTACCG AAGAATCATC GACTGAAGTC TTACCAAACG AAACTACCAC TATTACTTCT ACGTGGACTG GTACTTTCAC CACCACGGAA ACATATACTG ATACTCAGGG TGGTACTGAC ACAATCGTCA TTGTGGTTCC AACAGAGTAT TCCTCTACCG AAGAATCCAC TACCGAAGAA TCCTCTACTG AAGTCTTGCC AAACGAAACT ACCACTATTA CTTCTACGTG GACTGGTACT TTCACCACCA CAGAAACATA TACTGATACT CAGGGTGGCA CTGACACAAT CGTCGTTGTA GTTCCAACAG AATACTCCTC TGCTGAAGTC TCACCTAACC CAACTACCAC TGTTACTTCC TCGTGGACTG GTACTTTCAC CACCACAGAA ACATATACTG ATACTCAGGG TGGTACTGAC ACAATCGTCG TTGTAGTTCC AACAGAATAC TCCTCTGCTG AAGTCTCACC TAACCCAACT ACCACTATTA CTTCGACATG GACTGGTACT TTCACCACCA CAGAAACATA TACTGATACT CAGGGTGGCA CTGACACAAT CGTCGTTGTA GTTCCAACAG AATACTCCTC TGCTGAAGTC TCACCTAACC CAACTACCAC TATTACTTCC TCGTGGACTG GTACTTTCAC CACCACAGAA ACATATACTG ATACTCAGGG TGGCACTGAC ACAATCGTCG TTGTAGTTCC AACAGAATAC TCCTCCGCTG AAGTCTCACC TAACCCAACT ACCACTATTA CTTCTACGTG GACTGGTACT TTCACCACCA CAGAAACATA TACTGATACT CAGGGTGGCA CTGACACAAT CGTCGTTGTA GTTCCAACAG AATACTCCTC TGCTGAAGTC TCACCTAACC CAACTACCAC TATTACTTCC TCGTGGACTG GTACTTTCAC CACCACAGAA ACATATACTG ATACTCAGGG TGGTACTGAC ACAATCGTCG TTGTAGTTCC AACAGAATAC TCCTCTGCTG AAGTCTCACC TAACCCAACT ACCACTATTA CTTCGACATG GACTGGTACT TTCACCACCA CAGAAACATA TACTGATACT CAGGGTGGCA CTGACACAAT CGTCGTTGTA GTTCCAACAG AATACTCCTC TGCTGAAGTC TCACCTAACC CAACTACCAC TATTACTTCG ACATGGACTG GTACTTTCAC CACCACAGAA ACATATACTG ATACTCAGGG TGGCACTGAC ACAATCGTCG TTGTAGTTCC AACAGAATAC TCCTCCGCTG AAGTCTCACC TAACCCAACT ACCACTATTA CTTCCTCGTG GACTGGTACT TTCACCACCA CAGAAACATA TACTGATACT CAGGGTGGTA CTGACACAAT CGTCGTTGTA GTTCCAACAG AATACTCCTC TGCTGAAGTC TCACCTAACC CAACTACCAC TATTACTTCG ACATGGACTG GTACTTTCAC CACCACAGAA ACATATACTG ATACTCAGGG TGGCACTGAC ACAATCGTCG TTGTAGTTCC AACAGAGTAT TCTCCCGTTG AAGATTACCC TACTGAAGTC TCACCTAACC CAACTACCAC TATAACTTCT TTGTGGACTG GTACTTTCAC CACTACAAAA ACGTTTACTT ACACTCAAGG AGGATCCGTG ACTGTCATTG TCGCAGTTCC AACAGAGTAT TCTTCTGTTG GCGGTTCCTC TGTCGAGCTA CCAGCTGATA TAACATTGGG TCAATCATCT GCCGTGGTGA TTAATCCTGG CGTAGATCTT GAATCTGAAA CCGGGCCCGC TGCTGAGTTT AGTAAGCACA TCAGTGATCA TGTGCAATCT CGCTCAATCC CTGAAGAGTG GTTCACTACT ACGGTTACAA CAACAGGTCC AAATGGCGAA GTTTCGACTT ATACAACAGC CGACACTTCA GGTTTTCAAA CAAATGGCGT GGTTCCCGCT CCATCTAGTA GTTCCACCTA CGACTCCACT GGTTCTTCTA ATTCTGATTC TCTGGCTGAA GATTTAAAGG ATGAAACAGA TTTGTCATCT TCAGTAGATG AGTACGAGGG ATCTGGTGCA ACCCTCATTG GAGCAAGTCT GGTTTACTTT GCTTCAATCT TGTGTCTTAT TTTCTCCCTC TACGCTTGA
|
Protein sequence | MKISLKLVIR SLLFVASALA ADDLVISQNT IVSTGDTLIR QNFIVNSGVF YSIDYGMTHN FYNDITINGK LYITNKIVRT GMTCDVIGTT ANIVNNGWIV LDDTNATSAP TYDWYGGSFE NNGMVWFAGI GNTGGSTFAI QPKGSFINTG TIILYQTVRR SGGTSHLGLD GKTITNDGTV CIYQNIFFQG STVEGNGCFD VGLDSNFWAT NVNSRPMEEG QLIYLSTSTS SLRIDTYPPN IPIHIAGWGN NNVIGLSTAI NSFDYDGNNL SIRSGSYTYK LVIGPGYDPS LISIGSAAYG SGVGTISRAG ILYSGPPPDA SRPSQCNECP SPPRAPISTE TPNPTTTVTS TWTGTFTTTE TYTDTQGGTD TIVIVVPTEY SSTEESTTEE PTTEESSTEE STTEEPTTEE SSTEVLPNET TTITSTWTGT FTTTETYTDT QGGTDTIVIV VPTEYSSTEE STTEEPTTEE SSTEVLPNET TTITSTWTGT FTTTETYTDT QGGTDTIVIV VPTEYSSTEE STTEEPTTEE SSTEESTTEE PTTEESSTEV LPNETTTITS TWTGTFTTTE TYTDTQGGTD TIVIVVPTEY SSTEESTTEE PTTEESSTEE STTEEPTTEE SSTEVLPNET TTITSTWTGT FTTTETYTDT QGGTDTIVIV VPTEYSSTEE STTEEPTTEE STTEEPTTEE SSTEVLPNET TTITSTWTGT FTTTETYTDT QGGTDTIVIV VPTEYSSTEE PTTEESSTEV LPNETTTITS TWTGTFTTTE TYTDTQGGTD TIVIVVPTEY SSTEESTTEE SSTEVLPNET TTITSTWTGT FTTTETYTDT QGGTDTIVVV VPTEYSSAEV SPNPTTTVTS SWTGTFTTTE TYTDTQGGTD TIVVVVPTEY SSAEVSPNPT TTITSTWTGT FTTTETYTDT QGGTDTIVVV VPTEYSSAEV SPNPTTTITS SWTGTFTTTE TYTDTQGGTD TIVVVVPTEY SSAEVSPNPT TTITSTWTGT FTTTETYTDT QGGTDTIVVV VPTEYSSAEV SPNPTTTITS SWTGTFTTTE TYTDTQGGTD TIVVVVPTEY SSAEVSPNPT TTITSTWTGT FTTTETYTDT QGGTDTIVVV VPTEYSSAEV SPNPTTTITS TWTGTFTTTE TYTDTQGGTD TIVVVVPTEY SSAEVSPNPT TTITSSWTGT FTTTETYTDT QGGTDTIVVV VPTEYSSAEV SPNPTTTITS TWTGTFTTTE TYTDTQGGTD TIVVVVPTEY SPVEDYPTEV SPNPTTTITS LWTGTFTTTK TFTYTQGGSV TVIVAVPTEY SSVGGSSVEL PADITLGQSS AVVINPGVDL ESETGPAAEF SKHISDHVQS RSIPEEWFTT TVTTTGPNGE VSTYTTADTS GFQTNGVVPA PSSSSTYDST GSSNSDSSAE DLKDETDLSS SVDEYEGSGA TLIGASSVYF ASILCLIFSL YA
|
| |