Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_4539 |
Symbol | ALS1.2 |
ID | 4837070 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009042 |
Strand | + |
Start bp | 1015371 |
End bp | 1016786 |
Gene Length | 1416 bp |
Protein Length | 472 aa |
Translation table | 12 |
GC content | 43% |
IMG OID | 640388385 |
Product | Agglutinin-like protein 1 precursor |
Protein accession | XP_001382423 |
Protein GI | 150863820 |
COG category | [R] General function prediction only |
COG ID | [COG3889] Predicted solute binding protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.340377 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.951225 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTTATAG TATTAGCAGT TGCTTTCCTT TTTCTGTGTG TAAGAGCTGC AGTTGTCTCA GGTGTTTTCA CCAGCTTCGA CTCTCTTGTT TTCCAGAACG GGGGTAACTA TCCATTTGAT GGACCAGCTA ATCCAAGTTG GATTGCCACT TTGAAATGGC AACTTGATGG CACTAAGGTT GCTCCTGGTG ACACATTCAC CTTAGACATG CCATGCACTT TCAAGTTCAC ACAAACTCCT GCTGATGCAC CAGTACTCCT TCAAGCTGGT GGAATCACTT ATGCTACATG TCAAACACTT GGTGGTGAAA TCATTGTTCC ATATTCTCAA TTACAATGTA CAGTTGAAAA TGCTGTTACC ACAAGCACCC TTGCCTCAGG ATCTGTTTAT TTTCCAGTTG TATTTAATAT TGGTGGAAGT GCTACCCCTG TAGATTTGAC CGATTCTAAA TGCTTTGCCA GTGGTGATAA TACTGTTACC TTCAATGATG GTGACACCAA ACTCTCTATT ACTGCAAACT TTGAAACCGG TTATCCTGCT TCTGGAGTCA ACCCCACTAA CATCATTTAC AGAAACAGGT TCCTCCCTCA GCTCGGTGAG AGCCAACACT TGTTAGTTGC TGGTCAATGT CCAAGAGGTT ACACTTCCGG TACTTTGGGA TTCTCTTTCA GTGGCGGCAA ATTAGATTGC TCTAGTGTTC ATGCTGCAAT TACCAACCAA TTGAATGATT GGTATTTCCC AACCGACGCA GAGACAGATT TTCTGTTCAC CTATACCTGC TCTGCTTCTG GCTACCAAAT AACATACAAG AATATTCCTG CCGGTTACAG ACCGTTTATA GATGGACTTA CGTCGGCTAC CGCAAATTTA CTTACTGTTT CTTACACTAA CAAGTTCGTT TGTGTCGGAT CCTCTATCAA TAATGACAAG AGCACCAAAG TTACATGGAG TTCTTATCAG AATTCCGATA GCGGAGGTGA TGGTCATGTC ATTGTTCTCA CTACCTCAAC TGGCACTGGA TCCAGTACCA CTGTGACTAC CGCTACTGGT AAAAGTACAA ATACTATTAT TGTAATTGTC CCAACCCCAA CTACAACAAT CACCCAAACT TACACCGGCA CCGTAACAAC CACCACCACC GTTACTGCCA CTTCCGGAGG CACAAACACT GTCATTGTGG AAGTTCCAAC TTCTTACCCA CCAAACCCAA CAACTACTGT GACATCGACT TGGACTGGAA CAGAAACTTC ATCCACCACT GTTACTGATA CACATGGCGG AACTGATACT ATCATAGTTG TGGTTCCTTC GAATCCAACC ACAACATTAA CATCCACTTG GACTGGAACA GAAACCTCTT CGACTACTGT CACTGACACT CAAGGTGGAA CTGATACTGT AATTGTTGTA GTCCCT
|
Protein sequence | MLIVLAVAFL FSCVRAAVVS GVFTSFDSLV FQNGGNYPFD GPANPSWIAT LKWQLDGTKV APGDTFTLDM PCTFKFTQTP ADAPVLLQAG GITYATCQTL GGEIIVPYSQ LQCTVENAVT TSTLASGSVY FPVVFNIGGS ATPVDLTDSK CFASGDNTVT FNDGDTKLSI TANFETGYPA SGVNPTNIIY RNRFLPQLGE SQHLLVAGQC PRGYTSGTLG FSFSGGKLDC SSVHAAITNQ LNDWYFPTDA ETDFSFTYTC SASGYQITYK NIPAGYRPFI DGLTSATANL LTVSYTNKFV CVGSSINNDK STKVTWSSYQ NSDSGGDGHV IVLTTSTGTG SSTTVTTATG KSTNTIIVIV PTPTTTITQT YTGTVTTTTT VTATSGGTNT VIVEVPTSYP PNPTTTVTST WTGTETSSTT VTDTHGGTDT IIVVVPSNPT TTLTSTWTGT ETSSTTVTDT QGGTDTVIVV VP
|
| |