Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_4391 |
Symbol | ALS1 |
ID | 4836912 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009042 |
Strand | - |
Start bp | 313437 |
End bp | 314849 |
Gene Length | 1413 bp |
Protein Length | 471 aa |
Translation table | 12 |
GC content | 42% |
IMG OID | 640388227 |
Product | Agglutinin-like protein 1 precursor |
Protein accession | XP_001382823 |
Protein GI | 150864119 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.065918 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGATGCTTA GAACAATAAT TTTTCTTTTC TTTTACCTGA TTGTAAGAGC TGCTGAAATA ACTGGTGTAT TCACTAGTTT TGACTCTCTC GTTTTCCAGA ATGGTGGAAC CTATCCTTAT GAAGCACCTG CAAACCCTAG CTGGATTGCT ACCTTAAGCT GGGCAATGGA TGGCAATGTA GTTGCTCCTG GAGACACATT TACTTTACAT ATGCCATGCA CTTTTAAGCT TACAACCGAT AATACAATCA TTCTAGGAGC AGAAGGTACG ACATACGCTA CATGTCTGCA TTTTTCAGGA GAAATTATTG TTCCTTATTC TGAATTGTAT TGTACAATTA CAGATTCATT ACTACCTGGT ATGACAGTAG AAGGTTCCAC TTACTTCCCT GTTGTATTCA ATACTGGCGG AAGTGCATTG GCCAGCGATT TGGAAGACTC TACATGTTTC ACGAATGGGG AGAACACTGT CTCCTTCTAC GATGGAGATA CACAACTTTC CACCAGTGTT GTTTTCGAGG CAGGGCCACC AAGTGCCAAT ATAAATCCAG ACATTGTTAT ATTTAGAGCT AGAGTGCTTC CTCAACTTAA TAGCAGCCAA CATTATGCTG CCATTGGTTG GTGTCCTAAT GGTTACACCT CGGGTACCTT GGGGTTTACT TTCACTGGTG GAACTCTTGA TTGTGATTCA GTCCATGCCT CAATTACCAA CCAGTTGAAT GACTTCCATT ATCCAACCAA TGCCGACCCC AATTTTCTGT ATACTTTCAC TTGTTCACCT ACATCCTATC AGATCAATTT TGAAAATATT CCAGCTGGAT ATAGACCATT TATCGATGGT CTTACCAGGC CATCAGGCCC CGCTCTTTCT GTCTTTTATA CCAATCGCTA TCTATGTGCT GATGAGACGA CATTTAGACA GCAAAATCTT AATGTAAATT GGGGTGAATA TGCAGATGGT CCGACTAGTG GTAACGGCAA TGTCATTGTG GTCACAACTT CTACATACAG TGGTTCACAT ACCCAGATAT CCACTGCCAC AGGCGAATCA ACGAACACTA TTATTGTTTA TGTACCTACT CCAACAGTTA CAATAACAGA GACATGGACT GGGCGCGAAA CAAGCACGAC CACTGTATTT CCAACATCAG AAGGAGGTAC TGTAATAGTT ATTATCGACT TACCAACGGA TCCTCCACCT AACCCAACCA CCACTATCAC ATCCGTATGG ACAGGTACTG ATACTACTAC AGTCACAGTA ACTGACACAG AAGGAGGAAC TGATACAGTA ATTGTTCAAG TTCCTGCAAA CCCAACCACC ACTATCACAT CTGTCTGGAC TGGAACTGAT ACTACTACAA TTACGGTGAC TGACACAGAA GGAGGCACCG ATACTGTCAT TGTTCAAGTT CCT
|
Protein sequence | VMLRTIIFLF FYSIVRAAEI TGVFTSFDSL VFQNGGTYPY EAPANPSWIA TLSWAMDGNV VAPGDTFTLH MPCTFKLTTD NTIILGAEGT TYATCSHFSG EIIVPYSELY CTITDSLLPG MTVEGSTYFP VVFNTGGSAL ASDLEDSTCF TNGENTVSFY DGDTQLSTSV VFEAGPPSAN INPDIVIFRA RVLPQLNSSQ HYAAIGWCPN GYTSGTLGFT FTGGTLDCDS VHASITNQLN DFHYPTNADP NFSYTFTCSP TSYQINFENI PAGYRPFIDG LTRPSGPALS VFYTNRYLCA DETTFRQQNL NVNWGEYADG PTSGNGNVIV VTTSTYSGSH TQISTATGES TNTIIVYVPT PTVTITETWT GRETSTTTVF PTSEGGTVIV IIDLPTDPPP NPTTTITSVW TGTDTTTVTV TDTEGGTDTV IVQVPANPTT TITSVWTGTD TTTITVTDTE GGTDTVIVQV P
|
| |