Gene Information |
Locus tag | PICST_31759 |
Symbol | ALS6 |
ID | 4838784 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009044 |
Strand | - |
Start bp | 1623864 |
End bp | 1627022 |
Gene Length | 3159 bp |
Protein Length | 1052 aa |
Translation table | 12 |
GC content | 43% |
IMG OID | 640390099 |
Product | agglutinin-like protein 6 serine rich |
Protein accession | XP_001384616 |
Protein GI | 150865412 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
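The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the database record: it assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS length includes the terminal stop codon (the gene sequence in this record ends in TAA). The function names `gene_length` and `protein_length` are hypothetical helpers introduced only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    # Values taken from the Start bp and End bp fields of this record.
    length = gene_length(1623864, 1627022)
    print(length, protein_length(length))
```

Under these assumptions the results agree with the Gene Length (3159 bp) and Protein Length (1052 aa) fields. The strand does not affect the length calculation, only which strand of the replicon is read.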
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.607907 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTGCATTC TCGTCTTCGT CCTTATTACT TCTGTACTTG GTGCACAGTT GACGGATGTC TTTCAATCTC TAGAAATCAT CAACAATTCA GGGCTGAATC GTGCTCAGGA TATTCGTACT GCCAAACTTA CATGGAAAAT CGAAGCTGGT GATGCAGTTG AAGGTGACGA ATTCAGCTTG GAGATGCCAA ATGTGTTCAG AACAAAGTTT CCAGGAGACC AGTTGTATCT TGTTGCTGAC TATTCAATCT ATGCTCTGTG TGTTGCTGTT GATGGTTCTT ATCTTGCACA AAATTCTTAC TTGAATTGCA CGACCACTAG TTCTGTTGTC GAGTCTGATT TCAAGGCTAC GGGAACTCTC TCGTTTGATT TTGTGTTTAA TGCTGGAGGC TCTGGAAGTG AGATAGATAC CACTGCTGCT AGCATATTGG TCCCTGGAGA AAATAAAATA AATTGGAGTG GTTTGCAAAC TACTGTTAAT ATCGATGCTG GTCCCTTTTT TGCTCCTGTT AGCAATGATA AAGAACTTGT GTATTTCTCT CGTTCGACTC CTCAGATGTA CGAACAGATA TTCATGCTTG CCGGAGAATG TAATGGTGGT ATTGTCTCTG GGAGTATTGG TATGACCACC AACGATAGTC TAGATTGTAC TCAGTTTGCG TTGAAAGCAA CAAACAACTT AAATTCCTTC CTTTTGCCGG AAACTGCCAT CAATGTCCAA AACACCATAA CTTGCAAAGA ACAAAGTATC ACCTTCAAAT TTAATTCGGT TGCCAATAAC TACCGAGTCT TTCTCCAAGG TCTTGAGAAG TTTCCAACTA ACTCTGATGC TATTAGACAT ATATTTGCCT ACAGTATTCA ATGTGGAGAC GGTACAAAAA TTACAAAGCT GAGTGGCCAG GATTTCGTAG TTATTGACGG CTATGAAGAC AGTTCTGGAT CGGTAGAATA TTCAACAGTT TACACTACTA CTACCTGGAC AGAGACATAT TTAACAACGG TTACCATCCC TTGTACTGAT GAACTGGCTA CTGCCACCGT GATTGTCAAG GTTCCGACAT CTTGCTCTTC AGATTTAGAG CTGTCTACTC ACTGTCCAGG TTGTGAGTCT GAATCATCAA GTTCTGTAAG TTGTGATGAA CCTGAAATTT CATCAGATAC TTCCAGTCTG TTATCTACTA TTTATTGCGA TGAATCCTCT TCTAGCTCAG ATACATCCTC TCTTATTGAA AGCAGCTCCG ATATATACTC TTCCAGTTTG GCTGATACCT CTGTTTACAC AAGTTCTGAA TCCTCAACTA CTGAAGAATG CCCCGAGACC TCCTCTCTTT CTTCTACTGA ATCTTCTTCG TCTGAAGTAT CTTCAACAAC TGAGGAATGT ACTGAAACAC TGTCTTCGAG TATTGCTGAT TCATCAATCT ATACGAGCTC AGAATCTTCA ATTCTTTCTT CGCATGATGA ACTGTCCTCT ACAATTGAGA CTACTGATTC TATTTCTTCA GTTGAATCAT CTTCCTCTAT TCAGGAAACT TCGGAATTGA CATCTTCAAA GGAATCTTCT TCATCTGTTG AATCGACATC TTCAGTCGAA TTGTCCTCCT CCGTTCAGGC CACATCTTCG AAGGAATTAT CTTCTTCAGT TGAAACGACA TCTTCAGTAT TTACTTCTAC CGAGTCATTG TCTTCTGATG ATACGTCATC TTCTATCGAA ACAACTTCAA CTGTGAGTTC ATCCTCCTCA GAAATTACTA GCCCCTGTCT TCAATGCACC AGTTCTATTT CCAGTTCTAG TTCCGTTGAT 
GTTCCTAGTC CGTGGACAAG TAGTCTGGAA ACTGAGTCTT CTTCCTCATC AACCACTACC AGTTACACCA CAATCCCTTC CTCTAGCATT GAAGGTGCGC TGTCCTCTCC TTTTGTTTCT AGTGGATTGA CGAGCTCTGA ATCTACGCTG GTTCTGTCCA ACACTCCTCC CGAATACACT ATCACAATTA CCAACCGTGG AACTACAATT ATAACCATTG CAACTTGTCC TGGGGGGTGT ACAAGGACAA CGACGGTATT CCCCAGTGAG ACCACTACTA CTCTGATTGC AACTAGTACA GAAACATATT GCCCGGATAG TCTGACAGAA ATTGACAAGA GTTCAAGTGT TTTGAATACA TCTATAACTT CTACGACAGC TGAAACTACC GAAGAAACCA GCAAAGAGTC TACTTCAGAG ACATCCACCA ACGACAGTAC CATCACCAGC AAGACTACAA CCACATTGAT CAACACCAGT ACAGAAACAT ACTGTCCAGA AAGTCTGACG GAAACTGACA AGAGTTCAAA TGTTCTCGAC ACATCTATAA CTTCCACTAC TTCTACTAGT AGCAGTACGA AGGAAACCAC AGACTCTACC AAAGATAGTA CAAAGGCAAC TACGACAACT TCCACTATCA GTCTATCTAC GTCTGAAAGC ACCTCCTCCA GCAATACTGG CACTTTGAGC ACTTTTTCAA TCAGTACTTC TACTGGAAAT ATCTCATCCT CTATTAGCTA TACGGAGATT GTTAGTAGTC CTACGGAAAT AACTTCTGTC ACTACTGATT GTACTACTAA TTACATTTCC ACGACTATTA CTTGCTCCTC GTGTGAATCT AATATTGAGT CAACCTCAGA TAAGGTTTCC AAGCCTCCAG GGGGAACAAA TTACGATACG ACTAATTTTG CACCCACTGC GCCCGCGCCA AGATCCAGTG AAACTGGCCA ATTTCCCCTG CTGTCTGATA AGGCCAGCCA AGAAGAGCCA ATTCCACTGT CGTCCGGAAC GGCTGTCTGT GAAGGTGATT GCGATTTGAC TAGCCGTGTG GAATATGACA AATTGACCCC GACTCAATCC ACGACTCAGA CCACGACTCA CACAACCACC CAGACAACCA CTCAGTCTAC CACTCAATCC ACTCAATCTA CGTCCGGGAC CATTCCTGTG TCTGCGTCTG CGCAAAGTCT GTCTGCGCAG TCTTCAAAAG TTGCTGACTC TTCGACTTTC CACTTTAGTC TGTTCTCCAC CCTGTCTGTG CTACCTTCAG GATTACCCAT TCCCGTCGCG TTCGACTCGG CTGCTGCTCG TCCCGTGATC AGCCTTGTTG CTCTCATGAT GTCGCTTCTC TTCTTGTAA
|
Protein sequence | MCILVFVLIT SVLGAQLTDV FQSLEIINNS GSNRAQDIRT AKLTWKIEAG DAVEGDEFSL EMPNVFRTKF PGDQLYLVAD YSIYASCVAV DGSYLAQNSY LNCTTTSSVV ESDFKATGTL SFDFVFNAGG SGSEIDTTAA SILVPGENKI NWSGLQTTVN IDAGPFFAPV SNDKELVYFS RSTPQMYEQI FMLAGECNGG IVSGSIGMTT NDSLDCTQFA LKATNNLNSF LLPETAINVQ NTITCKEQSI TFKFNSVANN YRVFLQGLEK FPTNSDAIRH IFAYSIQCGD GTKITKSSGQ DFVVIDGYED SSGSVEYSTV YTTTTWTETY LTTVTIPCTD ESATATVIVK VPTSCSSDLE SSTHCPGCES ESSSSVSCDE PEISSDTSSS LSTIYCDESS SSSDTSSLIE SSSDIYSSSL ADTSVYTSSE SSTTEECPET SSLSSTESSS SEVSSTTEEC TETSSSSIAD SSIYTSSESS ILSSHDESSS TIETTDSISS VESSSSIQET SELTSSKESS SSVESTSSVE LSSSVQATSS KELSSSVETT SSVFTSTESL SSDDTSSSIE TTSTVSSSSS EITSPCLQCT SSISSSSSVD VPSPWTSSSE TESSSSSTTT SYTTIPSSSI EGASSSPFVS SGLTSSESTS VSSNTPPEYT ITITNRGTTI ITIATCPGGC TRTTTVFPSE TTTTSIATST ETYCPDSSTE IDKSSSVLNT SITSTTAETT EETSKESTSE TSTNDSTITS KTTTTLINTS TETYCPESST ETDKSSNVLD TSITSTTSTS SSTKETTDST KDSTKATTTT STISLSTSES TSSSNTGTLS TFSISTSTGN ISSSISYTEI VSSPTEITSV TTDCTTNYIS TTITCSSCES NIESTSDKVS KPPGGTNYDT TNFAPTAPAP RSSETGQFPS SSDKASQEEP IPSSSGTAVC EGDCDLTSRV EYDKLTPTQS TTQTTTHTTT QTTTQSTTQS TQSTSGTIPV SASAQSSSAQ SSKVADSSTF HFSSFSTSSV LPSGLPIPVA FDSAAARPVI SLVALMMSLL FL
|
| |