Gene PICST_31759 details


Gene Information

Locus tag: PICST_31759
Symbol: ALS6
ID: 4838784
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009044
Strand:
Start bp: 1623864
End bp: 1627022
Gene Length: 3159 bp
Protein Length: 1052 aa
Translation table: 12
GC content: 43%
IMG OID: 640390099
Product: agglutinin-like protein 6 serine rich
Protein accession: XP_001384616
Protein GI: 150865412
COG category:
COG ID:
TIGRFAM ID:
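
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive and that the gene length includes a single stop codon; neither convention is stated in the record, and the function names are illustrative only.

```python
def inclusive_length(start_bp: int, end_bp: int) -> int:
    """Feature length for 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose length includes one stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1  # drop the stop codon


gene_len = inclusive_length(1623864, 1627022)
print(gene_len)                  # 3159
print(protein_length(gene_len))  # 1052
```

Under these assumptions the results agree with the Gene Length (3159 bp) and Protein Length (1052 aa) fields.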


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.607907
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTGCATTC TCGTCTTCGT CCTTATTACT TCTGTACTTG GTGCACAGTT GACGGATGTC 
TTTCAATCTC TAGAAATCAT CAACAATTCA GGGCTGAATC GTGCTCAGGA TATTCGTACT
GCCAAACTTA CATGGAAAAT CGAAGCTGGT GATGCAGTTG AAGGTGACGA ATTCAGCTTG
GAGATGCCAA ATGTGTTCAG AACAAAGTTT CCAGGAGACC AGTTGTATCT TGTTGCTGAC
TATTCAATCT ATGCTCTGTG TGTTGCTGTT GATGGTTCTT ATCTTGCACA AAATTCTTAC
TTGAATTGCA CGACCACTAG TTCTGTTGTC GAGTCTGATT TCAAGGCTAC GGGAACTCTC
TCGTTTGATT TTGTGTTTAA TGCTGGAGGC TCTGGAAGTG AGATAGATAC CACTGCTGCT
AGCATATTGG TCCCTGGAGA AAATAAAATA AATTGGAGTG GTTTGCAAAC TACTGTTAAT
ATCGATGCTG GTCCCTTTTT TGCTCCTGTT AGCAATGATA AAGAACTTGT GTATTTCTCT
CGTTCGACTC CTCAGATGTA CGAACAGATA TTCATGCTTG CCGGAGAATG TAATGGTGGT
ATTGTCTCTG GGAGTATTGG TATGACCACC AACGATAGTC TAGATTGTAC TCAGTTTGCG
TTGAAAGCAA CAAACAACTT AAATTCCTTC CTTTTGCCGG AAACTGCCAT CAATGTCCAA
AACACCATAA CTTGCAAAGA ACAAAGTATC ACCTTCAAAT TTAATTCGGT TGCCAATAAC
TACCGAGTCT TTCTCCAAGG TCTTGAGAAG TTTCCAACTA ACTCTGATGC TATTAGACAT
ATATTTGCCT ACAGTATTCA ATGTGGAGAC GGTACAAAAA TTACAAAGCT GAGTGGCCAG
GATTTCGTAG TTATTGACGG CTATGAAGAC AGTTCTGGAT CGGTAGAATA TTCAACAGTT
TACACTACTA CTACCTGGAC AGAGACATAT TTAACAACGG TTACCATCCC TTGTACTGAT
GAACTGGCTA CTGCCACCGT GATTGTCAAG GTTCCGACAT CTTGCTCTTC AGATTTAGAG
CTGTCTACTC ACTGTCCAGG TTGTGAGTCT GAATCATCAA GTTCTGTAAG TTGTGATGAA
CCTGAAATTT CATCAGATAC TTCCAGTCTG TTATCTACTA TTTATTGCGA TGAATCCTCT
TCTAGCTCAG ATACATCCTC TCTTATTGAA AGCAGCTCCG ATATATACTC TTCCAGTTTG
GCTGATACCT CTGTTTACAC AAGTTCTGAA TCCTCAACTA CTGAAGAATG CCCCGAGACC
TCCTCTCTTT CTTCTACTGA ATCTTCTTCG TCTGAAGTAT CTTCAACAAC TGAGGAATGT
ACTGAAACAC TGTCTTCGAG TATTGCTGAT TCATCAATCT ATACGAGCTC AGAATCTTCA
ATTCTTTCTT CGCATGATGA ACTGTCCTCT ACAATTGAGA CTACTGATTC TATTTCTTCA
GTTGAATCAT CTTCCTCTAT TCAGGAAACT TCGGAATTGA CATCTTCAAA GGAATCTTCT
TCATCTGTTG AATCGACATC TTCAGTCGAA TTGTCCTCCT CCGTTCAGGC CACATCTTCG
AAGGAATTAT CTTCTTCAGT TGAAACGACA TCTTCAGTAT TTACTTCTAC CGAGTCATTG
TCTTCTGATG ATACGTCATC TTCTATCGAA ACAACTTCAA CTGTGAGTTC ATCCTCCTCA
GAAATTACTA GCCCCTGTCT TCAATGCACC AGTTCTATTT CCAGTTCTAG TTCCGTTGAT
GTTCCTAGTC CGTGGACAAG TAGTCTGGAA ACTGAGTCTT CTTCCTCATC AACCACTACC
AGTTACACCA CAATCCCTTC CTCTAGCATT GAAGGTGCGC TGTCCTCTCC TTTTGTTTCT
AGTGGATTGA CGAGCTCTGA ATCTACGCTG GTTCTGTCCA ACACTCCTCC CGAATACACT
ATCACAATTA CCAACCGTGG AACTACAATT ATAACCATTG CAACTTGTCC TGGGGGGTGT
ACAAGGACAA CGACGGTATT CCCCAGTGAG ACCACTACTA CTCTGATTGC AACTAGTACA
GAAACATATT GCCCGGATAG TCTGACAGAA ATTGACAAGA GTTCAAGTGT TTTGAATACA
TCTATAACTT CTACGACAGC TGAAACTACC GAAGAAACCA GCAAAGAGTC TACTTCAGAG
ACATCCACCA ACGACAGTAC CATCACCAGC AAGACTACAA CCACATTGAT CAACACCAGT
ACAGAAACAT ACTGTCCAGA AAGTCTGACG GAAACTGACA AGAGTTCAAA TGTTCTCGAC
ACATCTATAA CTTCCACTAC TTCTACTAGT AGCAGTACGA AGGAAACCAC AGACTCTACC
AAAGATAGTA CAAAGGCAAC TACGACAACT TCCACTATCA GTCTATCTAC GTCTGAAAGC
ACCTCCTCCA GCAATACTGG CACTTTGAGC ACTTTTTCAA TCAGTACTTC TACTGGAAAT
ATCTCATCCT CTATTAGCTA TACGGAGATT GTTAGTAGTC CTACGGAAAT AACTTCTGTC
ACTACTGATT GTACTACTAA TTACATTTCC ACGACTATTA CTTGCTCCTC GTGTGAATCT
AATATTGAGT CAACCTCAGA TAAGGTTTCC AAGCCTCCAG GGGGAACAAA TTACGATACG
ACTAATTTTG CACCCACTGC GCCCGCGCCA AGATCCAGTG AAACTGGCCA ATTTCCCCTG
CTGTCTGATA AGGCCAGCCA AGAAGAGCCA ATTCCACTGT CGTCCGGAAC GGCTGTCTGT
GAAGGTGATT GCGATTTGAC TAGCCGTGTG GAATATGACA AATTGACCCC GACTCAATCC
ACGACTCAGA CCACGACTCA CACAACCACC CAGACAACCA CTCAGTCTAC CACTCAATCC
ACTCAATCTA CGTCCGGGAC CATTCCTGTG TCTGCGTCTG CGCAAAGTCT GTCTGCGCAG
TCTTCAAAAG TTGCTGACTC TTCGACTTTC CACTTTAGTC TGTTCTCCAC CCTGTCTGTG
CTACCTTCAG GATTACCCAT TCCCGTCGCG TTCGACTCGG CTGCTGCTCG TCCCGTGATC
AGCCTTGTTG CTCTCATGAT GTCGCTTCTC TTCTTGTAA
 
Protein sequence
MCILVFVLIT SVLGAQLTDV FQSLEIINNS GSNRAQDIRT AKLTWKIEAG DAVEGDEFSL 
EMPNVFRTKF PGDQLYLVAD YSIYASCVAV DGSYLAQNSY LNCTTTSSVV ESDFKATGTL
SFDFVFNAGG SGSEIDTTAA SILVPGENKI NWSGLQTTVN IDAGPFFAPV SNDKELVYFS
RSTPQMYEQI FMLAGECNGG IVSGSIGMTT NDSLDCTQFA LKATNNLNSF LLPETAINVQ
NTITCKEQSI TFKFNSVANN YRVFLQGLEK FPTNSDAIRH IFAYSIQCGD GTKITKSSGQ
DFVVIDGYED SSGSVEYSTV YTTTTWTETY LTTVTIPCTD ESATATVIVK VPTSCSSDLE
SSTHCPGCES ESSSSVSCDE PEISSDTSSS LSTIYCDESS SSSDTSSLIE SSSDIYSSSL
ADTSVYTSSE SSTTEECPET SSLSSTESSS SEVSSTTEEC TETSSSSIAD SSIYTSSESS
ILSSHDESSS TIETTDSISS VESSSSIQET SELTSSKESS SSVESTSSVE LSSSVQATSS
KELSSSVETT SSVFTSTESL SSDDTSSSIE TTSTVSSSSS EITSPCLQCT SSISSSSSVD
VPSPWTSSSE TESSSSSTTT SYTTIPSSSI EGASSSPFVS SGLTSSESTS VSSNTPPEYT
ITITNRGTTI ITIATCPGGC TRTTTVFPSE TTTTSIATST ETYCPDSSTE IDKSSSVLNT
SITSTTAETT EETSKESTSE TSTNDSTITS KTTTTLINTS TETYCPESST ETDKSSNVLD
TSITSTTSTS SSTKETTDST KDSTKATTTT STISLSTSES TSSSNTGTLS TFSISTSTGN
ISSSISYTEI VSSPTEITSV TTDCTTNYIS TTITCSSCES NIESTSDKVS KPPGGTNYDT
TNFAPTAPAP RSSETGQFPS SSDKASQEEP IPSSSGTAVC EGDCDLTSRV EYDKLTPTQS
TTQTTTHTTT QTTTQSTTQS TQSTSGTIPV SASAQSSSAQ SSKVADSSTF HFSSFSTSSV
LPSGLPIPVA FDSAAARPVI SLVALMMSLL FL
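
The record lists translation table 12 (the alternative yeast nuclear code), in which the codon CTG is read as serine rather than leucine. The sketch below illustrates the effect on the first 120 bases of the gene sequence. It builds the standard codon table from the usual NCBI TCAG ordering and then overrides CTG; the helper names `codon_table` and `translate` are hypothetical, and this is not necessarily how the database produced its protein sequence.

```python
from itertools import product

BASES = "TCAG"
# Amino acids of the standard code (NCBI table 1), listed in TCAG codon order.
STANDARD_AA = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def codon_table(ctg_as_serine: bool) -> dict:
    """Standard table, optionally with the table 12 change CTG -> Ser."""
    table = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), STANDARD_AA)}
    if ctg_as_serine:
        table["CTG"] = "S"
    return table


def translate(dna: str, table: dict) -> str:
    """Translate in frame 1, ignoring whitespace and any trailing partial codon."""
    dna = "".join(dna.split()).upper()
    usable = len(dna) - len(dna) % 3
    return "".join(table[dna[i:i + 3]] for i in range(0, usable, 3))


# First two rows (120 bases) of the gene sequence from this record.
fragment = """
ATGTGCATTC TCGTCTTCGT CCTTATTACT TCTGTACTTG GTGCACAGTT GACGGATGTC
TTTCAATCTC TAGAAATCAT CAACAATTCA GGGCTGAATC GTGCTCAGGA TATTCGTACT
"""

table12 = translate(fragment, codon_table(ctg_as_serine=True))
standard = translate(fragment, codon_table(ctg_as_serine=False))
print(table12)   # MCILVFVLITSVLGAQLTDVFQSLEIINNSGSNRAQDIRT
print(standard)  # MCILVFVLITSVLGAQLTDVFQSLEIINNSGLNRAQDIRT
```

In this fragment the two translations differ only at residue 32, where the CTG codon gives S under table 12 and L under the standard code. The table 12 result matches the first 40 residues of the protein sequence listed above.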