Gene PICST_31095 details


Gene Information

Locus tag: PICST_31095
Symbol: ALS2
ID: 4837691
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009043
Strand:
Start bp: 1750762
End bp: 1755120
Gene Length: 4359 bp
Protein Length: 1452 aa
Translation table: 12
GC content: 44%
IMG OID: 640389006
Product: agglutinin-like protein 2
Protein accession: XP_001383953
Protein GI: 150864934
COG category:
COG ID:
TIGRFAM ID:
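
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive coordinates and that Gene Length counts the stop codon; neither convention is stated in the record, and the function names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 1750762
END_BP = 1755120
GENE_LENGTH_BP = 4359
PROTEIN_LENGTH_AA = 1452


def span_length(start, end):
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(gene_length_bp):
    """Amino-acid codons in an unspliced CDS, assuming the length includes one stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
print(coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions both comparisons hold for this record: the coordinates span 4359 bp, and 4359 / 3 - 1 gives 1452 residues.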


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGATCT CACTCAAACT AGTGATAAGA TCTTTACTTT TTGTCGCATC GGCTTTGGCC 
GCTGACGATC TAGTTATAAG CCAAAATACA ATTGTTAGCA CTGGTGATAC GTTAATCAGA
CAAAACTTCA TTGTAAACAG TGGGGTTTTC TATTCAATTG ATTATGGTAT GACCCACAAC
TTTTATAATG ATATTACAAT TAACGGCAAA CTCTATATTA CCAACAAGAT TGTTAGGACA
GGTATGACTT GCGATGTAAT TGGTACTACC GCGAATATTG TCAACAACGG GTGGATAGTT
TTAGATGATA CCAATGCTAC CTCGGCACCA ACTTATGATT GGTATGGTGG TTCCTTCGAG
AACAACGGTA TGGTGTGGTT TGCTGGAATT GGTAATACCG GTGGGTCAAC ATTTGCAATC
CAACCAAAGG GTTCTTTTAT TAATACTGGT ACCATCATTT TATATCAAAC CGTTAGAAGA
TCTGGTGGAA CTTCTCACCT TGGTTTGGAT GGTAAAACCA TTACTAATGA CGGAACTGTT
TGTATCTATC AAAACATTTT TTTCCAAGGT TCCACAGTGG AAGGTAATGG ATGTTTCGAT
GTCGGCCTTG ACTCAAACTT CTGGGCCACT AATGTTAATT CAAGACCTAT GGAAGAAGGA
CAACTTATTT ACTTGAGTAC TAGTACTTCA AGCTTAAGAA TTGATACGTA TCCTCCCAAT
ATCCCTATCC ACATAGCCGG ATGGGGTAAT AACAATGTTA TTGGACTTAG CACTGCTATC
AACAGCTTTG ATTACGATGG AAACAACTTG AGCATTAGAA GTGGAAGTTA TACCTATAAA
CTTGTTATTG GTCCAGGTTA TGATCCCTCA TTAATAAGCA TTGGTTCCGC AGCATACGGT
AGCGGTGTTG GTACAATATC AAGGGCCGGT ATCTTATACT CTGGTCCACC ACCAGACGCA
AGCAGACCCT CTCAATGTAA TGAGTGTCCT TCCCCTCCTC GTGCTCCGAT ATCAACTGAA
ACTCCTAACC CAACCACCAC TGTTACTTCT ACGTGGACTG GTACTTTCAC CACCACGGAA
ACATATACTG ATACTCAGGG TGGCACTGAC ACAATCGTCA TTGTGGTTCC AACAGAGTAT
TCCTCTACCG AAGAATCCAC TACTGAAGAA CCCACTACCG AAGAATCATC TACTGAAGAA
TCGACTACCG AAGAACCCAC TACTGAAGAA TCATCGACTG AAGTCTTGCC AAACGAAACT
ACCACTATTA CTTCTACGTG GACTGGTACT TTCACCACCA CGGAAACATA TACTGATACT
CAGGGTGGTA CTGACACAAT CGTCATTGTG GTTCCAACAG AGTATTCCTC TACCGAAGAA
TCCACTACTG AAGAACCCAC TACCGAAGAA TCATCGACTG AAGTCTTGCC AAACGAAACT
ACCACTATTA CTTCTACGTG GACTGGTACT TTCACCACCA CGGAAACATA TACTGATACT
CAGGGTGGCA CTGACACAAT CGTCATTGTG GTTCCAACAG AGTATTCCTC TACCGAAGAA
TCCACTACTG AAGAACCCAC TACCGAAGAA TCATCTACTG AAGAATCGAC TACCGAAGAA
CCCACTACCG AAGAATCATC GACTGAAGTC TTGCCAAACG AAACTACCAC TATTACTTCG
ACGTGGACTG GTACTTTCAC CACCACAGAA ACATATACTG ATACTCAGGG TGGCACTGAC
ACAATCGTCA TTGTGGTTCC AACAGAGTAT TCCTCTACCG AAGAATCCAC TACTGAAGAA
CCCACTACCG AAGAATCATC TACTGAAGAA TCGACTACCG AAGAACCCAC TACTGAAGAA
TCATCGACTG AAGTCTTGCC AAACGAAACT ACCACTATTA CTTCGACGTG GACTGGTACT
TTCACCACCA CAGAAACATA TACTGATACT CAGGGTGGCA CTGACACAAT CGTCATTGTG
GTTCCAACAG AGTATTCCTC TACCGAAGAA TCCACTACTG AAGAACCCAC TACCGAAGAA
TCCACTACTG AAGAACCCAC TACTGAAGAA TCATCGACTG AAGTCTTGCC AAACGAAACT
ACCACTATTA CTTCTACGTG GACTGGTACT TTCACCACCA CGGAAACATA TACTGATACT
CAGGGTGGCA CTGACACAAT CGTCATTGTG GTTCCAACAG AGTATTCCTC TACCGAAGAA
CCCACTACCG AAGAATCATC GACTGAAGTC TTACCAAACG AAACTACCAC TATTACTTCT
ACGTGGACTG GTACTTTCAC CACCACGGAA ACATATACTG ATACTCAGGG TGGTACTGAC
ACAATCGTCA TTGTGGTTCC AACAGAGTAT TCCTCTACCG AAGAATCCAC TACCGAAGAA
TCCTCTACTG AAGTCTTGCC AAACGAAACT ACCACTATTA CTTCTACGTG GACTGGTACT
TTCACCACCA CAGAAACATA TACTGATACT CAGGGTGGCA CTGACACAAT CGTCGTTGTA
GTTCCAACAG AATACTCCTC TGCTGAAGTC TCACCTAACC CAACTACCAC TGTTACTTCC
TCGTGGACTG GTACTTTCAC CACCACAGAA ACATATACTG ATACTCAGGG TGGTACTGAC
ACAATCGTCG TTGTAGTTCC AACAGAATAC TCCTCTGCTG AAGTCTCACC TAACCCAACT
ACCACTATTA CTTCGACATG GACTGGTACT TTCACCACCA CAGAAACATA TACTGATACT
CAGGGTGGCA CTGACACAAT CGTCGTTGTA GTTCCAACAG AATACTCCTC TGCTGAAGTC
TCACCTAACC CAACTACCAC TATTACTTCC TCGTGGACTG GTACTTTCAC CACCACAGAA
ACATATACTG ATACTCAGGG TGGCACTGAC ACAATCGTCG TTGTAGTTCC AACAGAATAC
TCCTCCGCTG AAGTCTCACC TAACCCAACT ACCACTATTA CTTCTACGTG GACTGGTACT
TTCACCACCA CAGAAACATA TACTGATACT CAGGGTGGCA CTGACACAAT CGTCGTTGTA
GTTCCAACAG AATACTCCTC TGCTGAAGTC TCACCTAACC CAACTACCAC TATTACTTCC
TCGTGGACTG GTACTTTCAC CACCACAGAA ACATATACTG ATACTCAGGG TGGTACTGAC
ACAATCGTCG TTGTAGTTCC AACAGAATAC TCCTCTGCTG AAGTCTCACC TAACCCAACT
ACCACTATTA CTTCGACATG GACTGGTACT TTCACCACCA CAGAAACATA TACTGATACT
CAGGGTGGCA CTGACACAAT CGTCGTTGTA GTTCCAACAG AATACTCCTC TGCTGAAGTC
TCACCTAACC CAACTACCAC TATTACTTCG ACATGGACTG GTACTTTCAC CACCACAGAA
ACATATACTG ATACTCAGGG TGGCACTGAC ACAATCGTCG TTGTAGTTCC AACAGAATAC
TCCTCCGCTG AAGTCTCACC TAACCCAACT ACCACTATTA CTTCCTCGTG GACTGGTACT
TTCACCACCA CAGAAACATA TACTGATACT CAGGGTGGTA CTGACACAAT CGTCGTTGTA
GTTCCAACAG AATACTCCTC TGCTGAAGTC TCACCTAACC CAACTACCAC TATTACTTCG
ACATGGACTG GTACTTTCAC CACCACAGAA ACATATACTG ATACTCAGGG TGGCACTGAC
ACAATCGTCG TTGTAGTTCC AACAGAGTAT TCTCCCGTTG AAGATTACCC TACTGAAGTC
TCACCTAACC CAACTACCAC TATAACTTCT TTGTGGACTG GTACTTTCAC CACTACAAAA
ACGTTTACTT ACACTCAAGG AGGATCCGTG ACTGTCATTG TCGCAGTTCC AACAGAGTAT
TCTTCTGTTG GCGGTTCCTC TGTCGAGCTA CCAGCTGATA TAACATTGGG TCAATCATCT
GCCGTGGTGA TTAATCCTGG CGTAGATCTT GAATCTGAAA CCGGGCCCGC TGCTGAGTTT
AGTAAGCACA TCAGTGATCA TGTGCAATCT CGCTCAATCC CTGAAGAGTG GTTCACTACT
ACGGTTACAA CAACAGGTCC AAATGGCGAA GTTTCGACTT ATACAACAGC CGACACTTCA
GGTTTTCAAA CAAATGGCGT GGTTCCCGCT CCATCTAGTA GTTCCACCTA CGACTCCACT
GGTTCTTCTA ATTCTGATTC TCTGGCTGAA GATTTAAAGG ATGAAACAGA TTTGTCATCT
TCAGTAGATG AGTACGAGGG ATCTGGTGCA ACCCTCATTG GAGCAAGTCT GGTTTACTTT
GCTTCAATCT TGTGTCTTAT TTTCTCCCTC TACGCTTGA
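
The gene sequence is printed in groups of 10 bases, 60 per line, so it needs to be joined before any computation. As a minimal sketch, the helpers below strip the whitespace and compute a GC fraction; they are hypothetical helpers, not part of the source database. The demonstration uses only the first line of the sequence, so its result is not the gene-wide GC content reported in the record.

```python
def normalize(block):
    """Join a whitespace-formatted sequence block into one uppercase string."""
    return "".join(block.split()).upper()


def gc_fraction(block):
    """Fraction of G and C bases in a sequence block."""
    seq = normalize(block)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence only (60 of the 4359 bases).
FIRST_LINE = "ATGAAGATCT CACTCAAACT AGTGATAAGA TCTTTACTTT TTGTCGCATC GGCTTTGGCC"

print(len(normalize(FIRST_LINE)))
print(round(gc_fraction(FIRST_LINE), 2))
```

Passing the full sequence block to `gc_fraction` in the same way would give the value to compare against the GC content field.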
 
Protein sequence
MKISLKLVIR SLLFVASALA ADDLVISQNT IVSTGDTLIR QNFIVNSGVF YSIDYGMTHN 
FYNDITINGK LYITNKIVRT GMTCDVIGTT ANIVNNGWIV LDDTNATSAP TYDWYGGSFE
NNGMVWFAGI GNTGGSTFAI QPKGSFINTG TIILYQTVRR SGGTSHLGLD GKTITNDGTV
CIYQNIFFQG STVEGNGCFD VGLDSNFWAT NVNSRPMEEG QLIYLSTSTS SLRIDTYPPN
IPIHIAGWGN NNVIGLSTAI NSFDYDGNNL SIRSGSYTYK LVIGPGYDPS LISIGSAAYG
SGVGTISRAG ILYSGPPPDA SRPSQCNECP SPPRAPISTE TPNPTTTVTS TWTGTFTTTE
TYTDTQGGTD TIVIVVPTEY SSTEESTTEE PTTEESSTEE STTEEPTTEE SSTEVLPNET
TTITSTWTGT FTTTETYTDT QGGTDTIVIV VPTEYSSTEE STTEEPTTEE SSTEVLPNET
TTITSTWTGT FTTTETYTDT QGGTDTIVIV VPTEYSSTEE STTEEPTTEE SSTEESTTEE
PTTEESSTEV LPNETTTITS TWTGTFTTTE TYTDTQGGTD TIVIVVPTEY SSTEESTTEE
PTTEESSTEE STTEEPTTEE SSTEVLPNET TTITSTWTGT FTTTETYTDT QGGTDTIVIV
VPTEYSSTEE STTEEPTTEE STTEEPTTEE SSTEVLPNET TTITSTWTGT FTTTETYTDT
QGGTDTIVIV VPTEYSSTEE PTTEESSTEV LPNETTTITS TWTGTFTTTE TYTDTQGGTD
TIVIVVPTEY SSTEESTTEE SSTEVLPNET TTITSTWTGT FTTTETYTDT QGGTDTIVVV
VPTEYSSAEV SPNPTTTVTS SWTGTFTTTE TYTDTQGGTD TIVVVVPTEY SSAEVSPNPT
TTITSTWTGT FTTTETYTDT QGGTDTIVVV VPTEYSSAEV SPNPTTTITS SWTGTFTTTE
TYTDTQGGTD TIVVVVPTEY SSAEVSPNPT TTITSTWTGT FTTTETYTDT QGGTDTIVVV
VPTEYSSAEV SPNPTTTITS SWTGTFTTTE TYTDTQGGTD TIVVVVPTEY SSAEVSPNPT
TTITSTWTGT FTTTETYTDT QGGTDTIVVV VPTEYSSAEV SPNPTTTITS TWTGTFTTTE
TYTDTQGGTD TIVVVVPTEY SSAEVSPNPT TTITSSWTGT FTTTETYTDT QGGTDTIVVV
VPTEYSSAEV SPNPTTTITS TWTGTFTTTE TYTDTQGGTD TIVVVVPTEY SPVEDYPTEV
SPNPTTTITS LWTGTFTTTK TFTYTQGGSV TVIVAVPTEY SSVGGSSVEL PADITLGQSS
AVVINPGVDL ESETGPAAEF SKHISDHVQS RSIPEEWFTT TVTTTGPNGE VSTYTTADTS
GFQTNGVVPA PSSSSTYDST GSSNSDSSAE DLKDETDLSS SVDEYEGSGA TLIGASSVYF
ASILCLIFSL YA
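
The record lists translation table 12, the alternative yeast nuclear code, which reads the codon CTG as serine instead of leucine. The sketch below builds that table from the standard code using only the Python standard library and translates two short fragments copied from the gene sequence above. The function names are illustrative, and stopping at the first stop codon is an assumption of this sketch.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code, one letter per codon in TCAG order (TTT, TTC, TTA, ...).
STANDARD_AAS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def codon_table(ctg="S"):
    """Codon table; ctg='S' gives table 12, ctg='L' gives the standard code."""
    codons = ("".join(c) for c in product(BASES, repeat=3))
    table = dict(zip(codons, STANDARD_AAS))
    table["CTG"] = ctg
    return table


def translate(dna, table):
    """Translate in frame from the first base, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = table[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


TABLE_12 = codon_table("S")
STANDARD = codon_table("L")

# First 60 bases of the gene sequence.
START = "ATGAAGATCT CACTCAAACT AGTGATAAGA TCTTTACTTT TTGTCGCATC GGCTTTGGCC"
# In-frame fragment near the 3' end that contains a CTG codon.
CTG_FRAGMENT = "TCTCTGGCTGAA"

print(translate(START, TABLE_12))         # MKISLKLVIRSLLFVASALA
print(translate(CTG_FRAGMENT, TABLE_12))  # SSAE
print(translate(CTG_FRAGMENT, STANDARD))  # SLAE
```

The first result matches the opening 20 residues of the protein sequence, and the CTG fragment reproduces the SSAE run in the protein only under table 12; the standard code would give SLAE there.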