Gene PICST_4539 details


Gene Information

Locus tag: PICST_4539
Symbol: ALS1.2
ID: 4837070
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009042
Strand:
Start bp: 1015371
End bp: 1016786
Gene Length: 1416 bp
Protein Length: 472 aa
Translation table: 12
GC content: 43%
IMG OID: 640388385
Product: Agglutinin-like protein 1 precursor
Protein accession: XP_001382423
Protein GI: 150863820
COG category: [R] General function prediction only
COG ID: [COG3889] Predicted solute binding protein
TIGRFAM ID:
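
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the reported gene length excludes a stop codon; neither convention is stated in the record, and the function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, has_stop_codon: bool = False) -> int:
    """Number of residues encoded by an unspliced CDS of the given length."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    # Assumption: the stop codon, if counted in the length, encodes no residue.
    return codons - 1 if has_stop_codon else codons


length = gene_length(1015371, 1016786)
print(length)                  # 1416
print(protein_length(length))  # 472
```

Under these assumptions both values agree with the Gene Length and Protein Length fields of the record.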


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.340377
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value: 0.951225
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTTATAG TATTAGCAGT TGCTTTCCTT TTTCTGTGTG TAAGAGCTGC AGTTGTCTCA 
GGTGTTTTCA CCAGCTTCGA CTCTCTTGTT TTCCAGAACG GGGGTAACTA TCCATTTGAT
GGACCAGCTA ATCCAAGTTG GATTGCCACT TTGAAATGGC AACTTGATGG CACTAAGGTT
GCTCCTGGTG ACACATTCAC CTTAGACATG CCATGCACTT TCAAGTTCAC ACAAACTCCT
GCTGATGCAC CAGTACTCCT TCAAGCTGGT GGAATCACTT ATGCTACATG TCAAACACTT
GGTGGTGAAA TCATTGTTCC ATATTCTCAA TTACAATGTA CAGTTGAAAA TGCTGTTACC
ACAAGCACCC TTGCCTCAGG ATCTGTTTAT TTTCCAGTTG TATTTAATAT TGGTGGAAGT
GCTACCCCTG TAGATTTGAC CGATTCTAAA TGCTTTGCCA GTGGTGATAA TACTGTTACC
TTCAATGATG GTGACACCAA ACTCTCTATT ACTGCAAACT TTGAAACCGG TTATCCTGCT
TCTGGAGTCA ACCCCACTAA CATCATTTAC AGAAACAGGT TCCTCCCTCA GCTCGGTGAG
AGCCAACACT TGTTAGTTGC TGGTCAATGT CCAAGAGGTT ACACTTCCGG TACTTTGGGA
TTCTCTTTCA GTGGCGGCAA ATTAGATTGC TCTAGTGTTC ATGCTGCAAT TACCAACCAA
TTGAATGATT GGTATTTCCC AACCGACGCA GAGACAGATT TTCTGTTCAC CTATACCTGC
TCTGCTTCTG GCTACCAAAT AACATACAAG AATATTCCTG CCGGTTACAG ACCGTTTATA
GATGGACTTA CGTCGGCTAC CGCAAATTTA CTTACTGTTT CTTACACTAA CAAGTTCGTT
TGTGTCGGAT CCTCTATCAA TAATGACAAG AGCACCAAAG TTACATGGAG TTCTTATCAG
AATTCCGATA GCGGAGGTGA TGGTCATGTC ATTGTTCTCA CTACCTCAAC TGGCACTGGA
TCCAGTACCA CTGTGACTAC CGCTACTGGT AAAAGTACAA ATACTATTAT TGTAATTGTC
CCAACCCCAA CTACAACAAT CACCCAAACT TACACCGGCA CCGTAACAAC CACCACCACC
GTTACTGCCA CTTCCGGAGG CACAAACACT GTCATTGTGG AAGTTCCAAC TTCTTACCCA
CCAAACCCAA CAACTACTGT GACATCGACT TGGACTGGAA CAGAAACTTC ATCCACCACT
GTTACTGATA CACATGGCGG AACTGATACT ATCATAGTTG TGGTTCCTTC GAATCCAACC
ACAACATTAA CATCCACTTG GACTGGAACA GAAACCTCTT CGACTACTGT CACTGACACT
CAAGGTGGAA CTGATACTGT AATTGTTGTA GTCCCT
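
A GC content figure such as the 43% reported in the record could be computed from a sequence block like the one above roughly as follows. This is a minimal sketch: `gc_percent` is a hypothetical helper, and whether the database computes the value over the coding sequence alone, or how it rounds, is an assumption.

```python
def gc_percent(seq: str) -> float:
    """Percentage of G and C bases, ignoring whitespace between 10-base groups."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(bases.count(b) for b in "GC")
    return 100.0 * gc / len(bases)


print(gc_percent("ATGC"))  # 50.0
```

Passing the full gene sequence as a single string (line breaks and spaces included) would give the value to compare against the GC content field.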
 
Protein sequence
MLIVLAVAFL FSCVRAAVVS GVFTSFDSLV FQNGGNYPFD GPANPSWIAT LKWQLDGTKV 
APGDTFTLDM PCTFKFTQTP ADAPVLLQAG GITYATCQTL GGEIIVPYSQ LQCTVENAVT
TSTLASGSVY FPVVFNIGGS ATPVDLTDSK CFASGDNTVT FNDGDTKLSI TANFETGYPA
SGVNPTNIIY RNRFLPQLGE SQHLLVAGQC PRGYTSGTLG FSFSGGKLDC SSVHAAITNQ
LNDWYFPTDA ETDFSFTYTC SASGYQITYK NIPAGYRPFI DGLTSATANL LTVSYTNKFV
CVGSSINNDK STKVTWSSYQ NSDSGGDGHV IVLTTSTGTG SSTTVTTATG KSTNTIIVIV
PTPTTTITQT YTGTVTTTTT VTATSGGTNT VIVEVPTSYP PNPTTTVTST WTGTETSSTT
VTDTHGGTDT IIVVVPSNPT TTLTSTWTGT ETSSTTVTDT QGGTDTVIVV VP
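
The record lists translation table 12 (the alternative yeast nuclear code), in which CTG is read as serine rather than leucine. The sketch below illustrates the effect on the first 60 bases of the gene sequence. It builds the standard codon table from the compact NCBI ordering and overrides the CTG entry; the helper names are hypothetical, and initiation-codon differences between the two tables are not modelled.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code (NCBI table 1), codons ordered TTT, TTC, TTA, TTG, TCT, ...
STANDARD_AAS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def build_table(ctg_as_serine: bool) -> dict:
    """Codon table; with ctg_as_serine=True it mimics table 12 for sense codons."""
    codons = ("".join(c) for c in product(BASES, repeat=3))
    table = dict(zip(codons, STANDARD_AAS))
    if ctg_as_serine:
        table["CTG"] = "S"  # the only sense-codon change in table 12
    return table


def translate(dna: str, table: dict) -> str:
    """Translate in frame 1, ignoring whitespace and any trailing partial codon."""
    dna = "".join(dna.split()).upper()
    usable = len(dna) - len(dna) % 3
    return "".join(table[dna[i:i + 3]] for i in range(0, usable, 3))


FIRST_60 = "ATGCTTATAG TATTAGCAGT TGCTTTCCTT TTTCTGTGTG TAAGAGCTGC AGTTGTCTCA"
print(translate(FIRST_60, build_table(True)))   # MLIVLAVAFLFSCVRAAVVS
print(translate(FIRST_60, build_table(False)))  # MLIVLAVAFLFLCVRAAVVS
```

Only the table 12 reading reproduces the first 20 residues of the protein sequence above; the standard code places leucine at position 12, where the record has serine.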