Gene PICST_4391 details


Gene Information

Locus tag: PICST_4391
Symbol: ALS1
ID: 4836912
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009042
Strand:
Start bp: 313437
End bp: 314849
Gene Length: 1413 bp
Protein Length: 471 aa
Translation table: 12
GC content: 42%
IMG OID: 640388227
Product: Agglutinin-like protein 1 precursor
Protein accession: XP_001382823
Protein GI: 150864119
COG category:
COG ID:
TIGRFAM ID:
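
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are inclusive coordinates on the replicon (an assumption, although the arithmetic agrees with the listed gene length). The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 313437
end_bp = 314849
gene_length_bp = 1413
protein_length_aa = 471

# Assumption: coordinates are inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# The listed gene sequence ends without a stop codon, so each complete
# codon corresponds to one residue of the listed protein.
codons = span_bp // 3

print(span_bp == gene_length_bp)    # coordinate span vs. Gene Length
print(codons == protein_length_aa)  # codon count vs. Protein Length
```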


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.065918
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGATGCTTA GAACAATAAT TTTTCTTTTC TTTTACCTGA TTGTAAGAGC TGCTGAAATA 
ACTGGTGTAT TCACTAGTTT TGACTCTCTC GTTTTCCAGA ATGGTGGAAC CTATCCTTAT
GAAGCACCTG CAAACCCTAG CTGGATTGCT ACCTTAAGCT GGGCAATGGA TGGCAATGTA
GTTGCTCCTG GAGACACATT TACTTTACAT ATGCCATGCA CTTTTAAGCT TACAACCGAT
AATACAATCA TTCTAGGAGC AGAAGGTACG ACATACGCTA CATGTCTGCA TTTTTCAGGA
GAAATTATTG TTCCTTATTC TGAATTGTAT TGTACAATTA CAGATTCATT ACTACCTGGT
ATGACAGTAG AAGGTTCCAC TTACTTCCCT GTTGTATTCA ATACTGGCGG AAGTGCATTG
GCCAGCGATT TGGAAGACTC TACATGTTTC ACGAATGGGG AGAACACTGT CTCCTTCTAC
GATGGAGATA CACAACTTTC CACCAGTGTT GTTTTCGAGG CAGGGCCACC AAGTGCCAAT
ATAAATCCAG ACATTGTTAT ATTTAGAGCT AGAGTGCTTC CTCAACTTAA TAGCAGCCAA
CATTATGCTG CCATTGGTTG GTGTCCTAAT GGTTACACCT CGGGTACCTT GGGGTTTACT
TTCACTGGTG GAACTCTTGA TTGTGATTCA GTCCATGCCT CAATTACCAA CCAGTTGAAT
GACTTCCATT ATCCAACCAA TGCCGACCCC AATTTTCTGT ATACTTTCAC TTGTTCACCT
ACATCCTATC AGATCAATTT TGAAAATATT CCAGCTGGAT ATAGACCATT TATCGATGGT
CTTACCAGGC CATCAGGCCC CGCTCTTTCT GTCTTTTATA CCAATCGCTA TCTATGTGCT
GATGAGACGA CATTTAGACA GCAAAATCTT AATGTAAATT GGGGTGAATA TGCAGATGGT
CCGACTAGTG GTAACGGCAA TGTCATTGTG GTCACAACTT CTACATACAG TGGTTCACAT
ACCCAGATAT CCACTGCCAC AGGCGAATCA ACGAACACTA TTATTGTTTA TGTACCTACT
CCAACAGTTA CAATAACAGA GACATGGACT GGGCGCGAAA CAAGCACGAC CACTGTATTT
CCAACATCAG AAGGAGGTAC TGTAATAGTT ATTATCGACT TACCAACGGA TCCTCCACCT
AACCCAACCA CCACTATCAC ATCCGTATGG ACAGGTACTG ATACTACTAC AGTCACAGTA
ACTGACACAG AAGGAGGAAC TGATACAGTA ATTGTTCAAG TTCCTGCAAA CCCAACCACC
ACTATCACAT CTGTCTGGAC TGGAACTGAT ACTACTACAA TTACGGTGAC TGACACAGAA
GGAGGCACCG ATACTGTCAT TGTTCAAGTT CCT
 
Protein sequence
VMLRTIIFLF FYSIVRAAEI TGVFTSFDSL VFQNGGTYPY EAPANPSWIA TLSWAMDGNV 
VAPGDTFTLH MPCTFKLTTD NTIILGAEGT TYATCSHFSG EIIVPYSELY CTITDSLLPG
MTVEGSTYFP VVFNTGGSAL ASDLEDSTCF TNGENTVSFY DGDTQLSTSV VFEAGPPSAN
INPDIVIFRA RVLPQLNSSQ HYAAIGWCPN GYTSGTLGFT FTGGTLDCDS VHASITNQLN
DFHYPTNADP NFSYTFTCSP TSYQINFENI PAGYRPFIDG LTRPSGPALS VFYTNRYLCA
DETTFRQQNL NVNWGEYADG PTSGNGNVIV VTTSTYSGSH TQISTATGES TNTIIVYVPT
PTVTITETWT GRETSTTTVF PTSEGGTVIV IIDLPTDPPP NPTTTITSVW TGTDTTTVTV
TDTEGGTDTV IVQVPANPTT TITSVWTGTD TTTITVTDTE GGTDTVIVQV P
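
The record lists translation table 12, the alternative yeast nuclear code, in which the codon CTG is read as serine rather than leucine. The sketch below illustrates the effect on the first 60 bases of the gene sequence. The helper `translate` and the table names are hypothetical, written for this example only. The standard table is built from the usual TCAG codon ordering, and table 12 is derived from it by changing the single CTG entry.

```python
from itertools import product

# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ... (TCAG order).
BASES = "TCAG"
STANDARD_AA = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
standard_table = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), STANDARD_AA)
}

# Translation table 12 (alternative yeast nuclear code): CTG encodes Ser.
table12 = dict(standard_table)
table12["CTG"] = "S"


def translate(dna, table):
    """Translate a DNA string codon by codon, ignoring spaces (hypothetical helper)."""
    dna = dna.replace(" ", "").upper()
    usable = len(dna) - len(dna) % 3  # drop any trailing partial codon
    return "".join(table[dna[i:i + 3]] for i in range(0, usable, 3))


# First line of the gene sequence in this record.
first_line = "GTGATGCTTA GAACAATAAT TTTTCTTTTC TTTTACCTGA TTGTAAGAGC TGCTGAAATA"

print(translate(first_line, table12))         # VMLRTIIFLFFYSIVRAAEI
print(translate(first_line, standard_table))  # VMLRTIIFLFFYLIVRAAEI
```

The table 12 result matches the first 20 residues of the listed protein sequence, including the serine at position 13 that comes from a CTG codon. The leading GTG codon is translated literally as valine here, which is also how the listed protein begins.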