Gene PICST_31841 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_31841 
SymbolFLO1 
ID4839189 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009045 
Strand
Start bp18138 
End bp22013 
Gene Length3876 bp 
Protein Length1291 aa 
Translation table12 
GC content43% 
IMG OID640390504 
ProductFloculation protein FLO1 
Protein accessionXP_001385025 
Protein GI126137003 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.29209 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACAATTC GGTCAATTTA CTTTTTGATT TTTGGTTTTA TACTAAATCT AGCACTTGGA 
GCAGATCCTG GTGCTTGTGT TCCTGACATA AGAACATCAA GTCCGGGTTT CAGAGCTACT
TTTTATCCAT ATGAAAATGC ATATGTTGAT GGTGCTCCAA ATTATAACCC TGAAGATGCG
TATTTCCCAG ATGCAAATAT AAATTACTTG TTGACTGGTT ACAGGAACAC GCCAATTCAG
GGAGCTACAT CTGGTGTTAC TGAGCCAGCT TTCTCGTATG GCTACCCAAC AGACCCTTAC
CTAATTCAAT GGGAAGACTT GGCATCCGAC TATGTTTATG GTGTGTTCAC CTCGGTTTCC
AATTTTACAC TTGAGTTGAC AGGATATTTC TTAGCTCCGG AAACTGGGGA ATTTGCAATT
GAAGTCACTG CTGATAATGG GGCAGTTGTT ACATTTGGAG CAGGGCAGGC GTTTGAATGC
TGCAACACCC AGATTTTATC TAATGATGGT GAATTTACTT TATTTTCTAA TGAAGAGTTC
AATGGAAGTT TACCCAATAT TTTATCTGAA CCTCAAAGGC TTACAGCTGG CTTTTACTAT
CCTATTAGAA TTTCATTCGT TAATACGGCC GGTCCAGCCG CCTTGGATTT GGTTATTACG
ACGCCCAGCG GAATAAGAAT TACCGATTTC GACGAAACAA TTTTCAGCTT TTTAGATGTG
GGAAGTAATT GCCCATATGT ACCACCCACA GTCACCTTTA CTACTCCATG GACTGGAACC
ACAACGAGAA CGATTACTCA TACTCCATCA TCTGACGGTG ACACTATCAC AGTTGAGATA
GATACTCCTG TTCCAACCAC AACGTTCACA ACTCCATGGA CTGGAACCAC AACGAGGACT
ATTACTCATA CTCCATCATC TGACGGTGAC ACTATCACAG TTGAGATAGA TACTCCTGTT
CCAACCACAA CGTTCACAAC TCCTTGGACT GGAACCACAA CGAGGACTAT TACTCATACT
CCATCATCTG ACGGTGACAC TATCACAGTT GAAATCGATA CTCCTGTTCC AACCACAACG
TTCACAACTC CATGGACTGG AACCACAACG AGAACAATCA CTCATACTCC ATCATCTGAC
GGTGACACTA TCACAGTTGA AATCGATACT CCTGTTCCAA CCACAACGTT CACAACTCCA
TGGACTGGAA CCACAACGAG AACAATCACT CATACTCCAT CATCTGACGG TGACACGATC
ACAGTTGAGA TAGATACTCC TGTTCCAACC ACAACGTTCA CAACTCCTTG GACTGGAACC
ACAACGAGAA CGATTACTCA TACTCCATCA TCTGACGGTG ACACGATCAC AGTTGAGATA
GATACTCCTG TTCCAACCAC AACGTTCACA ACTCCATGGA CTGGAACCAC AACGAGGACT
ATTACTCATA CTCCATCATC TGACGGTGAC ACTATCACAG TTGAAATCGA TACCCCAACC
TCAGGTCCTC CAACTACGAC GTTTACAACT CCTTGGACTG GAACCACAAC GAGAACGATT
ACTCATACTC CATCATCTGA CGGTGACACT ATCACAGTTG AGATAGATAC TCCTGTTCCA
ACCACAACGT TCACAACTCC ATGGACTGGA ACCACAACGA GGACTATTAC TCATACTCCA
TCATCTGACG GTGACACTAT CACAGTTGAG ATAGATACTC CTGTTCCAAC CACAACGTTC
ACAACTCCAT GGACTGGAAC CACAACGAGG ACTATTACTC ATACTCCATC ATCTGACGGT
GACACTATCA CAGTTGAAAT CGATACCCCA ACCACAGGTC CTCCAACTAC GACGTTTACA
ACTCCTTGGA CTGGAACCAC AACGAGAACG ATTACTCATA CTCCATCATC TGACGGTGAC
ACTATCACAG TTGAGATAGA TACTCCTGTT CCAACCACAA CGTTCACAAC TCCATGGACT
GGAACCACAA CGAGAACAAT CACTCATACT CCATCATCTG ACGGTGACAC TATCACAGTT
GAGATAGATA CTCCTGTTCC AACCACAACG TTCACAACTC CATGGACTGG AACCACAACG
AGGACTACTA CTCATACTCC ATCATCTGAC GGTGACACTA TCACAGTTGA GATAGATACT
CCTGTTCCAA CTACGACGTT TACAACTCCT TGGACTGGAA CCACAACGAG AACGATTACT
CATACTCCAT CATCTGACGG TGACACTATC ACAGTTGAGA TAGATACTCC TGTTCCAACC
ACAACGTTCA CAACTCCATG GACTGGAACC ACAACGAGGA CTATTACTCA TATTCCATCA
TCTGACGGTG ACACTATCAC AGTTGAAATC GATACCCCAA CCACAGGTCC TCCAACTACG
ACGTTTACAA CTCCTTGGAC TGGAACCACA ACGAGAACAA TCACTCATAC TCCATCATCT
GACGGTGACA CTATCACAGT TGAGATAGAT ACTCCTGTTC CAACTACGAC GTTTACAACT
CCTTGGACTG GAACCACAAC GAGAACGATT ACTCATACTC CATCATCTGA CGGTGACACT
ATCACAGTTG AGATAGATAC TCCTGTTCCA ACTACGACGT TTACAACTCC TTGGACTGGA
ACCACAACGA GAACGATTAC TCATACTCCA TCTTCTGACG GTGACACTAT CACAGTTGAG
ATCGATACCC CAACCACAGG TCCTCCAACT ACGACGTTTA CAACTCCATG GACTGGAACC
ACAACGAGAA CGATTACTCA TACTCCATCA TCTGACGGTG ACACTATCAC AGTTGAGATA
GATACTCCTG TTCCAACCAC AACGTTCACA ACTCCTTGGA CTGGAACCAC AACGAGGACT
ATTACTCATA CTCCATCATC TGACGGTGAC ACTATCACAG TTGAAATCGA TACCCCAACC
ACAGGTCCTC CAACTACGGC GTTTACAACT CCTTGGACTG GAACCACAAC GAGAACGATT
ACTCATACTC CATCATCTGA CGGTGACACT ATCACAGTTG AGATAGATAC TCCTGTTCCA
ACCACAACGT TCACATCTCC ATGGACTGGA ACCACAACGA GAACAATCAC TCATACTCCA
TCATCTGACG GTGACACCAT CACAGTTGAG ATAGATACTC CTATTTCTAT TGCCACATTT
GAGACTGTTC AAACAACTTT CAGCAAGTAT ACCAGTTTTT GGAGCAATTT CACTGAAGAG
ATCACTTCTG ATATTGGAAA AACAGTTATT GATATTTCAA CTACTGTGAT TACAATAAAA
TCGTGTGAGA ATGATTATTG CGATCAGTTG GTAAGAACGA CTGGGTATCA AGTTGTTACA
ACAACCATTG ACGAAACCGT TACAGAGTTT ACAACCTTCT GTGATATTCC TAGCACAACT
GCTGAAGATC AAATATTATC TGCAGCATCT GATCTCGAAG GATCAACTTC CTCTGAGGTA
ACCACAGTCT ATTTCAAAGA TGATACGACC ACGTATTTAA TTTCTCAGAC TGTAACTAAT
ACTGGCATTT CCACGATCAC TACATATTCT TATCAAGAAA ATGAAGATTA TGTATCTACA
GGAACTTCTA AAGATATCGA CCAAAATGAA ATACTTTCAG GGGAACTTAC GATTACCAGT
ACGAAAAGTG GGGATACGAA GCACAGCCAA GATTCTTCTG CCACTCAAAA ATTCGACTAT
TCCAATCAAT TAACGCCTGT GACGGTTGTG CCTCAAGTCA GTTCGACGAA TGTTCCTGAA
GTTTACCATG TCCAAAGTGA AGGTGGTGGC GCAAACACCT ATAAAATCTC TTTCTGGTCA
GTTATATTAG CTCTAGTTGT TGTGTTCGAG TTTTGA
 
Protein sequence
MTIRSIYFLI FGFILNLALG ADPGACVPDI RTSSPGFRAT FYPYENAYVD GAPNYNPEDA 
YFPDANINYL LTGYRNTPIQ GATSGVTEPA FSYGYPTDPY LIQWEDLASD YVYGVFTSVS
NFTLELTGYF LAPETGEFAI EVTADNGAVV TFGAGQAFEC CNTQILSNDG EFTLFSNEEF
NGSLPNILSE PQRLTAGFYY PIRISFVNTA GPAALDLVIT TPSGIRITDF DETIFSFLDV
GSNCPYVPPT VTFTTPWTGT TTRTITHTPS SDGDTITVEI DTPVPTTTFT TPWTGTTTRT
ITHTPSSDGD TITVEIDTPV PTTTFTTPWT GTTTRTITHT PSSDGDTITV EIDTPVPTTT
FTTPWTGTTT RTITHTPSSD GDTITVEIDT PVPTTTFTTP WTGTTTRTIT HTPSSDGDTI
TVEIDTPVPT TTFTTPWTGT TTRTITHTPS SDGDTITVEI DTPVPTTTFT TPWTGTTTRT
ITHTPSSDGD TITVEIDTPT SGPPTTTFTT PWTGTTTRTI THTPSSDGDT ITVEIDTPVP
TTTFTTPWTG TTTRTITHTP SSDGDTITVE IDTPVPTTTF TTPWTGTTTR TITHTPSSDG
DTITVEIDTP TTGPPTTTFT TPWTGTTTRT ITHTPSSDGD TITVEIDTPV PTTTFTTPWT
GTTTRTITHT PSSDGDTITV EIDTPVPTTT FTTPWTGTTT RTTTHTPSSD GDTITVEIDT
PVPTTTFTTP WTGTTTRTIT HTPSSDGDTI TVEIDTPVPT TTFTTPWTGT TTRTITHIPS
SDGDTITVEI DTPTTGPPTT TFTTPWTGTT TRTITHTPSS DGDTITVEID TPVPTTTFTT
PWTGTTTRTI THTPSSDGDT ITVEIDTPVP TTTFTTPWTG TTTRTITHTP SSDGDTITVE
IDTPTTGPPT TTFTTPWTGT TTRTITHTPS SDGDTITVEI DTPVPTTTFT TPWTGTTTRT
ITHTPSSDGD TITVEIDTPT TGPPTTAFTT PWTGTTTRTI THTPSSDGDT ITVEIDTPVP
TTTFTSPWTG TTTRTITHTP SSDGDTITVE IDTPISIATF ETVQTTFSKY TSFWSNFTEE
ITSDIGKTVI DISTTVITIK SCENDYCDQL VRTTGYQVVT TTIDETVTEF TTFCDIPSTT
AEDQILSAAS DLEGSTSSEV TTVYFKDDTT TYLISQTVTN TGISTITTYS YQENEDYVST
GTSKDIDQNE ILSGELTITS TKSGDTKHSQ DSSATQKFDY SNQLTPVTVV PQVSSTNVPE
VYHVQSEGGG ANTYKISFWS VILALVVVFE F