Gene Information |
Locus tag | PICST_31841 |
Symbol | FLO1 |
ID | 4839189 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009045 |
Strand | - |
Start bp | 18138 |
End bp | 22013 |
Gene Length | 3876 bp |
Protein Length | 1291 aa |
Translation table | 12 |
GC content | 43% |
IMG OID | 640390504 |
Product | Flocculation protein FLO1 |
Protein accession | XP_001385025 |
Protein GI | 126137003 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
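The coordinate, gene-length, and protein-length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length; the function name `check_cds_lengths` is a hypothetical helper, not part of any database API.

```python
def check_cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene_length_bp, protein_length_aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and one terminal stop codon.
    """
    gene_length = end_bp - start_bp + 1  # inclusive span on the replicon
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_length // 3
    protein_length = codons - 1  # the stop codon encodes no residue
    return gene_length, protein_length


# Values taken from the record above (PICST_31841)
gene_length, protein_length = check_cds_lengths(18138, 22013)
print(gene_length, protein_length)
```

Under these assumptions the result agrees with the listed Gene Length (3876 bp) and Protein Length (1291 aa). The strand does not affect the length calculation; it only determines which strand of NC_009045 is read.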
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.29209 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 28 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACAATTC GGTCAATTTA CTTTTTGATT TTTGGTTTTA TACTAAATCT AGCACTTGGA GCAGATCCTG GTGCTTGTGT TCCTGACATA AGAACATCAA GTCCGGGTTT CAGAGCTACT TTTTATCCAT ATGAAAATGC ATATGTTGAT GGTGCTCCAA ATTATAACCC TGAAGATGCG TATTTCCCAG ATGCAAATAT AAATTACTTG TTGACTGGTT ACAGGAACAC GCCAATTCAG GGAGCTACAT CTGGTGTTAC TGAGCCAGCT TTCTCGTATG GCTACCCAAC AGACCCTTAC CTAATTCAAT GGGAAGACTT GGCATCCGAC TATGTTTATG GTGTGTTCAC CTCGGTTTCC AATTTTACAC TTGAGTTGAC AGGATATTTC TTAGCTCCGG AAACTGGGGA ATTTGCAATT GAAGTCACTG CTGATAATGG GGCAGTTGTT ACATTTGGAG CAGGGCAGGC GTTTGAATGC TGCAACACCC AGATTTTATC TAATGATGGT GAATTTACTT TATTTTCTAA TGAAGAGTTC AATGGAAGTT TACCCAATAT TTTATCTGAA CCTCAAAGGC TTACAGCTGG CTTTTACTAT CCTATTAGAA TTTCATTCGT TAATACGGCC GGTCCAGCCG CCTTGGATTT GGTTATTACG ACGCCCAGCG GAATAAGAAT TACCGATTTC GACGAAACAA TTTTCAGCTT TTTAGATGTG GGAAGTAATT GCCCATATGT ACCACCCACA GTCACCTTTA CTACTCCATG GACTGGAACC ACAACGAGAA CGATTACTCA TACTCCATCA TCTGACGGTG ACACTATCAC AGTTGAGATA GATACTCCTG TTCCAACCAC AACGTTCACA ACTCCATGGA CTGGAACCAC AACGAGGACT ATTACTCATA CTCCATCATC TGACGGTGAC ACTATCACAG TTGAGATAGA TACTCCTGTT CCAACCACAA CGTTCACAAC TCCTTGGACT GGAACCACAA CGAGGACTAT TACTCATACT CCATCATCTG ACGGTGACAC TATCACAGTT GAAATCGATA CTCCTGTTCC AACCACAACG TTCACAACTC CATGGACTGG AACCACAACG AGAACAATCA CTCATACTCC ATCATCTGAC GGTGACACTA TCACAGTTGA AATCGATACT CCTGTTCCAA CCACAACGTT CACAACTCCA TGGACTGGAA CCACAACGAG AACAATCACT CATACTCCAT CATCTGACGG TGACACGATC ACAGTTGAGA TAGATACTCC TGTTCCAACC ACAACGTTCA CAACTCCTTG GACTGGAACC ACAACGAGAA CGATTACTCA TACTCCATCA TCTGACGGTG ACACGATCAC AGTTGAGATA GATACTCCTG TTCCAACCAC AACGTTCACA ACTCCATGGA CTGGAACCAC AACGAGGACT ATTACTCATA CTCCATCATC TGACGGTGAC ACTATCACAG TTGAAATCGA TACCCCAACC TCAGGTCCTC CAACTACGAC GTTTACAACT CCTTGGACTG GAACCACAAC GAGAACGATT ACTCATACTC CATCATCTGA CGGTGACACT ATCACAGTTG AGATAGATAC TCCTGTTCCA ACCACAACGT TCACAACTCC ATGGACTGGA ACCACAACGA GGACTATTAC TCATACTCCA TCATCTGACG GTGACACTAT CACAGTTGAG ATAGATACTC CTGTTCCAAC CACAACGTTC ACAACTCCAT GGACTGGAAC CACAACGAGG ACTATTACTC ATACTCCATC ATCTGACGGT 
GACACTATCA CAGTTGAAAT CGATACCCCA ACCACAGGTC CTCCAACTAC GACGTTTACA ACTCCTTGGA CTGGAACCAC AACGAGAACG ATTACTCATA CTCCATCATC TGACGGTGAC ACTATCACAG TTGAGATAGA TACTCCTGTT CCAACCACAA CGTTCACAAC TCCATGGACT GGAACCACAA CGAGAACAAT CACTCATACT CCATCATCTG ACGGTGACAC TATCACAGTT GAGATAGATA CTCCTGTTCC AACCACAACG TTCACAACTC CATGGACTGG AACCACAACG AGGACTACTA CTCATACTCC ATCATCTGAC GGTGACACTA TCACAGTTGA GATAGATACT CCTGTTCCAA CTACGACGTT TACAACTCCT TGGACTGGAA CCACAACGAG AACGATTACT CATACTCCAT CATCTGACGG TGACACTATC ACAGTTGAGA TAGATACTCC TGTTCCAACC ACAACGTTCA CAACTCCATG GACTGGAACC ACAACGAGGA CTATTACTCA TATTCCATCA TCTGACGGTG ACACTATCAC AGTTGAAATC GATACCCCAA CCACAGGTCC TCCAACTACG ACGTTTACAA CTCCTTGGAC TGGAACCACA ACGAGAACAA TCACTCATAC TCCATCATCT GACGGTGACA CTATCACAGT TGAGATAGAT ACTCCTGTTC CAACTACGAC GTTTACAACT CCTTGGACTG GAACCACAAC GAGAACGATT ACTCATACTC CATCATCTGA CGGTGACACT ATCACAGTTG AGATAGATAC TCCTGTTCCA ACTACGACGT TTACAACTCC TTGGACTGGA ACCACAACGA GAACGATTAC TCATACTCCA TCTTCTGACG GTGACACTAT CACAGTTGAG ATCGATACCC CAACCACAGG TCCTCCAACT ACGACGTTTA CAACTCCATG GACTGGAACC ACAACGAGAA CGATTACTCA TACTCCATCA TCTGACGGTG ACACTATCAC AGTTGAGATA GATACTCCTG TTCCAACCAC AACGTTCACA ACTCCTTGGA CTGGAACCAC AACGAGGACT ATTACTCATA CTCCATCATC TGACGGTGAC ACTATCACAG TTGAAATCGA TACCCCAACC ACAGGTCCTC CAACTACGGC GTTTACAACT CCTTGGACTG GAACCACAAC GAGAACGATT ACTCATACTC CATCATCTGA CGGTGACACT ATCACAGTTG AGATAGATAC TCCTGTTCCA ACCACAACGT TCACATCTCC ATGGACTGGA ACCACAACGA GAACAATCAC TCATACTCCA TCATCTGACG GTGACACCAT CACAGTTGAG ATAGATACTC CTATTTCTAT TGCCACATTT GAGACTGTTC AAACAACTTT CAGCAAGTAT ACCAGTTTTT GGAGCAATTT CACTGAAGAG ATCACTTCTG ATATTGGAAA AACAGTTATT GATATTTCAA CTACTGTGAT TACAATAAAA TCGTGTGAGA ATGATTATTG CGATCAGTTG GTAAGAACGA CTGGGTATCA AGTTGTTACA ACAACCATTG ACGAAACCGT TACAGAGTTT ACAACCTTCT GTGATATTCC TAGCACAACT GCTGAAGATC AAATATTATC TGCAGCATCT GATCTCGAAG GATCAACTTC CTCTGAGGTA ACCACAGTCT ATTTCAAAGA TGATACGACC ACGTATTTAA TTTCTCAGAC TGTAACTAAT ACTGGCATTT CCACGATCAC TACATATTCT TATCAAGAAA ATGAAGATTA TGTATCTACA GGAACTTCTA 
AAGATATCGA CCAAAATGAA ATACTTTCAG GGGAACTTAC GATTACCAGT ACGAAAAGTG GGGATACGAA GCACAGCCAA GATTCTTCTG CCACTCAAAA ATTCGACTAT TCCAATCAAT TAACGCCTGT GACGGTTGTG CCTCAAGTCA GTTCGACGAA TGTTCCTGAA GTTTACCATG TCCAAAGTGA AGGTGGTGGC GCAAACACCT ATAAAATCTC TTTCTGGTCA GTTATATTAG CTCTAGTTGT TGTGTTCGAG TTTTGA
|
Protein sequence | MTIRSIYFLI FGFILNLALG ADPGACVPDI RTSSPGFRAT FYPYENAYVD GAPNYNPEDA YFPDANINYL LTGYRNTPIQ GATSGVTEPA FSYGYPTDPY LIQWEDLASD YVYGVFTSVS NFTLELTGYF LAPETGEFAI EVTADNGAVV TFGAGQAFEC CNTQILSNDG EFTLFSNEEF NGSLPNILSE PQRLTAGFYY PIRISFVNTA GPAALDLVIT TPSGIRITDF DETIFSFLDV GSNCPYVPPT VTFTTPWTGT TTRTITHTPS SDGDTITVEI DTPVPTTTFT TPWTGTTTRT ITHTPSSDGD TITVEIDTPV PTTTFTTPWT GTTTRTITHT PSSDGDTITV EIDTPVPTTT FTTPWTGTTT RTITHTPSSD GDTITVEIDT PVPTTTFTTP WTGTTTRTIT HTPSSDGDTI TVEIDTPVPT TTFTTPWTGT TTRTITHTPS SDGDTITVEI DTPVPTTTFT TPWTGTTTRT ITHTPSSDGD TITVEIDTPT SGPPTTTFTT PWTGTTTRTI THTPSSDGDT ITVEIDTPVP TTTFTTPWTG TTTRTITHTP SSDGDTITVE IDTPVPTTTF TTPWTGTTTR TITHTPSSDG DTITVEIDTP TTGPPTTTFT TPWTGTTTRT ITHTPSSDGD TITVEIDTPV PTTTFTTPWT GTTTRTITHT PSSDGDTITV EIDTPVPTTT FTTPWTGTTT RTTTHTPSSD GDTITVEIDT PVPTTTFTTP WTGTTTRTIT HTPSSDGDTI TVEIDTPVPT TTFTTPWTGT TTRTITHIPS SDGDTITVEI DTPTTGPPTT TFTTPWTGTT TRTITHTPSS DGDTITVEID TPVPTTTFTT PWTGTTTRTI THTPSSDGDT ITVEIDTPVP TTTFTTPWTG TTTRTITHTP SSDGDTITVE IDTPTTGPPT TTFTTPWTGT TTRTITHTPS SDGDTITVEI DTPVPTTTFT TPWTGTTTRT ITHTPSSDGD TITVEIDTPT TGPPTTAFTT PWTGTTTRTI THTPSSDGDT ITVEIDTPVP TTTFTSPWTG TTTRTITHTP SSDGDTITVE IDTPISIATF ETVQTTFSKY TSFWSNFTEE ITSDIGKTVI DISTTVITIK SCENDYCDQL VRTTGYQVVT TTIDETVTEF TTFCDIPSTT AEDQILSAAS DLEGSTSSEV TTVYFKDDTT TYLISQTVTN TGISTITTYS YQENEDYVST GTSKDIDQNE ILSGELTITS TKSGDTKHSQ DSSATQKFDY SNQLTPVTVV PQVSSTNVPE VYHVQSEGGG ANTYKISFWS VILALVVVFE F
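The record lists translation table 12, the alternative yeast nuclear code, which differs from the standard code in reading CTG (CUG) as serine rather than leucine. The following is a minimal sketch of how the gene sequence maps to the protein sequence under that table. The `translate` helper and the compact codon-table string are written here for illustration and are not taken from the source page; the two test fragments are the first 30 bases and the last 15 bases of the gene sequence above.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order, NCBI translation table 12.
# Identical to the standard code except CTG -> S (standard code: L).
TABLE_12 = "FFLLSSSSYY**CC*WLLLSPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TO_AA = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), TABLE_12)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Spaces (as used to group the sequence in blocks of ten) are ignored.
    """
    seq = dna.replace(" ", "").upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TO_AA[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above
n_terminus = translate("ATGACAATTC GGTCAATTTA CTTTTTGATT")
# Last 15 bases of the gene sequence above, ending in the TGA stop codon
c_terminus = translate("GTGTTCGAGTTTTGA")
print(n_terminus, c_terminus)
```

The two fragments translate to the first ten residues (MTIRSIYFLI) and the last four residues (VFEF) of the listed protein sequence. For whole-sequence work, an established library with built-in support for NCBI translation tables would be the more robust choice.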