Gene Information |
Locus tag | PICST_64286 |
Symbol | |
ID | 4841042 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009048 |
Strand | - |
Start bp | 113854 |
End bp | 116805 |
Gene Length | 2952 bp |
Protein Length | 895 aa |
Translation table | 12 |
GC content | 42% |
IMG OID | 640392357 |
Product | predicted protein |
Protein accession | XP_001386623 |
Protein GI | 150866881 |
COG category | |
COG ID | |
TIGRFAM ID | |
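The Start bp, End bp, and Gene Length fields above can be cross-checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, which is not stated in the record but agrees with the listed length of 2952 bp. The helper name `gene_length` is hypothetical and used for illustration only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Return the length of a feature given its start and end coordinates.

    Assumes 1-based, inclusive coordinates (an assumption, not stated
    in the record itself).
    """
    if end_bp < start_bp:
        raise ValueError("end_bp must be greater than or equal to start_bp")
    return end_bp - start_bp + 1


# Values copied from the Gene Information table.
start_bp = 113854
end_bp = 116805

length = gene_length(start_bp, end_bp)
print(length)       # 2952, matching the Gene Length field
print(length // 3)  # 984 codon-sized triplets across the whole span
```

The span holds 984 triplets, more than the 895 aa listed under Protein Length. That surplus fits the record's "Is gene spliced | Yes" entry, since a spliced gene's genomic span includes non-coding intron bases.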
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GACAGCTCAC AGAACAACGA CACCTTTCTC AGTCGGATCT TCGGCCTCAA CTCGGTCTAC AACCATCTCC AGGAAAACTA CCAGTACTAC GATCCGGAAT TCGACAGCAC GTACAACCAA CAGGTGCTAG CCAATCTGCA AAGACAAGAC TACGATTTGT TCGGCAATGG CCTCGAGTCC AGAATCGGAG CTTCTATCTA TGATGGAGCT AATGTAAACA ACACTATTGA TAACAACCTC AAAAATCCCA ATGGTGCCAA TGCAGCTAAT GTTCCAAAAG CTAATGGTAT CAATACTAAT GCCGATAACG ACAACAGCAA AACAAACTTG CTCGACTCAG AATCAGATTC TGACTTGTCG CTGCTGTCTT CAGCTTCGCC TTCAGTAGTA GTAAAACCGA GATTTAAAAG GACAGAAGCG GTCGTGGATT GGACCCCAGG TGTAGGGGAA CCCCCAGTTA ATATCCAGAG TGACAATGAG GACGATGAAG TTGACGATTT GCTTACGAGT TTGTCGAAGC CTCGGAGTAG CCCTCCTAGA CGAAAGCCTA CTTTCAATAT TCCCAATGCT AGAGACATAT TTCTGGCCAA TGGAAACACC CAAGCTACAC TGGTATTGCC CTTATATAAT CAGAAATATC GGAAGCCTGT CGAAAGAGAC ACCGGTGCTT CTGCTGGAGA TAGTAGAGGC TATGCCAATT CTGGTATCCG CAGAACTAAC GGCACTAGGT TCGTAATTCC TCCTAAGGAA AGAGCGTTGT ATCTCTGGGC AAACATCACC AACATGGACG AGTTCCTCAC GGATCTCTAC TACTATTATA GAGGTAAGGG AATGCTCAAC ATCGTGTTAT CTAGCATCAT CGACTTGTTG ATTCTTGTCT TCATACTTGG CTTTACTGTG TTTTTGAAGT GGGGAATCAA CTACCGGTAC TTTTTCGACA ATTACAAGGA CTCGACATAT ATAACGCTTG CCGACTTGAT CATCCCCAAC TTCCTCGTTG ATGAGGTGCC GTTATTGGCA AAGTTTTTCC TCTTTGGCTT TGTCTGCTAC ATCGTATTAC GGTTGATCCA ACTTTATTTC AACTACAACT ACAAGCTCAA GGAGATCAAA AACTTCTACA AGTACTTGAT CAACATCCTG AACGATGACG AGTTGATGAC TATCACCTGG AAGACGATTG TAGAGAGGTT GATGTTGCTC AAAGACTACA ACAGCTTGAC TTCGACAACG AGCCATTTTG ACGGTGCTAC GGATCACTAT ATAAATGACT TAAACTCGAA GGTGCGACTC AATGCTCACG ATATAGCCAA TAGGATCATG AGAAAGGAAA ACTACATGAT CGCTTTGATC AACAAGGACG TTCTAGACCT TTCACTCAGT CCTTTCCAGA ACTCGTCCTT CCAGTTGATC AACAACAAGT CTGTTCTCAC AAAGACGTTG GAGTGGAACT TGAAGCTCTG TATTAACAAC TTTGCATTCA ACAACGAGGG TCAAATCAAT CCTAGCATAC TTAAAGATTT CAACAGGAAT CAGTTGGCGA AGGAGTTGAA CTCGCGCTTC AAGATGGCTG CCATTATCAA CTTGATCTTG TGCCCTTTCA TTGTAATCTA CTTCGTGTTG TTATACTTTT TCCGGTACTT CAATGAGTAC AAATCCAATC CCGCTTCTAT CATGGGTCTA AGGCAGTACA CACCTTATGC CGAATGGAAG TTGCGTGAAT TTAACGAGTT GCCCCATTTC TTCATCAGAA GATTGCAGCT CTCTGTTGGT CCAGCTAATA CCTACATCAA CATGTTTCCT 
CGTGGATTTC TTGTGATAAA CTTAATGAAC CTCGTCAACT TCATTCTGGG TGCAATCATG GCTATCTTGG TTATCATGGG GCTCTGGTTT GAAGACGAAA ATCACAGCTT CTGGTCATTT GAACTCACCG AAGGAAAATC AACTTTATTT TACATTAGTA TATTTGGTAC TTTATGGGCC ATAACATCTA CCTCTACTTC TACTTCGGAT ACTGCCGACA ATTTAAATCC CAACTCGCAT TCTTTTGTTT ATGATCCAGA AGCAAGCTTG AGGTATGTTT CCCAGTTTAC TCATTACTTG CCGAGCAGCT GGAACAGAAG ATTGCACACT GTGGAGGTGA AAAACGAGTT CTGTGAGTTG TACAGCTTGA AGATTATCAT AATTCTCAAC GAAATCTTTA GTTTGATCTT GACACCATTC ATTTTGTGGT TCAGGGCTCT GAGCAGTTCT GGTGCTATTA TAGATTTCTT CAGAGAATAC TCCATCCATG TAGATGGATT GGGCTACGTT TGCTATTTTG CAATGTTCAA CTTTGAAGAA AAGGACAAAA ATATGATGTT TGACTTGAAT AAGAGAAAAG GTAAGTCTAA AAGATCTAGA AGAAGTAAGA CTTCTAAAAC TAGCTCTAAG AAGACAGTCA ATGAAATTGA GTTGAACAAT ATCAAGTCAA AGAGACGTGA AAAAGCTAAA ATCAGCGATT CTGAAGATGC TAGTTCGTTG CCAAATACCA GTGACGATGA AAGTGGAAAC GACTTGAATG CAGACACCTA TCAGGACGAG AAAATGATAA AGTCGTACAT GTATTTCCTT GAGAGTTATG GAGGTTCCAA TAACAATGGC AGTAACAACA ACCCTTCACA GCAACATGGG AACACTTTGA AGAATGGCAC ACGTGTAGCT GGAAAAGCAG ACGTTAGGGC CATAAACAGC AATAATAAAC TATTGGCCAA AAACTCGGTA ATATCGAACA TCGATCCTTC GCCATCTTTG ATCATACAAG GTCCTTCGGA CAACCACAGC TTGTTGGATT CGGCTTATAA TATTAACTAC AAGTTTGATG ATGCGGAGCA GGAAGAATCG ACGAGACCAG GAAAGAAATC GGGAGTTTTG GGTATGATCA ACCAATTTTA CAAGCAGGAC TTGGGAAGGT AG
|
Protein sequence | DSSQNNDTFL SRIFGLNSVY NHLQENYQYY DPEFDSTYNQ QVLANSQRQD YDLFGNGLDK TNLLDSESDS DLSSSSSASP SVVVKPRFKR TEASDNEDDE VDDLLTSLSK PRSSPPRRKP TFNIPNARDI FSANGNTQAT SVLPLYNQKY RKPVERDTGA SAGDSRGYAN SGIRRTNGTR FVIPPKERAL YLWANITNMD EFLTDLYYYY RGKGMLNIVL SSIIDLLILV FILGFTVFLK WGINYRYFFD NYKDSTYITL ADLIIPNFLV DEVPLLAKFF LFGFVCYIVL RLIQLYFNYN YKLKEIKNFY KYLINISNDD ELMTITWKTI VERLMLLKDY NSLTSTTSHF DGATDHYIND LNSKVRLNAH DIANRIMRKE NYMIALINKD VLDLSLSPFQ NSSFQLINNK SVLTKTLEWN LKLCINNFAF NNEGQINPSI LKDFNRNQLA KELNSRFKMA AIINLILCPF IVIYFVLLYF FRYFNEYKSN PASIMGLRQY TPYAEWKLRE FNELPHFFIR RLQLSVGPAN TYINMFPRGF LVINLMNLVN FISGAIMAIL VIMGLWFEDE NHSFWSFELT EGKSTLFYIS IFGTLWAITS TSTSTSDTAD NLNPNSHSFV YDPEASLRYV SQFTHYLPSS WNRRLHTVEV KNEFCELYSL KIIIILNEIF SLILTPFILW FRASSSSGAI IDFFREYSIH VDGLGYVCYF AMFNFEEKDK NMMFDLNKRK GKSKRSRRSK TSKTSSKKTV NEIELNNIKS KRREKAKISD SEDASSLPNT SDDESGNDLN ADTYQDEKMI KSYMYFLESY GAGKADVRAI NSNNKLLAKN SVISNIDPSP SLIIQGPSDN HSLLDSAYNI NYKFDDAEQE ESTRPGKKSG VLGMINQFYK QDLGR
|