Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_81668 |
Symbol | |
ID | 4837402 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009042 |
Strand | - |
Start bp | 1812055 |
End bp | 1815229 |
Gene Length | 3175 bp |
Protein Length | 995 aa |
Translation table | 12 |
GC content | 44% |
IMG OID | 640388717 |
Product | predicted protein |
Protein accession | XP_001383119 |
Protein GI | 150864344 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TGTGAATCTT TTCCGTACTC TTGATTACTT TTATTGACTT TTTCTTTCTT CCATTCTTCC ATTCTTCTTC CATTCTTCTA CATTAGCCCC AATTCCCTAG TCAACGAAGA TGGCTACGAA CTTCCATCTC TACCACCCCT CGCTCAGACG CGATCGAGAA GAGTCGGCAA CCGGTGGAGA CCTTAAGACT ACTCCGACGA ATACCTCTTC CAGCAACAAC AGCAACAGCA ACAGCACAAG TCTATTTGGA AGCAGCCCAA AGCACACTCT GCATTCTCAT GCACATCATA GAGGATTCTC GTTCGACAAG ATCAAGAGCA AGGTGAGTCG TCGCTCGCTG GAGCCAGAAG TGATACCAGG TTCTCCAGAC AGAAGAAACT CTGCTTTTCC CATGATTGTG ATCTCAGAAT CAAATGAGAC AAGTCGTAAC CAAAGTCAGA ACGAAAACCA AAACCAAATG AATGTCTCTG CAGCAAATAC AAGACGTCTC AGAGCTCAGT CGCTTTCGCT CATCCACAGC CAGTCCCGGT CCCGTTCTGG GTCCCGTTCC AGAAGTCTTG TTCCCAACGG AAACTCAACT TCTCGGAAAC AAACGAAGGA ACTCGTGAAG TTAGAAACGG CTCACATAAT CCTGAAGAAA TTGGAATCGA TTTTGCTGGA TCTCGGACTT CAGTCTCCTA TACCGTTGAA AGCTACCAAC AACACCTCCA GTGGCTCCAT AGCTAAGTCG GTCAAGGTAT ACATCGCCAA CACAAACGAT TGCATCTTCT TAGCACCAGC ATCTTCCGCC AGTTTCACCT ATGAAGACGT CGAGAACGGG GGTGCAATTC CACATGATGA TGAAGACGAC GACGAAGGAA TGGATAGTCT CGTTGTAGAC TCTGGCAGAA GCAATAGCAA CTTCACTTCC ATACGACGTG GATCCGTAAT CTCAGATGAT GAGTCTGCTG TAACGTCAGA CGAGGAAGCG GTTCTGCCAG ACTTGGCTAC ACCCAGAAGA CTTAAGAAAA AAATGAGGTT CTTCAACTCG CCCAACTATC TCTGCACCAA GATCGATTCA GACATGCCGA TCCCACACAC TTTTGCTGTC GTAATCGAAC TCGAAAAGGA TTCCACTTCT GTGAGAGACG TAAAGTTCGA TTTTCTGTCA GTCACGAATA TCTTGTGGCC ATCTGGAGAT CCTTATAGCC GAACCCATTC AAAGGAGCGG TTCAAGATTG GCAGTATGGA GTGGTCTACA AGTCTCGGAG ATTCCGATTT CTACATTAAC ACCAACAACT CGAACGATGT ACGCATCAAA AATATCACTC CTGATGATCT CGCCAGAAGG ACAAGAGAAT ATAAACTCGT GAATATCCGC AATCTTGCTG ATGGAACAGA CAACGCCAAC ACCAGTCGTA AAAACTCTAT CTCACTAGAC TTTAACGATC TGCCCTTAAA TACTCATGGC AATAGCAATG GTCATAGCAG TAATAGCCAT GGAGGGAACA GTAATAGTAG TGAAGTGTAC AAGGCTGGTC TTTATGTGTT TCTCTTGCCG ATTATCTTAC CTCAGCATAT CCCTCCTACG ATCATCTCAA TCAACGGCAC GCTTTTGCAC ACATTGAGTA TCAACTTCAA CAAGACAAGT GACATGCTCA ACAGAAAGGT CAAAGTCTGC TCCACGTACA ATTTGCCTAT GGTGAGGACG CCTCCCTCAT TTGCTAATTC GATTGCTGAT AAGCCCATCT ACGTTAACCG TGTATGGAAC GATTCGTTGC ATTACATCAT AACTTTTCCC AAAAAGTATG TGTCGTTGGG ACTGGAACAT GTAGTCAATG TAAAACTTGT TCCGCTAGTC AAAGACGTTA TTATCAAGCG CATCAAATTC AATGTTCTAG AGAGAATAAC ATATGTCTCT AAGAACTTGT CTAAGGAGTA CGACTTTGAC AGTGATGATC CTTATTGTGT GAAAGCACAT TCATCGGACA ATAGAACTAG AGAAAGAGTC GTTTCTCTTT GTGAGTTAAA GACAAAATCG AAACAGAACA GTATGATAGG GGTTCCTGGA GATCCTTATA AGGAGGAAGT TGTCAAGTGT CCCGACAATA ACTTGTTGTT TTCATGCTAT GAACCTGATG AGTATGAAAA CTTTAGACTA GAGGACTCGA ATACAAACAA GCGTAAGGGA AAAGACAAGG AAGAGACACC TACGATGATT GCTTCACCTC TCGATATCAA CATAGCTTTG CCTTTTTTAA CCACCAGAAT GGACAAGACC ATGATGACTA GTACAGAAGA AGATCCAGCG CATCTTCATA GAAGTTCTGT TTCCAGAAAG GCTTCTATCA CTACTGAAAG TTTGAACAGT ACTTCTGGCG GAGGCTCGCC TTCTTTCCAG CCTACTTCTC CCATAATAGG GGCTTTGGAA ACAAACCTTT CCCATAGACA TAGCATGGAT TCGTACGATC CCGTTTCGTC TGACTACATA AAACCAAACT CTTCAATGTA CTTGTCAGAT GACAACAGCG CGAAACTGAC GCCTCCTGAA AACATTCAGA AAGGGTTTAC GTTGGTTTCA AAGGCTTTGT ATCCTGATTC TAACTTCAGA CACATCCAGA TCAGCCACAG ATTGCAGGTA TGTTTCCGGA TTTCGAAGCC AGATCCGAAG GACGGCTTCA AGATGCACCA TTACGAGGTT GTGGTCGATA CGCCTTTGAT TCTCTTGAGT TCCAAGTGTA ATGAGGGATC AATCCAGTTG CCCAAGTATG ACGACCTAGA AGGTGTTTTT TCTACTGTAG ATACCGAAAT CTCGTTCAGA ACACCCGACT TTGAAAGAAA CGGAATTTCC ATCAAAAGGC TCGATGAAAA TAGCTCTGTT GAACCATTGC CTTCTTTTGA AGAAGCCACT TCTTCACCTT CGTCGCCGAT TACCAGATCA ATATCCATAG GTGAAGATCC ATTGAGCAGA ATTCCATCAA ATAACTTAAT CCCTCTGTCA AATCCGTACC CAGACGAGCC GGCTCCAGCG TATGAACGTT CTCTGACAAC TTCTAACCAT GGAAGCCGTA ACAACAGCTT TGTAGCTTCG TCAAATATTG ACGAAGTTGT CAACAGCGAT TCCAACAATA GTTCTTCTTC TAGCTTGAGA AGGTCAACGT TGAGAAATTC GTTGCTGCAT TCCTTTGCTC CGTCT
|
Protein sequence | MATNFHLYHP SLRRDREESA TGGDLKTTPT NTSSSNNSNS NSTSLFGSSP KHTSHSHAHH RGFSFDKIKS KVSRRSSEPE VIPGSPDRRN SAFPMIVISE SNETSRNQSQ NENQNQMNVS AANTRRLRAH LVPNGNSTSR KQTKELVKLE TAHIISKKLE SILSDLGLQS PIPLKATNNT SSGSIAKSVK VYIANTNDCI FLAPASSASF TYEDVENGGA IPHDDEDDDE GMDSLVVDSG RSNSNFTSIR RGSVISDDES AVTSDEEAVS PDLATPRRLK KKMRFFNSPN YLCTKIDSDM PIPHTFAVVI ELEKDSTSVR DVKFDFSSVT NILWPSGDPY SRTHSKERFK IGSMEWSTSL GDSDFYINTN NSNDVRIKNI TPDDLARRTR EYKLVNIRNL ADGTDNANTS RKNSISLDFN DSPLNTHGNS NGNSNSSEVY KAGLYVFLLP IILPQHIPPT IISINGTLLH TLSINFNKTS DMLNRKVKVC STYNLPMVRT PPSFANSIAD KPIYVNRVWN DSLHYIITFP KKYVSLGSEH VVNVKLVPLV KDVIIKRIKF NVLERITYVS KNLSKEYDFD SDDPYCVKAH SSDNRTRERV VSLCELKTKS KQNSMIGVPG DPYKEEVVKC PDNNLLFSCY EPDEYENFRL EDSNTNKRKG KDKEETPTMI ASPLDINIAL PFLTTRMDKT MMTSTEEDPA HLHRSSVSRK ASITTESLNS TSGGGSPSFQ PTSPIIGALE TNLSHRHSMD SYDPVSSDYI KPNSSMYLSD DNSAKSTPPE NIQKGFTLVS KALYPDSNFR HIQISHRLQV CFRISKPDPK DGFKMHHYEV VVDTPLILLS SKCNEGSIQL PKYDDLEGVF STVDTEISFR TPDFERNGIS IKRLDENSSV EPLPSFEEAT SSPSSPITRS ISIGEDPLSR IPSNNLIPSS NPYPDEPAPA YERSSTTSNH GSRNNSFVAS SNIDEVVNSD SNNSSSSSLR RSTLRNSLSH SFAPS
|
| |