Gene Information |
Locus tag | PICST_32266 |
Symbol | |
ID | 4839708 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009045 |
Strand | + |
Start bp | 1075532 |
End bp | 1077700 |
Gene Length | 2169 bp |
Protein Length | 722 aa |
Translation table | 12 |
GC content | 40% |
IMG OID | 640391023 |
Product | predicted protein |
Protein accession | XP_001384884 |
Protein GI | 150865604 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
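The coordinate and length fields above are mutually consistent, which can be checked with a few lines of arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon; both assumptions agree with the values in this record but are not stated by it. The function name `cds_lengths` is hypothetical.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS."""
    # Assumption: coordinates are 1-based and inclusive, hence the +1.
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Assumption: the last codon is a stop codon and is not counted as a residue.
    protein_len = gene_len // 3 - 1
    return gene_len, protein_len


gene_len, protein_len = cds_lengths(1075532, 1077700)
print(gene_len, protein_len)  # prints: 2169 722
```

Because the record marks the gene as not spliced, no intron lengths need to be subtracted before dividing by three.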
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.110274 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.90348 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTTCGAC GCCAGGGCAG GTCTTTCGTC CTCGTTTTGG CGTTGCTAGC AATATTAGTG ACTTTGCACT TATTTTTTTG GGATGATAAA CTAGACATTA GAAGCAAGGT TTTCAAGCAG AAGAATTCTG GACAAGTTCC AACAAATAAA GCCGAAGTCC CCCCAAAGCC AGGTCTGAAA TCACATCCAA AGGATCCAAA GGTTCCAAAG TCTTCGAAGT CTTCTGACGA TCACAAGACA GTTCAACGAG GCGACGTCAT CTCCAAATAT GTTGGAAAAA GCAAGTTGAT AGTGTTTCCT AAAGCATTCG AAGAGTCAGA TCTGAAGAAG CTATACAAAC TCTATACATC ACAGTTCAAA GAACTGCCTC CGCGACGGAG CAAGATAATT AGATTTCTGG ATCCCTCGCA AAAGACTGAT CCTCTCAGTA ACTCCATAAA GAATTTGAAG TATCACAAGC ATCCCGTCCA GCTGTTTAAT GCGTTTACTG ATATAGAAGG AAATATGGAA AAATGTGGAT TGCTCGAAGA TAAGTACGAA ATTGAAGTGC TGAAAAGCTT AATGAAAAGC AAGTCTTTGA AAAAGATTGT ACAGAAGTGG GTCAATGAAA ACTCCACATA CTTCCAGGAA ATAAATGAGT TCTTCCACAC TCCTCTCCAG CAGCAACTCA AAGAAGGTAG TGTTGATCTG CATTGGTTTA GATTGGCGGG ATCATCAGTA TGGTTAGAAC AATATGGAAT CCATTTCATG GTCTCAAGAA TTATATATGC CGAAAACAAA GATAGGGGAA CGCCAAACTT ATCTATGATC TACGCTCAAG CTTATGATGA AAACTGGAAA GAAGTAGAAA ACTTAGAGTT GGTGGTTCCT ACAAATAACC CAAACCTATA CCTTGAATCC AATGATCCCC CTCAATTTAG ATTGCAATTG CCTCAAATCT TACCTATACC AATTCACCTT GATAAAGAAC GTCAATGGCC CGGATTTTAT GGTGCTGAAG ATCCTCGAAT TATCGCTGTG AAAAACGCAA GAGGTTTCGA GGAACCTTTG GTTTTCTACA ACTCCTACCA TGGAAAAGTT GTGAACATTG AGGAAATCGA TGAAGGTAAG TCAACTGCAC ACCTACTATT CTCCAGAAAT ATGTTCATGG CTTGGCCATT CCAGACTCAA AGAGGGAAAT ACAACGTTCA AGACTTGCCC TCGAAATACC ACAACAACAT ATTCACAAGA GTGTTGGAAA TAAAAGAAGC AAATAAAGAA AGAGAAGGAA AACAAAAGAA CTGGACCCCA TTCATTAGCA GCCAGGATAG GAAGAGTTTC GGATATGACA AGTACATCTA TATGAGTATT AGGATTGAAC ATTTGCAAAT TTTGCGGTGT CCAGTAGTAG GTGGGAGCGA CCAATTCGTG ACTGAATGTG AGGAGGTTTA TCTATTAAAC CCAGAGAAGT CCAACAACGA CGGAATCGGC CCATTGAGGG GTGGATCACA ATTTGTAAAC ATTAACAACA TGCTTGAGTC ATATTCCGAG CTTCCAGAAG CTAGGAAGCT TATAGATAGC ATTCCAAAAG GAAGGGAACT TTGGTTTTCT TTCGCCAGAG CAAATCTAGA ATATTGCGGC TGTGGTGTAA AGATGTATAG ACCTAATTTG GTTGTGGTGG TTAAGGATGG CGATCAGTAC AAGATTAGTT ATGTCAGTTC CTTCGTTGAT TTAGCGGTCG AGCAGCTTGG CTGGATTTTG AAGGAGTCTG ATAACTATTG TCCAGAGAAT GATGGTTCAG TTATGATTCC TAATGGTATT 
GCGTCTTGGA CCCTTCGAGA AAATGGTGAA AAGACAAACA TTGATGACTA CATGACGTTG AGTTATTCAC TTGCAGATGC TACAGTTGAA ATTATTCACA TCAGAGGGGT TCTTAAAGCA CTTCTTGGTC TTGACTCCAA AAATCAAAAC TACAAGCTCT TCGAGAAGTC AAGTGCGCTC GATGCTAAGA CTGTGGGTTA CAACAATGAC AATATAGATT GTGCTTTAGA GAATTCTAAG CAGTACTGTC AGGCATTTGG AGAAAAGGAA AAGAAGAAGA TGGAGCTGAG AGTGAAGCCT CAAGATAAAG AAGAAAACAA ACAAAGACAA GATGGAAAGG AATCCGAAAC ACAGGAACCG AAAGAGTGA
|
Protein sequence | MFRRQGRSFV LVLALLAILV TLHLFFWDDK LDIRSKVFKQ KNSGQVPTNK AEVPPKPGSK SHPKDPKVPK SSKSSDDHKT VQRGDVISKY VGKSKLIVFP KAFEESDSKK LYKLYTSQFK ESPPRRSKII RFSDPSQKTD PLSNSIKNLK YHKHPVQSFN AFTDIEGNME KCGLLEDKYE IEVSKSLMKS KSLKKIVQKW VNENSTYFQE INEFFHTPLQ QQLKEGSVDS HWFRLAGSSV WLEQYGIHFM VSRIIYAENK DRGTPNLSMI YAQAYDENWK EVENLELVVP TNNPNLYLES NDPPQFRLQL PQILPIPIHL DKERQWPGFY GAEDPRIIAV KNARGFEEPL VFYNSYHGKV VNIEEIDEGK STAHLLFSRN MFMAWPFQTQ RGKYNVQDLP SKYHNNIFTR VLEIKEANKE REGKQKNWTP FISSQDRKSF GYDKYIYMSI RIEHLQILRC PVVGGSDQFV TECEEVYLLN PEKSNNDGIG PLRGGSQFVN INNMLESYSE LPEARKLIDS IPKGRELWFS FARANLEYCG CGVKMYRPNL VVVVKDGDQY KISYVSSFVD LAVEQLGWIL KESDNYCPEN DGSVMIPNGI ASWTLRENGE KTNIDDYMTL SYSLADATVE IIHIRGVLKA LLGLDSKNQN YKLFEKSSAL DAKTVGYNND NIDCALENSK QYCQAFGEKE KKKMESRVKP QDKEENKQRQ DGKESETQEP KE
|
| |
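The record lists translation table 12 (the alternative yeast nuclear code), in which the codon CTG is read as serine rather than leucine. A minimal sketch of how the protein sequence follows from the gene sequence under that table is given below. It builds the standard codon table from the usual TCAG-ordered amino acid string and overrides only CTG; the helper names `codon_table` and `translate` are hypothetical, and start-codon differences between the tables are ignored.

```python
from itertools import product

BASES = "TCAG"
# Amino acids of the standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ...
STANDARD_AA = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def codon_table(ctg: str = "S") -> dict:
    """Build a codon table; ctg='S' mimics table 12, ctg='L' the standard code."""
    codons = ("".join(c) for c in product(BASES, repeat=3))
    table = dict(zip(codons, STANDARD_AA))
    table["CTG"] = ctg  # the only sense-codon change made here
    return table


def translate(dna: str, table: dict) -> str:
    """Translate a coding sequence, ignoring spaces and stopping at the first stop codon."""
    dna = dna.replace(" ", "").upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = table[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
print(translate("ATGTTTCGAC GCCAGGGCAG GTCTTTCGTC", codon_table("S")))  # prints: MFRRQGRSFV

# Bases 169-180 of the gene sequence, which contain a CTG codon.
print(translate("CCAGGTCTGAAA", codon_table("S")))  # prints: PGSK
print(translate("CCAGGTCTGAAA", codon_table("L")))  # prints: PGLK
```

The second excerpt corresponds to residues 57 to 60 of the listed protein sequence (PGSK), which shows the serine that a standard-code translation would render as leucine.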