Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_33685 |
Symbol | |
ID | 4840987 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009048 |
Strand | - |
Start bp | 213361 |
End bp | 215370 |
Gene Length | 2010 bp |
Protein Length | 669 aa |
Translation table | 12 |
GC content | 39% |
IMG OID | 640392302 |
Product | predicted protein |
Protein accession | XP_001386636 |
Protein GI | 150866892 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.872469 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCTGTTT ACGGCAAGAA CTGGAGTTCG TTTCGAAAGA GAACACGTCT TCTGGCGGAT GCACCAGTTT TTTCCAGCGA CGAAGAAAAT GAAGAATTCA CTGATATAAC CGAGCCCACT TCAGTTTCAG ACAAACTAAT GTCAGTGATT CAAGATTCTC ACGTAGTTAC CAAGATAGGA GAGCAAGAAT TGCTTCCCTC GGCTGAGATT AATTCGAATT GGAAAACATA CAAATCTGTG CAGCATCACA AACGCAGCCT TTCAGATTTC AATCTTTCAG ATTCTCTCAC TACTTCTCCA CAGAAGCTAG TAACTGTAGT CAACGCGCTC AGCAACTCAC CAAGCCCAGT GAAAAGTCGG GCTAAACGAA GACTTGAAGA CGAATTAAAG GATTTGGCAA AAACTCCACC GAGAAACAAA TCCAAGAACA ATAATAGAGA GCAAGTCACT CCTGCTAAGA GTACGAATTC AACTGCAAAG AGAACTCCTA CATTCACACC CAAAGAAGCA CGCGACTGGG ATTCCTTGTT CGAAAGCATA GATGACGAAC TGGTTGGTCG AAATACCTTT CTAACGTCTC AAAATGATAG TGACAACCAT GGTGAAGAAA CGGACAACGA CGATGGAGAA ATAAACGTAG ATTTGTCAGT ATTTTCTTCC TATGTAGAAA ACTCAAATAC TTCCCCAAGT CGAAATTCTA GCGAAAGGAT ACTGAAAGGA ACCGCAAAAT CGAAGCTTAG AACATATGGG GATGAGCGAA GCTTTCTTCT AGAAGGAAAT GAAGAAGAAA ACTCGAAAGT ATCTATTGGC GATGAAATAC CAGTTGTGGA AGATGTGTTA AGCATCAATG ATCTCAGAAG TATCAGCAAA GAAAACCAAC GAAAGGAAGC TCTAGACTAT ATTTTGGAAG GATTGCAATT TACTGATTGC AAGAACTTGG CCACTGGAAA TGCTGTATTG GTATCATTGC TAGTAGACTT AGCCATCGAG TCGATCAAGA ATGGTTCAAA TGCACTTGAA AGAAACGGTG AAATCATCGC AACTAGGTTG CTAGCCATTT ATGAAAATAT TATTAATTCT AAGGGCAATG GAAAAGATGT TTTGTGTTGG CTAGTTAGTG TGAACTTTCT CTTTTTAGCA GCATCAGCAA CTCGGACTGA ACAAGTTGCT ATCAGTCAGA CTTTCAGACA TTGCCTATTA TCAATTCTCA AACTGCTTGG GGTCACTTAC AGTTCCAATG GTTTGCCTAT TTTGGTAAAG AAATCTCTTT TGCAATTGCT GGACATATTG CAAATTGAAA CTCCTCTCCA AGTGCAATTG ATAGAAGTGA TGAGCAATGT TCCTGATTTC CACAGAGCAG ACATATTTGA ACATGTCATT CAATTGTTTG CAACCGAGAC TAGATTGACC AACAAGATGA AGCTATTAAG CTATATCCAA TCGTATGTTG AAAGATCGCC TGAATTGGAT AGTCTTTACG ATCTCGAAAT TTGTATATTA GAATCTATGA ACAAGATAGA TTTTGTACAA CTTGACGATC TAGATGTTCA AGTGTTAAAG TTGGTCGTTG TGCTATCTAC ATCTTATGAC AATAATGAGC GAGTTGCCGA ATTGCTATTT GATCCCAAAT ATGTTTCGCC CATGATAAGG TATATCAATA GCAGCTACAG TTGCTTAAAC GACGAATATC GGCTCAACAT AGCATTGTTT CTCTTGGGAT TCTTGATAAA CTTTGTTGAG TCTGACCGGT TCGAATTGCA AAAGTTTGAT GATGTTAGAG ATAACATAGC TATATTTGAA GCAATTGATG CTAGTGCAAA GGACGAGGCT TCCGGTCACC TAATAGGCTA CAACAGTATA GTTTTGACAT ACTTACTGTT GAAGTATAGT GAAGATCTCG ACATAGATAT TCAACATCTA AAACACAAGT TGGACCATTT CAAGGAGAAG ATAGCTAATA CCAGAATCAA GACCAAGATT GATAGTTTGC TTACAGAATT GGCTAAATAA
|
Protein sequence | MSVYGKNWSS FRKRTRLSAD APVFSSDEEN EEFTDITEPT SVSDKLMSVI QDSHVVTKIG EQELLPSAEI NSNWKTYKSV QHHKRSLSDF NLSDSLTTSP QKLVTVVNAL SNSPSPVKSR AKRRLEDELK DLAKTPPRNK SKNNNREQVT PAKSTNSTAK RTPTFTPKEA RDWDSLFESI DDESVGRNTF LTSQNDSDNH GEETDNDDGE INVDLSVFSS YVENSNTSPS RNSSERISKG TAKSKLRTYG DERSFLLEGN EEENSKVSIG DEIPVVEDVL SINDLRSISK ENQRKEALDY ILEGLQFTDC KNLATGNAVL VSLLVDLAIE SIKNGSNALE RNGEIIATRL LAIYENIINS KGNGKDVLCW LVSVNFLFLA ASATRTEQVA ISQTFRHCLL SILKSLGVTY SSNGLPILVK KSLLQLSDIL QIETPLQVQL IEVMSNVPDF HRADIFEHVI QLFATETRLT NKMKLLSYIQ SYVERSPELD SLYDLEICIL ESMNKIDFVQ LDDLDVQVLK LVVVLSTSYD NNERVAELLF DPKYVSPMIR YINSSYSCLN DEYRLNIALF LLGFLINFVE SDRFELQKFD DVRDNIAIFE AIDASAKDEA SGHLIGYNSI VLTYLSLKYS EDLDIDIQHL KHKLDHFKEK IANTRIKTKI DSLLTELAK
|
| |