Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_82707 |
Symbol | |
ID | 4838222 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009043 |
Strand | - |
Start bp | 319023 |
End bp | 322054 |
Gene Length | 3032 bp |
Protein Length | 901 aa |
Translation table | 12 |
GC content | 44% |
IMG OID | 640389537 |
Product | predicted protein |
Protein accession | XP_001383691 |
Protein GI | 150864733 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAGAAGT CCAAGATCAA GTCCATCTTG CCCAAGCCGT CGCCACTGCA TCCCCAGCAG ACGCCCTCGC CAAAATTGTC TGGCTCCGCA ACTCCACTTT CTGGAAAAAG ACGGTCTGTG GCCTCAGGAC TAGCAGACTC TAAGAAGCGA AGATCTCTTC CTACCAACAT CAGCTCGAAC ATGAGCTCAG TTCCAGCCAT AGTTCCTATA GCCCCTGCTC CTGTTTCTAT CAGCATGGCG GAAGACGATA GTCTGGAAAA GTCGAAGCAA ACCGGCCACA GACCTGTGAC TTCGTGCACC TTCTGTCGCC AGCACAAGAT CAAATGTAAT GCTCTGGAAA ACTACCCCAA TCCATGTCAA CGCTGTGAAA GAATGGGTCT TAAGTGCGAA ATCGACCCTC AGTTCAGACC CAAGAAAGGA TCCCAGATCC AATCGCTCAA AAGCGATGTA GACGAACTCC GGGCCAAGAT CGAAATCCTC ACAAAGAATG AATCGCTTTT GACCCAGGCA CTAAATCAAC ATAACATCTT GCAACAGCAG CAACAACAGC AGCAGCAGCA GCTGTATACA CCTAGGGCAC AGTCAACCCA TTCTACAAAC TCGCCTGTAA ACTTCCAGTC GCCGCAGCTC TATCCTGCTG GAACTTACCA ATCCTCCCCG AACTCTATAT CGTTACCATC GGGCCACCTC AATGACTCTG TAAATAGCAG CACCAACCCT AACCAACTCG CCCATGTAAT TCAGGAAGGC TCCGATACTT CGCCGTCTAC AAACAACACG CCTAACTCTC AACATTTGAA TAGAGAAGAA GTGCAATATG TCTCAGAGTT CATACTTGGA GAAGTACATC TTCCTCTAGA CAAGGCGAAC GACTTGCACC ATATCTTCAT GACAAAGCAT CTTCCTTTCT TGCCCATTAT CACCTCTCGA TCGGCGACAG AATTGTACCA TAAATCGCAA CTCCTTTTCT GGACAGTAAT ACTAACTGCA TCTCTCTCGG AACCTGATCC AACGTTGTAC ATGTCGTTAG CTTCGTTGAT CAAGCAGTTG GCTATTGAAA CCTGCTGGAT CAGAACACCC AGATCGACCC ATGTGATTCA GGCTCTCATC ATCCTCTCCA TCTGGCCCTT GCCCAACGAA AAGGTGTTGG ATGACTGCTC GTATCGTTTT ATTGGTCTTG CTAAAAATTT ATCGCTTCAG TTAGGTTTAC ACAGAGGTGG TGAGTTCATT CAAGAATTTA GTAGAACCCA AGTAAGTCTT GGCCCCGATG CGGAACGTTG GAGAACCAGA TCATGGATTG CCGTTTTCTT CTGTGAGCAG TTCTGGTCCT CGGTCTTAGG TTTACCTCCT TCCATCAACA CCACGGACTA TTTACTAGAA AATGCCAGGG TGGACCAGAC ATTGCCCAAA GACTTTAGGT GCTTGATTTC GTTGTCAATT TTCCAGTGTA AACTCGTAAA TGTCATGGGT ATTTCTGTAA CAAGACCCGA TGGTTTATTG GAGCCTCTGA ACAGAGCTGG CTCGCTAAAC ATCTTGGATC GTGAGTTGGA AAGATTGAAG TTCAAATTGA ACATTGTAGA TGGATCTGCA ATTGAAATCT ACTATCTCTA TGTGAAGTTG ATGATCTGCT GCTTTGCCTT TTTGCCAGGC ACGCCAATCG AAGACCAGGT TAAATACGTC AGCTCAGCTT ACTTATCAGC TACCAGGGTT GTCACGGTGT CTTCGCAAAT GTTGAAGGAC AATATTCTGT TGATAGAATT GCCCATATAT GTCAGGCAGG CGATGACGTA CTCAGTGTTG CTTCTTTTCA AGTTGCACTT GTCGCGTTAC TTAATTGACA AGTATGTGGA CAGCTCCAGG CAGCTGATAG TCACGGTACA CAGATTGTTA AGAAACACTT TGTCGTCATG GAAAGACTTG AAAAACGATA TATCCAGAAC AGCAAAGGTA TTGGAAAACT TGAACATCGT TCTCTATACC TACCCTGACA TCTTGTTGAA CGATAATTTA GAAGCTGGTG GTAGTATTAT CAACAGGATG AGATCACACC TAACTGCATC CTTGTTCTAC GACTTGGTGT GGTGTGTCCA CGAAGGTAGA AGAAGAACTA TGATTGACAA GTCGAAAAAG TCAGAGTCTT TGGAAGACAC GAAAATTCCA CCCAACTCCA CTACATCTAC TTCTGTCAGT AAAAGACCCG CACCTTTGCC GTTCTATAAC CAGATCACCA AGGACGACTT CAAAACCATC ACCACCACTA CACCCAACGG GACTACCATT ACTACTTTGG TTCCTACTGA TCAAGCTATG AACCAGGCTA GAAACGCATC TGGAAACAAA CCTTTGGAAA TCAATGGTAT ACCTTTGGCT ATGTTAGAAG CCACAGGTAG TATCAAGGAT ACTATCCGAG AGCTGCAGAC TCCAGCTCCA GAGGTAGACA ATCCAACCAC AAATGCTCCT ATACTTCCAT CTACAGTCAA AATCAAACTG GAGTACGACA ATGTTGTTTC TACACCTCAA CAGCCGTTAT TGGCTCACCA ATCACAGGCG CTTCAGCACC AGTCGTTTGC TATGCTTGGA CACCAACCTG TAGATACTAC GCCTAACCAA CCGATGTTCA TAAATAGTGA CTCGATGCAA ATCCAGCAAC CCAGCTTGGT GGATCCAGCA AGCGCACAAG CTACACCTAT CCAGTATATT GGTGCTCCCA TCAATGGAGT CGCTGATCAG ATGGATAACT TCTTCCAGCA GCAGTCTAAC GGTTGGTTGA ATAACGATAA CTACCAAGAT GATGATTTCC TCGGTTGGTT CGACGTGAAC ATGCGTTCCG ATCAATAAAC CAATTGTCCC TAAAATAATG AAATGACTTG TTCCTATTAT GTCCCTCTAT TCTCTGTCTA CTCTCTGAGT TTAATAATCT TATTATTATG TATATATTTT CCATCTATAT TCATTATTTG TTGATTGAAT TTGTTATGAT TGCTAAAAGA GTACTATATA CTATTAACTG CTTGTTAATA TAAATTTTCA AA
|
Protein sequence | MEKSKIKSIL PKPSPSHPQQ TPSPKLSGSA TPLSGKRRSV ASGLADSKKR RSLPTNISSN MSSVPAISKQ TGHRPVTSCT FCRQHKIKCN ASENYPNPCQ RCERMGLKCE IDPQFRPKKG SQIQSLKSDV DELRAKIEIL TKNESLLTQA LNQHNILQQQ QQQQQQQSYT PRAQSTHSTN SPVNFQSPQL YPAGTYQSSP NSISLPSGHL NDSVNSSTNP NQLAHVIQEG SDTSPSTNNT PNSQHLNREE VQYVSEFILG EVHLPLDKAN DLHHIFMTKH LPFLPIITSR SATELYHKSQ LLFWTVILTA SLSEPDPTLY MSLASLIKQL AIETCWIRTP RSTHVIQALI ILSIWPLPNE KVLDDCSYRF IGLAKNLSLQ LGLHRGGEFI QEFSRTQVSL GPDAERWRTR SWIAVFFCEQ FWSSVLGLPP SINTTDYLLE NARVDQTLPK DFRCLISLSI FQCKLVNVMG ISVTRPDGLL EPSNRAGSLN ILDRELERLK FKLNIVDGSA IEIYYLYVKL MICCFAFLPG TPIEDQVKYV SSAYLSATRV VTVSSQMLKD NISLIELPIY VRQAMTYSVL LLFKLHLSRY LIDKYVDSSR QSIVTVHRLL RNTLSSWKDL KNDISRTAKV LENLNIVLYT YPDILLNDNL EAGGSIINRM RSHLTASLFY DLVWCVHEGR RRTMIDKSKK SESLEDTKIP PNSTTSTSVS KRPAPLPFYN QITKDDFKTI TTTTPNGTTI TTLVPTDQAM NQARNASGNK PLEINGIPLA MLEATGSIKD TIRESQTPAP EVDNPTTNAP ILPSTALQHQ SFAMLGHQPV DTTPNQPMFI NSDSMQIQQP SLVDPASAQA TPIQYIGAPI NGVADQMDNF FQQQSNGWLN NDNYQDDDFL GWFDVNMRSD Q
|
| |