Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_80003 |
Symbol | |
ID | 4841046 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009048 |
Strand | + |
Start bp | 143797 |
End bp | 146671 |
Gene Length | 2875 bp |
Protein Length | 931 aa |
Translation table | 12 |
GC content | 44% |
IMG OID | 640392361 |
Product | predicted protein |
Protein accession | XP_001386424 |
Protein GI | 150866736 |
COG category | [J] Translation, ribosomal structure and biogenesis [K] Transcription [L] Replication, recombination and repair |
COG ID | [COG0513] Superfamily II DNA and RNA helicases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.586246 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | CCGACTACTC GAGCATGTCA GACAACGAAG AAGACTACGA CATTGCCCGC TCGTTGACCG TCAACCTCGA AGACAGTGAC TCTGACTCGG GCTCTGACTT CAGTGACGAG GAGCAAGAAG TCCAGGACAT AATTTCGAGC GAAGATGAAG CTGAAGAACA GCCCAAGAAG AAACAAAAGA CAGCCAAACC AGCCAAAGAG GCTTTCCCAT CTCTAGAGCT CTCAGGTGAT GAAGACGAAC AGGACGATGA CAAGGATATG GCGTCATATT TTGCTGCCAA CAATCCTCAG GCCAAGAAGG CTAAGGCTGG TTCTTTTCAA TCGTTTGGTC TTTCCAAATT GGTACTCACC AATATTGCTA AAAAGGGCTA TAGACAGCCA ACACCTATCC AGAGAAAAAC CATTCCACTC ATAATGGCCA ACCGTGATGT GGTCGGTATG GCCAGAACCG GTTCTGGAAA GACTGCTGCA TTCACATTGC CAGTAATTGA AAAATTAAAG GGCCACAGTG CTAGAGTAGG TATCAGAGCC ATTATCTTGT CACCTCTGAG AGAATTGGCT TTACAGACAT ATAAGCAAGT CAAGGAGTTC AGTAAGGGCT CAGATTTGCG TGCTATCGTA TTGACTGGTG GTGACTCTTT GGAGGATCAG TTCTCCAGCA TGGTTTCCAA CCCGGATATT GTCATTGCTA CCCCAGGTAG ATTTTTGCAT TTGCAGGTAG AAATGCAGTT GGATTTGAAA ACTGTAGAGT ATATAGTGTT TGACGAAGCC GATCATTTGT TCGAACAGGG TTTTGCAGAG CAGTTGAATG AGTTGTTGGC TGTTTTGCCT CCACAAAGGC AGTCTTTGTT GTTTAGTGCT ACTTTGCCAC GTTCGTTGGT GGATTTCGCA AAGGCTGGTT TGTCCAATCC TGTGTTGGTC AGATTGGATG CTGACTCCAA GATCTCTGAC CAATTGCAAA TGGCCTTCTT CACCACCAAG AAGAATGAAA GAGAAGCCAA TTTGTTGTAC GTCTTGCAAG AAGTCATCAA GATGCCATTA GGAACAGCAG AACAAATCAA GAAGTTAAGA CTCATGGATA AAAGAGTCAA CGACGAGGCA GAAGAAGAGG AGTTGGAAAA TGAATCGGGT TCGAAAAGAA AGTACAAATT CAAGAAGGAA AGGTTACCAT CTGCTAACGT TTTGCCCTCT CCGCATTCCA CCATTGTTTT CGTTCCTACT AAGCACCACG TAGAATACAT CACAACCTTA CTCCGTGACG CTGGGTATTT GGTTTCGTAC ATCTACGGTA CATTAGACCA GCATGCTAGA AAGAACCAGT TATACCAATT CAGATTAGGC ATGACTACCG TCTTGGTTGT CACAGATGTG GCTGCTCGTG GTATTGATAT TCCTGTCTTG GCTAACGTTG TCAACTACAC GTTACCAGGC TCGTCTAAAA TCTTCATTCA TCGTGTAGGT AGAACTGCGA GAGCAGGTAA CAAGGGTTGG GCGTACTCTA TTGTTAACGA AAAGGAGTTG CCATATCTTC TTGACTTGGA ATTGTTTTTG GGAAAGAAAA TCCTCTTAAC AGCAATGCAA GAACAGAAGT GTCAACTCTT GAAAGACAAG CAGGGAGACA ATTACGTCGA GCCAAAAGTC CAGTATACCG ACCGTTTGGT ACTTGGTTCT TGTCCAAGAT TGTCTTTGGA AACTTTTGAA GAACTTTATG AAAACTTGTT ACGTAACCAC TATGAGCTCT CTGTCATTAA GGAAGTTGCT GCCAAGGGTG AGAAGTTGTA TTACCGTACC AGGAAGGCTG CTTCTACCGA ATCGGTCAAG CGTGCAAAGG AAATCATGGA CACAGGAACC TGGGACGATC AGCATCTACT TTTTGGTCCA AACTTGGAAA AGGAAAAGGA AAAGTTCTTG GCCAAGTTAG CCAACAGACA TGTCAAGGAG ACAGTCTTTG AATTTAACAA AAAGGGTAAC GATCGTGACG AAGACAGTCT TGTCAGTTTC ATGCATAGGA GAAGAAAGCA ACTTGCTCCC ATCCAGCGTC GTGCAAGTGA AAGAAGAGAC TTGTTGCAAC GTGAAAGAGA AGCTGGTTTG ACCCATGGTA TCGAAGATGA GATCTTGAAG GCTCATGGGG AGACTGGATA CTCTGCTTCC GGTATTAACG ATGTAGACGA AGAGGAATTA CAAAATGCTT TTGAGGACGC TGATCAGTTA TCGTCCAAGT CGAACAAGAA GAACTACCGT GACGATAGGT TCTTCATTAG TCACTACGCA CCAGCTTCGG TAATTCAAGA TCAGCAGTTA AATATCACCT CGTCGTTTGC TAACGAGGCT GCGTCTGCTA CATTCGACTT GGACAACGAT GACAAGATCG GAAATGGCAA ACAAGTCATG CAATGGGACA GAAAGAAGGG TAAGTACATC AACTCCCAGT CCACAGACAA GAAATTCATC ATTGGTGAGA GTGGTGCCAA AATTCCAGCC ACCTACAGAT CTGGAAAGTT CGACGAGTGG AAGAAGAAAC GTAACATGCA ACCTGCCAAG GTGGGATCAC TTGAAACTGA AGGTGAAAGC AAGCAGCGTT TCAAGCATAA ACGTAATGCT GCTCCAAAGT TGCCTGACAA ATACAGAGAT GACTACCACA AGCAAAAGAA GAAGGTTGAA AAGGCTGTGG AATCGGGCCG TGACGTTAAG GGCTACCATA AACCGGGTCA AAGACTGGAA ATCAAGTCTA CTGAAGATAT TAGAAAGGCT AGATTGTTGA AAGAAAAGAA AATGGCCAAG AATGCCCGTC CTAGTAGGAA GCGCAAATAG ATATAGTTAA CTTATAGAAT CTGTACATTA ATAATAGTCT ACTAGAAATT GTATTACGCT ATGGG
|
Protein sequence | MSDNEEDYDI ARSLTVNLED SDSDSGSDFS DEEQEVQDII SSEDEAEEQP KKKQKTAKPA KEAFPSLELS GDEDEQDDDK DMASYFAANN PQAKKAKAGS FQSFGLSKLV LTNIAKKGYR QPTPIQRKTI PLIMANRDVV GMARTGSGKT AAFTLPVIEK LKGHSARVGI RAIILSPSRE LALQTYKQVK EFSKGSDLRA IVLTGGDSLE DQFSSMVSNP DIVIATPGRF LHLQVEMQLD LKTVEYIVFD EADHLFEQGF AEQLNELLAV LPPQRQSLLF SATLPRSLVD FAKAGLSNPV LVRLDADSKI SDQLQMAFFT TKKNEREANL LYVLQEVIKM PLGTAEQIKK LRLMDKRVND EAEEEELENE SGSKRKYKFK KERLPSANVL PSPHSTIVFV PTKHHVEYIT TLLRDAGYLV SYIYGTLDQH ARKNQLYQFR LGMTTVLVVT DVAARGIDIP VLANVVNYTL PGSSKIFIHR VGRTARAGNK GWAYSIVNEK ELPYLLDLEL FLGKKILLTA MQEQKCQLLK DKQGDNYVEP KVQYTDRLVL GSCPRLSLET FEELYENLLR NHYELSVIKE VAAKGEKLYY RTRKAASTES VKRAKEIMDT GTWDDQHLLF GPNLEKEKEK FLAKLANRHV KETVFEFNKK GNDRDEDSLV SFMHRRRKQL APIQRRASER RDLLQREREA GLTHGIEDEI LKAHGETGYS ASGINDVDEE ELQNAFEDAD QLSSKSNKKN YRDDRFFISH YAPASVIQDQ QLNITSSFAN EAASATFDLD NDDKIGNGKQ VMQWDRKKGK YINSQSTDKK FIIGESGAKI PATYRSGKFD EWKKKRNMQP AKVGSLETEG ESKQRFKHKR NAAPKLPDKY RDDYHKQKKK VEKAVESGRD VKGYHKPGQR SEIKSTEDIR KARLLKEKKM AKNARPSRKR K
|
| |