Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_29648 |
Symbol | |
ID | 4836659 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009042 |
Strand | + |
Start bp | 771797 |
End bp | 774121 |
Gene Length | 2325 bp |
Protein Length | 774 aa |
Translation table | 12 |
GC content | 41% |
IMG OID | 640387974 |
Product | predicted protein |
Protein accession | XP_001382386 |
Protein GI | 150863789 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.26394 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGCATAG ACTACATCAA CGCCAACTTC ACAGGTCTCG ACGACTTGGC GTCTGTCTCT TCGAAACTAG ACCATCTCAA CAAGCTCCGT CATTCCATCA AGAATGCTGT AGACGTGAAA GTTGGCGATA CTGAATCTGC ATCAACCTCT TCGTTCTCTG TAGACTCTGA AAAATTGAAC CTGTCCATTG ACAAGATCAT TTCCGTGTTG GACTATTCAC TGGAAACCAC TGATTTGGCC GATGCGCTTG CCAACATCGA CGCCTTGATT CTCGAGTTTG GTTCTCTTGA CTTTCTCACG AAATTGCGGA ACCAATTGGC AGAAAAGATA TCCGTTCAGA AGTCGATCAA GTTGCTTCAT GAAGCTAATG ATGTCCACCA ACAGCTTACT TCGGCATTAT CGATAACTGA ACTCGCAGAC ATTGCCAAAA AGGTCAAGCT ATCAATCCCC GTGGACGATC CTGTTTCAGA TCAGTTGTTG GAGATCTTGA ATACTAAAGT AGAAATGTTG GTAAGTGAAA AGAGAAACTC TATACAACCT AAATTTTCAA AGCTACTCCA CGATTCTAAC TGGTTACAGT CGAATTCGGA TATAAGCACT ATTCCCTCAG CTACTCTCAA CGCTATCAAT AGATACGTGA ATGATTTGGT CGATTTGCAA TCGATCTTAG ACCCTCCACT GTATCCTTCG ACATGGTGGG CCCTTGATAT CTTGTTAGAG CCAATAATAA CCAGGTTCAA CTACCATTTC AACACTCCGC ACATGGAGAC TAATAAGATA TCCAAGCCCG AATGGGCTCT TGACTTCATC GAGAAGTTCC TCTCGGATAA CTTAGCTCTT TTGAACTTGG TTGTAGAACA TATCTTCAAG TTGTGCCACA GAATAGCGAC ATTTGAAATC ATCACGACGT TGTTGAACCC ATTGCGAGAA AAGATCTTGT CGTCTATCAA GGTTTTGAAT GCCAACATCG ACAAATATCT GGAACAGCCT GTGAACCTCG AAAAGAGCGG AAGACTTTTG TCCCATCTCA TTTTTGAGTT ATCATCCTTC GATCAAAGAC TTCGTAATAT TTACAAATAC AATCCGTACA TAGAGAATCT TGCCCAGGCT CCTTCCAAGA AGTGGACTGG TCTTACCGGA GATGTTATGT TACGCGGGAA CGACGAAAGT TTGGCTGTGA CGAACTGGTT GAACTTTGAG AAACAGTTGG CCAACAAAAG ATTCAATAGT GAAATTCTCA ACTCTGACGA TGCGTTCAGG ATTGATTTCG ATTACCAAGG CTCTTTCGAT GAAGACGACA GTGAAGTGGT AAAGAATGTT ATCAAACCAA CCTATTCAGC ATATGCCATC GTAAAGTTGT TCGACAACCT CAGTAGTCAT TTCCAGACGT TGAGTATTGT TAAATACCAA TTGAAGTATG TATCGTCGAT TCAGCTTGAT TTGCTAGACA AATATTTTGA GAGATTAGAC AAGATGCGTA AAGAGTTTAA CAACAGCTTT AACCAGAAAG CTATGTTGAA CCTTATCCCT GGTGGTCTCA ACGAAAAGAA TGTAAGAGAG GTGAATACCG ATACTGTCAA GCTTGGAATG GCCAATCTTC AGACAATCTC GGAACTCTTC TGTCTGGCCA AGTTCATCAG CAATGCCTTG GAACAATGGA GTGAAGAGTT GATCTTCATT CAACTATGGG AAGCTTTCAA GAGCGTCTCT AAAGATGCTG GACTCAGCAT ATTTGATGGA ACTATCAGAC AGTACGATTC ACTTGTAGAG AAGCTGTTGG TATTGTACGA GGAGTTCTTC CGAAAAGAAA TCAGAACTTC TTTGAAAAAC TATGTCAATT CCAGCCAATG GAATATTTCT TCCAATGAAG CACCCGAAGT TCCCCAGGAA TTGATCGTTT TGGGTAATAA CCTACTCACA TACTTGGACT ATGTAAAACG TACCATGTCA AAGTTGGACT ACTTCCTTGT TTCAGACCGG GTGGTGTCGT TGATCTCCAT TATCTTGACA GAATACATTG TTACCAACAA CCAGTTCAGC AAGGATGGAG CTGCACAATT GAAATTTGAT TTTGAGTATA TCGTCAGTGA GTTGAGAGAT AGTTTATATC TTGAAGGGGA TAAAACTGAG CTTTCTAATA GTCTGAACCA TGATTTCTTG CGTTTGAGCC AATCGGTGGA ATTCCTCTCA AAACTTGATG CTGCTACTGC CAAACAGTAC CAGAGAAAAC AGGAACAGTT CCAAGAATTG AGAGACCAGT TTGCGGACGG ATTGGAAGCA TTGTCCAATC ATAACATAGG TGACTTGCTC TTGAGAATCG TATAA
|
Protein sequence | MGIDYINANF TGLDDLASVS SKLDHLNKLR HSIKNAVDVK VGDTESASTS SFSVDSEKLN SSIDKIISVL DYSSETTDLA DALANIDALI LEFGSLDFLT KLRNQLAEKI SVQKSIKLLH EANDVHQQLT SALSITELAD IAKKVKLSIP VDDPVSDQLL EILNTKVEML VSEKRNSIQP KFSKLLHDSN WLQSNSDIST IPSATLNAIN RYVNDLVDLQ SILDPPSYPS TWWALDILLE PIITRFNYHF NTPHMETNKI SKPEWALDFI EKFLSDNLAL LNLVVEHIFK LCHRIATFEI ITTLLNPLRE KILSSIKVLN ANIDKYSEQP VNLEKSGRLL SHLIFELSSF DQRLRNIYKY NPYIENLAQA PSKKWTGLTG DVMLRGNDES LAVTNWLNFE KQLANKRFNS EILNSDDAFR IDFDYQGSFD EDDSEVVKNV IKPTYSAYAI VKLFDNLSSH FQTLSIVKYQ LKYVSSIQLD LLDKYFERLD KMRKEFNNSF NQKAMLNLIP GGLNEKNVRE VNTDTVKLGM ANLQTISELF CSAKFISNAL EQWSEELIFI QLWEAFKSVS KDAGLSIFDG TIRQYDSLVE KSLVLYEEFF RKEIRTSLKN YVNSSQWNIS SNEAPEVPQE LIVLGNNLLT YLDYVKRTMS KLDYFLVSDR VVSLISIILT EYIVTNNQFS KDGAAQLKFD FEYIVSELRD SLYLEGDKTE LSNSSNHDFL RLSQSVEFLS KLDAATAKQY QRKQEQFQEL RDQFADGLEA LSNHNIGDLL LRIV
|
| |