Gene Information |
Locus tag | PICST_30446 |
Symbol | |
ID | 4837900 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009043 |
Strand | + |
Start bp | 76216 |
End bp | 77760 |
Gene Length | 1545 bp |
Protein Length | 514 aa |
Translation table | 12 |
GC content | 46% |
IMG OID | 640389215 |
Product | predicted protein |
Protein accession | XP_001383296 |
Protein GI | 150864470 |
COG category | |
COG ID | |
TIGRFAM ID | |
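
The length fields in the record above follow from the coordinates. The sketch below is a minimal consistency check, assuming the start and end positions are 1-based and inclusive and that the unspliced CDS ends with a stop codon. The function names `gene_length` and `protein_length` are illustrative and not part of any database API.

```python
# Coordinates copied from the record above (assumed 1-based, inclusive).
START_BP = 76216
END_BP = 77760


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose final codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the terminal stop codon.
    return cds_length_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1545 514
```

Both values agree with the Gene Length (1545 bp) and Protein Length (514 aa) fields.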
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.474759 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTCGTG AGGACGCGTC TGTTCCAAGA TCGCACGATT CGGCTGATGC CAGCCCCAAT TCCACAGCAA AAGAGATACC TGTGCCGCAA CCCCCTCTAT TGAACTTGAA CTTGAATTTG AACATGAATA GCGGCTTCAA TCTATCCAAC TGGTGGCACC AGATCACACT GATTGAAGCT GACAGTAGTG ACGACCGCAA CAACAATAAT AGTAACAATG GCATCAACGC TGCAGAGTCC GATTCAGCCC AGTCTCAACC TATTGTAGTC TTGGGCCATT CATACCAAAC GACCGAAGAA GCTCACGAGG ATATCATCAA GAAGTTGTGT CTCACATATC GATATGGCTT CGAGCGGATA CCCCGGGCTG TTAATGGTCC CAGTCCGTTG TCGTTTATGC AATCGGTGAT CTTCAGTAAG AGTCTTCTCT ATAATCTCCA GAACTTCAAC AACTTCATCG AAAAGGAAAA CTTCACTACA GATGTAGGAT GGGGGTGTAT GATACGCACT TCACAAAGTT TGCTCGCCAA TACCTTCGTG CGTTTGCTAG ACAAACAAAG CGACATTATC GCTCTCTTCA ACGATACCTA CTTAGCACCG TTTTCATTGC ACAACTTCAT TCGTGTCGCC TCGTCACTGC CATTGAAGGT CAAGCCTGGC GAATGGTTTG GTCCCAATGC TGCATCTCTC TCGATAAAAC GTCTCTGCGA TGGCTATTAT GATAATTCGA CGTCAGAGAC GATCTTACCA CGAATCAATG TGCTTATCAG CGAAAGCACT GATTTGTACG ATAGTCAGAT TGCCCAGTTG CTTGAGCCAA GCACCGAGAC CAAGGGCTTG TTGGTACTCT TGCCCGTCAG ATTGGGTATC GACAGCATTA ATTCTTATTA TTTTTCAAGT TTGCTCCATC TTCTTTCGCT TGAGCAATCT GTAGGAATCG CCGGAGGCAA GCCGTCCTCG AGTTTCTACT TCTTTGGCTA TCAGGACAAT AGTCTCATCT ACATGGATCC ACATTCAGCT CAGATATTCA GCAGTGACAT TGATATGAGC ACCTACTACG CCACACGATA CCAGAGGGTT GACATTGGCA AGTTGGATCC GTCTATGTTG ATTGGAGTGT TCATTCGTGA CTTGACACTG TACGAAAATT TCAAAAAGAG CTGCCTTGAT GCCGCGAACA AAATTGTCCA CTTCCATGCG ACGGAGCGTC TGACGGTACC TGAGTCCAGA CGAAAGAACT CCGAGTTCGT CAACATCAAC AGAAGCGATT TGAAGGACGA AGACTATATC AATATCGACA GAGTCAACCG CTTGGACAGC ACTGACGACT TCATTGACTT AGGCGATGAC TATGTAGAAA CCAACACGAA CTTGGAAGAA GCTACTCCGC TGGCGGAAGA TACAGTTCCT GTTTCTACAT TAAGTGCCAG CGAGCTGGAG ATAACAACAT CTTCATACGA AACACCTACT TCAAAGGATG ACAACAGTTC CAGAGCAAGC TTGGACGTGG TAGTGCTCGA CACGACAGGT GAACAACAGG AATAG
|
Protein sequence | MAREDASVPR SHDSADASPN STAKEIPVPQ PPLLNLNLNL NMNSGFNLSN WWHQITSIEA DSSDDRNNNN SNNGINAAES DSAQSQPIVV LGHSYQTTEE AHEDIIKKLC LTYRYGFERI PRAVNGPSPL SFMQSVIFSK SLLYNLQNFN NFIEKENFTT DVGWGCMIRT SQSLLANTFV RLLDKQSDII ALFNDTYLAP FSLHNFIRVA SSSPLKVKPG EWFGPNAASL SIKRLCDGYY DNSTSETILP RINVLISEST DLYDSQIAQL LEPSTETKGL LVLLPVRLGI DSINSYYFSS LLHLLSLEQS VGIAGGKPSS SFYFFGYQDN SLIYMDPHSA QIFSSDIDMS TYYATRYQRV DIGKLDPSML IGVFIRDLTS YENFKKSCLD AANKIVHFHA TERSTVPESR RKNSEFVNIN RSDLKDEDYI NIDRVNRLDS TDDFIDLGDD YVETNTNLEE ATPSAEDTVP VSTLSASESE ITTSSYETPT SKDDNSSRAS LDVVVLDTTG EQQE
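
The record lists translation table 12 (the alternative yeast nuclear code), in which CTG is read as serine rather than leucine. This is why the gene segment `CAG ATC ACA CTG ATT` appears as `QITSI` in the protein sequence. The sketch below is a minimal illustration, assuming the NCBI codon ordering (bases in the order T, C, A, G, third position varying fastest). The helper `translate` is a hypothetical name, and only short excerpts of the sequence above are used.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for NCBI translation table 12, in codon order TTT, TTC, TTA, TTG, TCT, ...
# It differs from the standard code only at CTG, which encodes S instead of L.
AMINO_ACIDS_TABLE_12 = (
    "FFLLSSSSYY**CC*W"
    "LLLSPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS_TABLE_12)
}


def translate(dna: str) -> str:
    """Translate a coding sequence with table 12, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three 10-base blocks of the gene sequence above.
print(translate("ATGGCTCGTG AGGACGCGTC TGTTCCAAGA"))  # MAREDASVPR
# Excerpt containing the CTG codon.
print(translate("CAGATCACACTGATTGAAGCT"))  # QITSIEA
```

Running `translate` on the full gene sequence and comparing the result with the Protein sequence field is a straightforward extension of the same check.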