Gene Information |
Locus tag | PICST_63490 |
Symbol | |
ID | 4840426 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009047 |
Strand | - |
Start bp | 144753 |
End bp | 146414 |
Gene Length | 1662 bp |
Protein Length | 553 aa |
Translation table | 12 |
GC content | 42% |
IMG OID | 640391741 |
Product | predicted protein |
Protein accession | XP_001386242 |
Protein GI | 150866592 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACATCAA TAGCTATAGC GGCGATATCA GATAGCTCGG CGCGTCTTCC GAAGTGGGTA AATAACATCT GTTTCTATTC ATGTGTTTTG GCTACATTAA TAATCATAAG CTGTATCTTT CTTCATTTAC GAAATTACAG AAAACCGTTT CAACAACGTT TAATGCTCAG AATCCAAATC ATCGTGCCTT TGTTTGCATT ATCGTGCTAC TCAATGTTGA TCAACCAGGA GTCGCCCTTC AACAAATTCA TCTTGGAACC TGTGCGAGAG GTCTACGAAG CTTTTGTCAT CTACACATTC TTCTCGTTGC TTACTGATAT GCTTGGAGGT GAGCGTAATA TCATAATTAT GACTAGTGGA CGTGAACCGG TAAAGCACCC GGGCATATTA CTGTATATTC TTCCCCCACT CGATATTTCT GATCCGTATA CCTTTCTCGG AATCAAAAGA GGCATTCTTC AGTATGTATG GGCTAAGCCT ATAATCTGTT TTTCTACATT ATTATCGCAA GGTTTGGGCT TGTACGATGT CAACTCGATG GGTCCCAAGT CGATATATCT CTGGTTAACC ATTATCTACA ACGGCAGTGT CACCATGTCA TTATACTGCT TGGCCATCTT CTGGAAGATC TTGTGGAACG ATTTGAAGCC GTTCAACCCT GTAGGCAAAT TCTTGTGTGT CAAGTTGATT ATTTTTGCCT CGTACTGGCA AGGGGTCATT TTGGCGATTT TGAATGTGTT CCAGGTGTTG CCTGGAAGTG ACGAGTCTGA AGAAAAAGGC AGTATAGGTG TCTGCATCCA GAATGGACTT CTCTGTGTAG AGCTCATTGG CTTTGCTTTG GGCCATTGGT TTGCTTTCAG TTATCACCCC TTCACAATAT CACAGATACC GTATGGTAGA TTGAAGTTCA AGTATGCCTT TAAGGATATG GTTGGTATCA AGGACTTGAT ACATGATTTC AAGCTTACTT ACTATGGAGA CTATTACAAG GATTACAAGC AGTTTGACTC TGTGGAAGCC TTGATAGCCC ATCCTAGCTC CAGAGGGCGT ATGAGTAGAA TTAATCAAGG ATTACGCTAC CATAGCGACG GAAAACAAAA GCACTGGTTG TCCAACCAAG TCAGCACGCT TCAGCAGAAC AATACACATA TAAGAAGCAC TTCAGAGATA GCTGCTCTTC CCAATAGCCC GCCACAGCTC AACACAACTA ATAGCATCAG AAGCAGCAAT GAATACTCTG CATCGTTAAA TTCTACTGGA ACATCAATGA GAGCAATCTA TCCTGGCTCG CCCAAAAATG GATCTCCACC TGAGTCACCT GTAGTGTCCG GTTCGGAGCA ATTGCAATTT ATTTCAGAAA TTCTCAGAAG TGACAACTTT TTATCGTCTA TTAATTACAG CAAAGAGCTC TTGGACGAAG ATGAACTTTA CTATCAAAAT GCGTGTTCCG AAGTGCCTAA CTATAAGTTG GATCAGCCAG AAATTAAAAG GCTTCTCAAT TACCCAATAG TAGACGAAAT GATAGGAGGA CATGCGTATG GTTATAAGGT CAGAAGATTG AGACAGGAAC GTAGCTACAG ACAACTGCTG AGGGATTCAG AAGATCAGAT CCTAAATAAG GTGAATACAA ATGAAAGCTA CCGGTATGGT AGTATTGTTT AA
|
Protein sequence | MTSIAIAAIS DSSARLPKWV NNICFYSCVL ATLIIISCIF LHLRNYRKPF QQRLMLRIQI IVPLFALSCY SMLINQESPF NKFILEPVRE VYEAFVIYTF FSLLTDMLGG ERNIIIMTSG REPVKHPGIL SYILPPLDIS DPYTFLGIKR GILQYVWAKP IICFSTLLSQ GLGLYDVNSM GPKSIYLWLT IIYNGSVTMS LYCLAIFWKI LWNDLKPFNP VGKFLCVKLI IFASYWQGVI LAILNVFQVL PGSDESEEKG SIGVCIQNGL LCVELIGFAL GHWFAFSYHP FTISQIPYGR LKFKYAFKDM VGIKDLIHDF KLTYYGDYYK DYKQFDSVEA LIAHPSSRGR MSRINQGLRY HSDGKQKHWL SNQVSTLQQN NTHIRSTSEI AALPNSPPQL NTTNSIRSSN EYSASLNSTG TSMRAIYPGS PKNGSPPESP VVSGSEQLQF ISEILRSDNF LSSINYSKEL LDEDELYYQN ACSEVPNYKL DQPEIKRLLN YPIVDEMIGG HAYGYKVRRL RQERSYRQSS RDSEDQILNK VNTNESYRYG SIV
|
| |
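The record lists translation table 12 (the alternative yeast nuclear code), in which the codon CTG is read as serine rather than leucine. The following is a minimal sketch, not the database's own pipeline, of how the gene and protein fields above relate to each other. The function names `build_table_12`, `translate`, and `gene_length` are hypothetical helpers introduced here for illustration. The sketch assumes the gene sequence is given 5' to 3' on the coding strand (it starts with ATG and ends with TAA, even though the gene lies on the minus strand) and that start and end coordinates are 1-based and inclusive.

```python
from itertools import product

# Standard genetic code laid out in NCBI order (bases T, C, A, G at each position).
BASES = "TCAG"
STANDARD_AAS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def build_table_12():
    """Return a codon -> amino acid dict for translation table 12.

    Table 12 differs from the standard code in one codon: CTG encodes Ser.
    """
    codons = ("".join(c) for c in product(BASES, repeat=3))
    table = dict(zip(codons, STANDARD_AAS))
    table["CTG"] = "S"  # the only reassignment relative to the standard code
    return table


def translate(dna, table=None):
    """Translate a coding-strand DNA string, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in the record) is ignored.
    """
    table = table or build_table_12()
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = table[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gene_length(start_bp, end_bp):
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


if __name__ == "__main__":
    # Coordinates from the record: 144753..146414 gives the listed 1662 bp.
    length = gene_length(144753, 146414)
    print(length)            # 1662
    # 1662 bp is 554 codons; dropping the stop codon leaves 553 residues.
    print(length // 3 - 1)   # 553
    # First codons of the gene sequence, and a stretch containing CTG.
    print(translate("ATGACATCAA TAGCTATAGC"))  # MTSIAI
    print(translate("GGCATATTACTGTAT"))        # GILSY
```

Under the standard code the second fragment would translate to GILLY; the serine at that position in the listed protein sequence (`GIL SYILPPLDIS`) is consistent with table 12.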