Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_66618 |
Symbol | |
ID | 4851730 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009068 |
Strand | - |
Start bp | 2675602 |
End bp | 2679783 |
Gene Length | 4182 bp |
Protein Length | 1103 aa |
Translation table | |
GC content | 42% |
IMG OID | 640393438 |
Product | predicted protein |
Protein accession | XP_001387079 |
Protein GI | 126275416 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.413972 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | CGTCCTCCTG AATCTAACTT GGGCCCGACC GATTGCTTAT TTGAGATTCA CAACCTCAAT TGTATGGTGT AAAATTGTTT GTGGAAGTTA GAAGTGTGCT TCTTCTTTTT TCGGCCTGTT TTCTCAGCCT CTTTATTTAG TCGTCATTTT ATATATTTCT TTAATTATTC ATTTACTTGT TCATTTGCTC ATTTGCTCAT TTGCTCATAT CATTTATCCA TTTCTTCACT TTTGCTACCC AGCAAGTGGT TAGTTACCGG ACCCCCACAT CCATATCCAC ATCAAGACGA CCGAGTCCTA CATTTCCAGA TTCATATCAC AGGCTGTTGT GCCGAAGCGG AGTCTATCAT AAGATCTAGG GGTTTTCTCC ACCTTGCTGT CTGCAGCGTA CAGAAGATCC ACCCCCGAAA AATTGACAGG CAGATTGTGA AGTCACTCCG ACCTGCCGCC TACCTAGATA TATCAGTCTC GGACATTACT GTCAACCTGT CAGCCGTGCT CCAGCGTCAA TTGCTCTGTC TTTCTTGCAT CTGAAATATA TATGTCAAAC CCCATCAGTA CTCCGATCCA GAGCCAGAAT TCAGCTGATT CGTCCACCAC ATCGTCCTCA CGTCGACCGG TGGCGAGAAG AGCTTGCCTT TCATGTCGTG AGAAGAAGAT CAAATGCGAT GGCGAGCCAA TCAGCACCAT CGTGTCTGTC GACGGCTCCA ATAAGATTAT CCCCCAGCTG ACGAGAATAT GTTCTAATTG TAAATTTCTC GGTATAGAAT GTGTGTTTGT CCAGTCCAAC AGAGGTGGAC GACGCCGAAA AAGAGCTTCT ATCTCCCATG ATATCGACGA CCACGCCGAC GAATCGTATA TAGAGGGCCC CGACCACAGT AAAAAATTCC AACCTGATTC GACCAACTCT AGCTCTACAA TTTCAAGTTC CGCACCCGAC TTTCCGTACC ATATTCGCAG TGGTAGCGAG TCGGTCAATT ATTCCGCACT GAGCCCTAGG CCCACCGGCC ACGAGCCGAC ACTGAAAAAT TCCACAGACA GACTCAGTTC GTTCATACTA GACCAGCAAC GAACCTCTCT GATGCTGTCA TCACAGTTCA GACCAGTGTC ACCAGCATTT TCTTCGGTAT CGACACTTGA GAGACCGCCC ATGTATTTTT CACGTACAGA GCTGAACTCA GACTCTGCCT CTATTTCTAC TGCGACTTCT GGTGCCAATG CTGCCTACCA TGAAGACTAC GCCAGACATT ATAACCCTTA CGGGCCTCCA CCACCTCATC AAGGCTATCC ACATCCACCA CCTCCACATG TCCCCCACGG ACCTCACAAC CAACAGGGTC CCCATGACCA TCAAGGGCCT CACGGTCACC ACGGTCCTCC CCATCACCGA CATGGTCATT TCCACCATAG AGGTCCTGGT CGCGGCGGCA GAAGAGATTG GTTTGATCAC TACGGGCCTC CACCTCCCCC ACACCAGGGT ATGGGCTATG GACCTCCTCC ATGGGCTAGA AGACCATGGG ATGCCTGGGG CCCTCCTCCA CCGGGCCCAC CACCTCCTCC ACCGCATTAT TCCGAAAGCC AACTCCCTAA TAGAGAAGTT TACGAGTCCG CTCCAGCCGC ACCAGTACAA GCACCTCCAG CACCTCCAGC ACCTCCAGCA CCTCCAGCAC CTTCAGCACC AGCAAGTATA TCGTCTATTT CTAATCAGTC TGCCATTGCT GCTACTACAG CTCAAGAAAC AGAGTCCTCC ACTACCCAAA CTAAGTTGTC TGAAACACCT GTATCTACTA CTCACTCGGT TCCAGTTATA TCAGCTAACC TTACCAAAAG TACTTCTGAG TCGTCGCTAA TCGAGGGATC CGGAAAACCT CGTCTTCCTC CTATCGATGG TATACCTGTC AAGCCACCCA AGTCGGCGAA CTCCGAGCAT GGTTTCGCCG AAACCGACAA AATGTCACAA ATAGTGTTGC CGGGTATTTT GGGTACACAA TCGATTTCCA TGGTCGGCGG TGTCTCGCTG ACTTCGAATT ATAGCCTTTC GAAATCTTCC ACGACGACGT TGGAGTCGGC AAAAATTGAG AATGACTCCA ATAGATCGAT TTCTTCTGAA CAAAAACCGA GATTGCTAAA TTTACCTAGT TCTATCTACG AACAGGATAA GTCGACGACA GTGTCCTCTG TACCTCCATT TTCAGATTAT GAATTGAAGC ATTACAATCT ACCTCCGTGG GATATTTTGG CCCAAATATT GGATCTATAC TACGCTTTCC AACATCCAAA TCATAGACTT TTAGTGCCGA AGAAATTACT ACTTTCCAAA TTATCTCTAG GATACTCGAG CTCAATTTTG CATGCGTTGA TCGCTTCAAC TTGTCCTTTG TTGCTGGTCT TCAATGTTGA TGTTCCTAAG TCTTGCGATG AGAATTATTG GATCGAAAAA GTTTTCCAAT ACTGGGACGA CCTCAATGAT TTCGGTATTA TCTTAACCTA CAGTTTACTT TCCAAAACAT CGTCATATCG TTTCCGTTTG TCAAAACTTA ATGAATTTAA TATCAAAATA TGGGAAGCTA TTTACAACAA CAATTACGTG GAAATTTACA ACAGCACTAA ATTCGAGTTG GAAAACAGGG ATAAAAGCAA AACGTATACC TCGAGACAAA TATTCGAACG AGAGCTGATC ATTAATTTAA TATGGAATTT TTATGTGAAC GACCTCATAC TATTGAGGTT TAATTTGGGA AATCCGTATT TCAAATTATC CACCAGGCTC CAGGATTTTA AATTTAATTA TGAATTAGAC ATCTACCTGA AGAAGTTGCT TTTGCCGTTG GAACTCAACC TGGGATCTTC AAGATCTAAT TGGACAGAGT TGAATGATTC TACATTGACT CCATCAGGTT CCAATTCTGT TATCATTTCA GCAAAGATTC TTGAAGGTGT CATGACAAAA ATATCTAACG AAGAACTTAC GAATGACGAC TTACTTGATA ACACCCCTTT CAATCGTTCA TTCATTAAAA ACATTTCTAA CAAATTTGTA ACCATCGATC ACACCAGAAA ATTGTTGATT ATTGATACTG GATATTTATT ATCCAATTTT ATTTTCAAGA CAGCCGAGAT ATTACAGAGA TTTTATTTGC TAGATGATAT CCTTATTTTT AAGTTGGCGA AACTGATAAA TTACACTGGT CGATCTGGAT CTGATTTTAT TCCTTTGATA TGCGAAGTAG ACGTCAGTGA CATTGCCAAA TTTGAAAATG AAGGAAATGG CCCTAGGATT CTCGAGTCAA TCACAGAAGA AAAGTGGACC ATGATATTCT CTTTGATTCA AACTACATTG AATTTCATCA GGATGGTAGA GCTTATTGAT CACGAAAAGA GCGACCAGGA ATTGGACTCA TATTTGGTGG CTGTCGGGCC CACAGAAGAA GGTGAATCGG GCTCTGGTCA AATTTGGTTC GAGAACTCAC ATTTGCAAAC TACTATAAAA GAAGCATGGC TCAAGTTCCC TGACTTCACT CTTGTTGCAG CATGTACTGT ATTATCTGTT ATTTGCAATT TGGTCATGCT TACCAAGTAC ATCAAAGTTC AAAAGCCTCC GGCAAATGGA GATAGTTCTG CCTCCATTAG AATCATATTT CTTGAATCGA GCTTGGAGAA AAATTTTAGC ATTCCAGTTG ACGACGAAGA AATAATAGAA TTTAATTACG AAAATTTGCT AAAGAGATTG TCGACATTGG TTGAGTTCAT TAAAACCAAA TCGCAAATCA CAAACGAATC TGTTGTGACT AGCACAATTG TTAAAATCAA CAAGATCAGC CATTATCTCG AGGGCATTGC TACGAATCAA TAAATTTAGA AGATAGGAGA ACTGACGTGA AAATGACACA ATGGACCCTG AAAACGGGGT TCAAGATTTC TTTTTAAGGT AATATCGCTA TCAAAAAGCA AGGGCATTAT TAAAAGGGAC CTTCATTTGG TCATGAGGTT CCGGTTACAT TTTCAAGACC AAGATCAAAT TTCATCAGCA TCATTGCTCA TTTTATGACA AATTCACTTC TTCAACCTTG GAGGTTTAAT TTCAATTGTA TTTTGCATTC ATACGAATTG TATAATTAGT ACTGTACTAC CAATTGCATA TAGACTCTTT TTTCGTTTTG TTATATTATT AATTTCGGTG TA
|
Protein sequence | MSNPISTPIQ SQNSADSSTT SSSRRPVARR ACLSCREKKI KCDGEPISTI VSVDGSNKII PQLTRICSNC KFLGIECVFV QSNRGGRRRK RASISHDIDD HADESYIEGP DHSKKFQPDS TNSSSTISSS APDFPYHIRS GSESVNYSAL SPRPTGHEPT LKNSTDRLSS FILDQQRTSL MLSSQFRPVS PAFSSVSTLE RPPMYFSRTE LNSDSASIST ATSGANAAYH EDYARHYNPY GPPPPHQGYP HPPPPHVPHG PHNQQGPHDH QGPHGHHGPP HHRHGHFHHR GPGRGGRRDW FDHYGPPPPP HQGMGYGPPP WARRPWDAWG PPPPGPPPPP PHYSESQLPN REVYESAPAA PVQAPPAPPA PPAPPAPSAP ASISSISNQS AIAATTAQET ESSTTQTKLS ETPVSTTHSV PVISANLTKS TSESSLIEGS GKPRLPPIDG IPVKPPKSAN SEHGFAETDK MSQIVLPGIL GTQSISMVGG VSLTSNYSLS KSSTTTLESA KIENDSNRSI SSEQKPRLLN LPSSIYEQDK STTVSSVPPF SDYELKHYNL PPWDILAQIL DLYYAFQHPN HRLLVPKKLL LSKLSLGYSS SILHALIAST CPLLLVFNVD VPKSCDENYW IEKVFQYWDD LNDFGIILTY SLLSKTSSYR FRLSKLNEFN IKIWEAIYNN NYVEIYNSTK FELENRDKSK TYTSRQIFER ELIINLIWNF YVNDLILLRF NLGNPYFKLS TRLQDFKFNY ELDIYLKKLL LPLELNLGSS RSNWTELNDS TLTPSGSNSV IISAKILEGV MTKISNEELT NDDLLDNTPF NRSFIKNISN KFVTIDHTRK LLIIDTGYLL SNFIFKTAEI LQRFYLLDDI LIFKLAKLIN YTGRSGSDFI PLICEVDVSD IAKFENEGNG PRILESITEE KWTMIFSLIQ TTLNFIRMVE LIDHEKSDQE LDSYLVAVGP TEEGESGSGQ IWFENSHLQT TIKEAWLKFP DFTLVAACTV LSVICNLVML TKYIKVQKPP ANGDSSASIR IIFLESSLEK NFSIPVDDEE IIEFNYENLL KRLSTLVEFI KTKSQITNES VVTSTIVKIN KISHYLEGIA TNQ
|
| |