Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_32842 |
Symbol | |
ID | 4840353 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009046 |
Strand | + |
Start bp | 752161 |
End bp | 753945 |
Gene Length | 1785 bp |
Protein Length | 594 aa |
Translation table | 12 |
GC content | 38% |
IMG OID | 640391668 |
Product | hypothetical leucine rich repeat protein |
Protein accession | XP_001385504 |
Protein GI | 150866036 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.548353 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCGTTTT CCATTATAAA CAAAGTTAAG CTGTTTCCTG CTGAGATTCA ATGTCAAATA ATCGATTATT TAGCTTTTGA CGACTTGGAA AGCTTGCTCA GAATACAAAC ACCTTTTCAA GACCATGTAG TTGACTTGCT TTTTCAGTAT ATAGTGCTAA ATGATTGTGA ACCCAACGAC GAGTCTATCC AATGGAAATG GCATACAGTG GGTATGTTAG TTGATCTTTT CCAAAGATAT CCGAAGATGA CATTCTCTGA AATCTTGGGT GATCTCACCA GCATATATGA GTTACATTTG GCAAGACCTG GTATCCTTAA CAGATCAACT ACAGTTACTA TAAGGTGCGA TTGTATGGAA GGCGAAGATG CCGCAGACCT ATTGAAAGAG CTCTGCAGTC ACTCGTACTC TATCAATATA GAAGAATGTT CTATCGAGAG CAAATCTATG GTTGAGGCTA TTTCTCCCAC TTTGACAGGG TTAGGTATGT TCCAGGAAAG ATTCAAAAAG GGTTATGATC TCAGCTATGA GCTTACCGAG TTGGGAACTT GCTCTAATCT TAGACAATTG GAACTAACTG AAATGCCAAT TACCTCTTCT CAGATAAGGT TCCTACCAAG ATGCCTAGAG AAATTGACTT TTGAATTAGA AGTCGAGAGC TTTTCATCTG TCCATGTATT AGAACTTCCT AATTGTCTCA TATATCTTTC CATGAAGCTC TCCCAAACTA GTAGTTCCCT TAAAAATGTA GAAATCAACG TCGAGTGCCT TTCAAATTTG AGAATACTCG AGTTGAAATA CTTCCAAATA CATTCTTTGT CATCTTGGAT CCTCCCAAAG GGAGTTGTTC AATTGTCAGT AATATCATGT GGAATTAAAG ATCTCTCAGG ATTGCTGAAC TTGTGCAACT TGAACAAGTT GGTACTACAA CTAAATCCAA AAACTGAAAT CGACTTCACC GTATTGCCCT CTAGTATCAG AAGGCTATGG TTGTATTGTG AAAGTGTACC TAACGAAATA TGTCAGTTGT CATCTTACAT TCAGTTAAAT CTTGTAGTTA CAAGTCTAGG ATCTGACAGA GATTTCTCAA AGATCAAGAA CTTGAATAGC CTATATCTCA ATGGAACGTC GTGGGCCACT TCTACAAACT TCACGTCTCT CGGTGACTTA AAACTTCCGT CGCAATTAAA GGAAATATCA TTTAACATGT TCCTAGGTCT ACAAAACATT CGTGATTTGC AATCTGGAAA TTGTTTATCG TCGTTAGCCC TTAATGGAAA GTTTGGCCCA ATGCTAGTCA AGTCACTTCT AAGAGGCAAC GTTTTTCCAG AATCACTAGT GGATATACGA TTGTTTATTG AGTTGACGAG TTCAGATTGG GTGGTTGATT CTAGAACCTA CCCCAAAGAA TACATTTCGT TCGGAGCTGA AGGTTCACAA AATTTAAGAT TTGGGAATTC ATTGTGTTTA CCAAACAAGT TACAAAGGTT GGCTATAGTG TGTGATTCGT TAGTAGCAGA AGGTGATCTC AATCTACCTT CTAGTATCCA AGAAGTTCAC TTAAAGGTAT TTGCATACAT ACAGATGTCA AAACTAGTAA TTCCCGAAGC TGTAAGAAGG ATTCTGCTCC CAGAAGTTTT ACCTAAAGCA GATTTCCATC TGTATCATTT CCCTACATCT TTAACAGATA TTTATGTTCC TGACGAGAGT TATAAAGTAA CACTTAGGAA TACCAATAAC ATTAAATGGC TCAATGTATA TCCTCCAGTA GCTAAAGATA TATAG
|
Protein sequence | MSFSIINKVK SFPAEIQCQI IDYLAFDDLE SLLRIQTPFQ DHVVDLLFQY IVLNDCEPND ESIQWKWHTV GMLVDLFQRY PKMTFSEILG DLTSIYELHL ARPGILNRST TVTIRCDCME GEDAADLLKE LCSHSYSINI EECSIESKSM VEAISPTLTG LGMFQERFKK GYDLSYELTE LGTCSNLRQL ELTEMPITSS QIRFLPRCLE KLTFELEVES FSSVHVLELP NCLIYLSMKL SQTSSSLKNV EINVECLSNL RILELKYFQI HSLSSWILPK GVVQLSVISC GIKDLSGLSN LCNLNKLVLQ LNPKTEIDFT VLPSSIRRLW LYCESVPNEI CQLSSYIQLN LVVTSLGSDR DFSKIKNLNS LYLNGTSWAT STNFTSLGDL KLPSQLKEIS FNMFLGLQNI RDLQSGNCLS SLALNGKFGP MLVKSLLRGN VFPESLVDIR LFIELTSSDW VVDSRTYPKE YISFGAEGSQ NLRFGNSLCL PNKLQRLAIV CDSLVAEGDL NLPSSIQEVH LKVFAYIQMS KLVIPEAVRR ISLPEVLPKA DFHSYHFPTS LTDIYVPDES YKVTLRNTNN IKWLNVYPPV AKDI
|
| |