Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_88034 |
Symbol | |
ID | 4837336 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009042 |
Strand | + |
Start bp | 2403543 |
End bp | 2405583 |
Gene Length | 2041 bp |
Protein Length | 530 aa |
Translation table | 12 |
GC content | 43% |
IMG OID | 640388651 |
Product | predicted protein |
Protein accession | XP_001382686 |
Protein GI | 150864015 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | CCGCTTACAA CGTATCCCCG ATCGACCCAT CCATATCACA CTTTCTTCGG ACGTCCAACA TTGCGCAGAA CGGCAATTGA CTGTCATCTA TCTGAATAAG CGAGAACTTC TATCACAGTA GTCTCAGTTC ATCCCAGTTC TTTCAAAGCT TTTCTCAATC TCAGTGTCAT TTCTCAGTAC TCATTTGCAA TTATTTTCCC TAATTTTTGA CGGGCTTTTC ATCGACATCT CAATCCTATC AACTTTTCCT TGACATTCGT ACACATCTCA CTTCTCACGT TCATCACTTT CCGTTAGCTT TCGTCATCCC AATCCATCAG AATGTCCAAT CCCCCCTTCC AAAGACAACA ACCCTCGTCG CCCTTAACTA CGCCCAACCA CCACCCTTCT AACCATCAGC ACCATGCATC ACGTAAACCC TCCATCGTGG AATTGTTGAG CTCGCCACCT CCCTTGCCCA ACAACACCAT CGATGACGAA ATCCACCAGT TCAGCTTGTC CCGCAACACG TCCATAAGCT CACGGACTTC GTCTTTCAGT CAACAGCAGC ACGGTGCTCC ACATCACGGC TCTAATTCCC ACCATTTGTC GGTATCAGGT ATGGACTGGT CGGAGATCCC TTTGAGCGAA TTGACCGAGT CCAACAAATT GATCTATATC AATTCGTCTT ACTCGGTGCA AAAAGCATTC GAAACGTTGG TATCAAACAA CTTGACCTCG GTGCCTGTCT CAATTTCGTC CTCTAATGAA AACGATTTAA GTAGCTGTTT GACGTTTGAC TACTCAGACT TGAACACATA TCTCTTATTG ATCATGAACA AGATCAATCT CAGTGAGCTT TCTGTCAGTG AGATAGGCAA TGAACACGAC TCTACAGCCA AGAAGCATGA GATCATCACC CAGACAATCA ACAAAGCCAA AAGAGGAGAA GAAGTACCCA TAGAATTCAT CATCAAACTT CATCCAAAAA ACCCCTTCAT AAAGTTTACG GAAAACGACA CATTGTTCAA GGTGATGGAG ACATTGGGTA ACGGAGTTCA CCGTGTAGCC ATCACCAATC TTGAGTCTAC CAAGATTACA GGAATATTGT CGCAAAGAAG ATTGATAAAA TACATGTGGG AGAACGCCCG GAGATTCCCC TCGCTTGACT TTTACTTGAA CTCTACATTG CAAGACTTGA AGATCGGCTC CAGTACCCCC ATATTCATCT ACGAAGACCA GTTATTGATC GAAGCCTTAT ACAAGATGTT CAACGAAAGA GTAAGCTCCT TGGCTGTCAT TGACAGAACA AAGTCACTTA TCGGCAATAT ATCTATTGTT GACGTCAAGA ACGTTTCAAG CTCCAAGAAC TCTCACTTAT TGTTCAAGTC AGTGTTGACT TTCATCAGCT ACAACTTGAG CCAGAAAGGC ATTGAAGAGG GCCAAGACCA GTATCCTATC TTCCATGTCA ACAAGCAGAG TTCGCTAGGC AGAGTCATTG CCAAGTTGGT GGCTACGCAA TCACATCGTT TGTGGATTGT GGAGTCCAAC ACAAGAACTC ACCAAAACTC CATCTCATCG CCTGTCACTA TTGAAGCAAC TTTGAATGTT AGTGCCAACC CTTCGTCAGC ATCTTCTTCT AATGCAAACA CTCCTGAAGG TAACTTCGGT TTACCAGGAA AATTAATTGG TGTCGTCACC TTGACCGATA TCTTGGGATT GTTTGCTACA TCTAAAGGTA CCAAGACCGA TCCACAATTC GCGAGAAACC AAAGAAGAAG ATCGTCTACT TCAACTACGC GCTCATCTAT AGACAGTGCT ATCAGTGTAG GTGACGGAAG CGCCAGAACA ACCAATGCCA ATGCTGACCT GGAGATCTTC CGCAAATCGT ACACTGCCGC TGCAAAGAAT GAAAGTGCCA TTTCCAAGGA CTAGAGAAAG AACTGATAGC ATGGCAGAAT CAGCTCAATC CATGTATCAG TAGTTATTCT CTATCCATAT TGTTCTACAG TTTATTTATA TTTATTATTT ACCAATTCTT ATAAATGCAT AAACCATACA G
|
Protein sequence | MSNPPFQRQQ PSSPLTTPNH HPSNHQHHAS RKPSIVELLS SPPPLPNNTI DDEIHQFSLS RNTSISSRTS SFSQQQHGAP HHGSNSHHLS VSGMDWSEIP LSELTESNKL IYINSSYSVQ KAFETLVSNN LTSVPVSISS SNENDLSSCL TFDYSDLNTY LLLIMNKINL SELSVSEIGN EHDSTAKKHE IITQTINKAK RGEEVPIEFI IKLHPKNPFI KFTENDTLFK VMETLGNGVH RVAITNLEST KITGILSQRR LIKYMWENAR RFPSLDFYLN STLQDLKIGS STPIFIYEDQ LLIEALYKMF NERVSSLAVI DRTKSLIGNI SIVDVKNVSS SKNSHLLFKS VLTFISYNLS QKGIEEGQDQ YPIFHVNKQS SLGRVIAKLV ATQSHRLWIV ESNTRTHQNS ISSPVTIEAT LNVSANPSSA SSSNANTPEG NFGLPGKLIG VVTLTDILGL FATSKGTKTD PQFARNQRRR SSTSTTRSSI DSAISVGDGS ARTTNANADS EIFRKSYTAA AKNESAISKD
|
| |