Gene Information |
Locus tag | PICST_53682 |
Symbol | |
ID | 4851676 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009068 |
Strand | - |
Start bp | 2508970 |
End bp | 2511114 |
Gene Length | 2145 bp |
Protein Length | 714 aa |
Translation table | |
GC content | 41% |
IMG OID | 640393384 |
Product | predicted protein |
Protein accession | XP_001387054 |
Protein GI | 126275232 |
COG category | [R] General function prediction only |
COG ID | [COG5191] Uncharacterized conserved protein, contains HAT (Half-A-TPR) repeat |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGATTCAT CAGCTTCCAC ACTAGCTCCT GTGCAAGTCA CAAGTGAACA GATCTTGCTG GATGCATTTC AGAGCCGAGA TAGGCCTCTA GAGAGACCCA AACAGTCAAT CCAAGATCTT GAAGAACTCC GATCATTCCA GCAGAAGAAG AGAAAAGAAT ACGAACAACA GCTCAACAAA AACAGGTTAA ACTTTGGACA GTTTCTACGG TACGCAAAAT GGGAAGTCAA CCATAACCAC GATTTTCCCC GGGCTCGTTC TATTCTTGAG CGGGCTCTAG ATGTCAATGT TCAACATGTG CCGTTTTGGG TTCAGTATAT ACAACTCGAA CTTTCGCACA AGAATGTCAA CCATGCCTTG AATCTTCTTG ATAGAGCTAC CACTACCTTG CCACGAGTCA ACAAGTTGTG GTTTCTTTAC GTTCAGACGT TAGAGACGTT AAAAAACTAC CAGTTAGTGA GGAATGTATT TGAGAGATGG TTGAAATGGC ATCCTGATTC ATCAGCCTGG GAAGCATATG TAAATTTTGA AAGGCGGTAT GATGAATACG ATAATGTTAG AACAATTTTC TCTCGTTATG TTCAAGAATA TCCTTCTGCA CAAGTCTGGC TCAAATGGAT AGAATTCGAA ATGTTTGCTG GAAGCCTTTC CCAGAACACT GAAACTTCAG TTCAGAATAT AAGAGTAGTT TATGAACAGG CAGTGGATAC AATTATCAGC GATAAAAGGA TCAGAGATGA CCCTGAGTTG CCAGCTATTA TAGCCAACTG GGCAGATTGG GAGATTTCGG TTAAGGAGTA CGAACGAGCC AGAGCCATCT TCGTCACTTT GTTAAATGAA AACAGCAAGA TCAAACTTCT GAAACAACAA AGACTGCAAA TCTCTTCCTC GTTTACTACT TTTGAAAAGA GACATGGCAA CAAGGATTCG ATCGAATCAT CTGTGCTACA GAAGCGCAAA TTAAGACATC AAGAGAATAT AGAGAAAAAT CCCCAAGATA TAGATTCGTG GTGGTCATAT ATTCAAATAG TACAACTGGA GAATAATATC GAAGAAACAA GAAACGCCTT CAAAGGGGCA ACTTTTAATG TGCCTTCATC CAAGACCAAG TCCATTCAAT GGAGACGTCA TATCATGCTA TGGATCAAGT ATGCGTTATG GGAAGAGTTT GACAATGAAG ATATCACGTT GGCCAGAGCT GTTTGGAACG AGTGTCTAAA GGTGATACCA CACAAGAACT TCACATTTGC CAAAGCATGG ATCCATTTTG CAGAATTTGA ATTGAGAAAT AATGAAAGTG AAGAAAGCCT ACAGACGGCT CGAAAGATTC TTGGTAGAGG GATTGGTCAG TCTTCTGTTT CCGGACCCAA AAGGAAACTC TTCGCCTACT ATATTAGTTT GGAAAAAAGG TTAGGCGAAT GGGATAGAGT CAGAATGCTC TACGAGAGGT GGTTAGAAGT AGCGACACTG ACCGGTACTA GTTCCATTCC TGTTGTTCTT GAATACGTTG AATTCGAAAA GTCATTAAAT GAGATTGACA GAAGTATATC TATCTTCCAG ATTGCTTTAG AACTCTCTGA AGATGCAAAA ATTTCGTCCA GTTTTGAACC TGTGGAGACA ATTTGGATTT CGTTTATCAA TTTTTATAAA GAGGAGTTGA AGTACGACGA CGCAAGATCG TTATATCGCT TGTTGTTAGA GAAGATGGAC AGCACCAAGG TGTGGATATC ACTTGCACTT TTTGAGTCTT CCATTCCTTC ATCTCGTCAA TTGATAGAGT ATGAAGAGAG CAACGCCGAT 
GAATTCGAGT TTTCCGTAGA AGATGAGCAC AGAGAAAATA CTAGAGCTGT TTTCAAGGAA GCTGAACTAC ACTTCAAGAA TATTGATTCT AAAAACGAGA GACTTGTCAT TCTTGAATCG TGGAAAAGCT ATGAAAAATT GCATGGAAAT AACGAGAGCC TAGCAGATAT CACCAAGAAG TTGCCTACCA TTGTGAAGCG AAGAAGAACC GTTGATGGTG TAGACGAAGA ATACTTGGAC TATATATTTC CTGAAGACGA GTCGAGCGCC GCTAAAGTTC CCGGAATCAG CAAGTTCTTG GCGAATGCAA AGAAATGGGC TGCTCTACTG TCTCAGAAAA TGTAG
|
Protein sequence | MDSSASTLAP VQVTSEQILL DAFQSRDRPL ERPKQSIQDL EELRSFQQKK RKEYEQQLNK NRLNFGQFLR YAKWEVNHNH DFPRARSILE RALDVNVQHV PFWVQYIQLE LSHKNVNHAL NLLDRATTTL PRVNKLWFLY VQTLETLKNY QLVRNVFERW LKWHPDSSAW EAYVNFERRY DEYDNVRTIF SRYVQEYPSA QVWLKWIEFE MFAGSLSQNT ETSVQNIRVV YEQAVDTIIS DKRIRDDPEL PAIIANWADW EISVKEYERA RAIFVTLLNE NSKIKLLKQQ RLQISSSFTT FEKRHGNKDS IESSVLQKRK LRHQENIEKN PQDIDSWWSY IQIVQLENNI EETRNAFKGA TFNVPSSKTK SIQWRRHIML WIKYALWEEF DNEDITLARA VWNECLKVIP HKNFTFAKAW IHFAEFELRN NESEESLQTA RKILGRGIGQ SSVSGPKRKL FAYYISLEKR LGEWDRVRML YERWLEVATL TGTSSIPVVL EYVEFEKSLN EIDRSISIFQ IALELSEDAK ISSSFEPVET IWISFINFYK EELKYDDARS LYRLLLEKMD STKVWISLAL FESSIPSSRQ LIEYEESNAD EFEFSVEDEH RENTRAVFKE AELHFKNIDS KNERLVILES WKSYEKLHGN NESLADITKK LPTIVKRRRT VDGVDEEYLD YIFPEDESSA AKVPGISKFL ANAKKWAALL SQKM
|
| |
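The coordinate and length fields in the Gene Information table can be cross-checked with a little arithmetic. The sketch below is only an illustration, under two assumptions that the record does not state explicitly: that Start bp and End bp are 1-based inclusive coordinates, and that the gene sequence includes the terminal stop codon (the sequence shown ends in TAG). The helper names `gene_length` and `protein_length` are hypothetical, not part of any database API. Because the record lists the gene as not spliced, the coding length is taken to equal the full gene length.

```python
# Values copied from the Gene Information table above.
START_BP = 2508970
END_BP = 2511114
REPORTED_GENE_LENGTH = 2145      # bp
REPORTED_PROTEIN_LENGTH = 714    # aa


def gene_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residue count for an unspliced CDS, assuming it includes one stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1   # drop the stop codon


computed_gene_length = gene_length(START_BP, END_BP)
computed_protein_length = protein_length(computed_gene_length)

print("gene length matches:", computed_gene_length == REPORTED_GENE_LENGTH)
print("protein length matches:", computed_protein_length == REPORTED_PROTEIN_LENGTH)
```

Under these assumptions both computed values agree with the Gene Length and Protein Length fields reported in the record.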