Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_70551 |
Symbol | |
ID | 4836963 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009042 |
Strand | + |
Start bp | 347485 |
End bp | 350530 |
Gene Length | 3046 bp |
Protein Length | 974 aa |
Translation table | 12 |
GC content | 47% |
IMG OID | 640388278 |
Product | predicted protein |
Protein accession | XP_001382298 |
Protein GI | 150863731 |
COG category | [K] Transcription |
COG ID | [COG5164] Transcription elongation factor |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.770055 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | CCATCAGCCC GAACTTCACG ATGTCCAGTG AGGAAAATGT CAAACGTGAG CAGGATAATG TCCTCAAACA CGAACACAAT GAAGAGGATG TTTACAATGT GGGCGAACGA GATCAGGACG AGATCGATGA GGAAGTGGAA GAAGATGAGG AACAGAAGCT AGATATCCAG AAAGAGACAG GCCTCAAACG GACGAGATCC GAAGCAGAAA TTGACGTTGG AGACAACGAC GAAAACATCA ATGAAAAGAA TAACAATGAA AACGAAGAAG GAGAAGATGA CGATGAAGAA GATGACGAAG ATGAAGACGA AGATGAAGAT GAGGATGATG ATGGTGTGTC CACGGGAAGA AGAAGAAAGA GAAGAAGAGC TGGAAACCAG TTCATTGATA TCGAAGCTGA AGTTGACGAC GAAGAAGAAG ATGAGCTTGA CGAAGATGAC GAAGAAGCCG AACTTTTGCG AGAGCAGTTC ATTGCTGACG ACAGACATGC TGGCGAGAAC GAAGCTGAGA GCCACGATGA CAGATTGCAC AGACAGTATG ACCGTAGACG GCAAGAGGCC GAGGACCAGG ATGCTGAGGT ATTGGCCGAG ACCTTGAAGC AACGTTATAG AAAGACCCAT ACCGTCTACC GTGGGGACAC TGCTGCCAGT GGTACTGTAT CGCAGAAACT CTTGATGCCC TCTATCAACG ACCCATCCAT CTACGCTATT AGATGTACCC CTGGCCGTGA GAAAGACTTG GTGCGTAAGC TTTACGAAAA GAAGAGAACT CTTGACAGGC TGAATGCTCC GTTAGATATT CTCACAGTGT TTCAGAGAGA TGCCTTCAAG GGATACATCT ATATTGAAGC CAAGAGACCC GATGCCATCG ACAAGGCGCT TGTCGGAATG GTGAACATCT ATGTAAGAGA CAAACTCTTA GTTCCTGTCA AGGAATACCC GGACTTGCTC AAACAGGTCA AATCGTCGGA TGTCGAGTTG GTGCCTGGTA TCTACGTCAG AATCACCAGA GGTAAGTACA AGAACGACTT GGCAATTGTA GACAACTTGT CAGAAAACGG GCTTGATGTG CGTTGTAAGT TGGTTCCTCG TTTAGACTAC GGTAAGTTCG ACGAGTTCGA TAAGGATGGC CGTAGAATTA GACTGAAGGC AAGACCCTTG CCTCGTTTGT TCAGTGAACA AGAGGCTAGA CAGTACGATA GAGAGTTCTT ACAACCTGGA AGAGGTCCTC GTTCGTACGT CTACAGAGGC GACGAATACA TTGAGGGCTT CTTGTACAAG GACTTCAAGT TGCAGTTTAT CCAGACCAAA GATGTACATC CAACGTTGGA GGAATTGGAC CGCTTCCAGA CAGGCAACAA CGAGGAAGAA GGATTTGATC TTGCCGCAAT CGCTGCCTCG TTGAAGAACA AGAAAGGTGA AGGCAAGTCC ACTGCCTTCC AGCCCGGTGA CAAGGTGGAG ATCCGAAGAG GCGAGCAAGC TAAAACTGTG GGTGTAGTGG TGGAAGCCTC TTTGAACGAA ATCACTATCT CTGTTACTGA CAGTGGAGAC CCCAAGTTCG TCAACCAGAA GTTGACCGTC CCTGCCAGCG ATTTGAGAAA GATATTCAAT GAAGGTGACC ACGTAAGAAT CGTGGAAGGT AAGCATTTCG ACGAGACTGG GTTGGTTATC AAGATAGACG GCGACTCGGT TGTTCTTGTC AGTGACCAGA CACGTGAAGA TGTCAGAGTG TTTGCCAACT ACTTGGTGAA AGCTACTGAT GCCTCGCTGA ACATGGACAC CATTAACAGT AAGTACGATA TCAAGGACTT GGTGGAGTTG AATGCCGCCT CTGTCGGTGT CATTGTCAAG GCCGAAAAGA ACATCTTTGA AGTGTTGACC TCTGACGGAA GGTTATTGTC GGTGAAACCC AGCGGAATAT CGTCGAAGTT GAAGATGAGT CGTCGTGAAC AGATCGCTAC TGACAGAAAT GGGTTCACGA TCAAGATCGG TGACACCGTC AAAGAGGTCT TGGGTGACAA GAAGCGAGAA GGTGCCATTT TACATATCTA CAAGAACTCG TTGTTCATCA AGTCTAACGA GATTGTCGAG AATTTGGGTA TCTTTGTCAC TAACCGTATG AATGTTAGCA CTATTTCCAC CAAGGACTCC ATGGTATCCA AGAATTTGGG CCCTGACTTG ACTTCCATGA ACCCCAACTT GAAGCTTCCT AATCCTTCCG CTGGTGGTTT CAAACCCCGT GCTGGTGGCC GTGAAAAGTT GTTGTACAAG GATGTTGCTG TGACAAGTGG ATCCTACAAA GGTTTGAAGG GTAAGGTGAT TGAAACCGAC GATGTCTACG CCAGAATCGA ATTGCACACG AAGAGTAAAA AGATCAAGGT AAACAAGAAT AACTTGAACG TATTGATCCG CGGAGAAGCA ATACCATACT TGAGATTCAT TGGTGCAGCA CCTTCGGAGT CTCGCGAGTT CAACAAGCCC AATGCGCCAG CATTTACTTC TGGCGAGAGA TCGTCGTGGA ACGGAAATGC AACACCTGGC GTGGGTGCAA ATTCCGCCTG GGGAGGAGCT TCTTCCAGTT GGGGAGGAGC TTCTTCTGCC TGGAACGGTG GAAAGACCCC AGCTTACAGT GGAGGAAACT CTACCTGGGG AGGAGCTGCT TCTACTTGGA ACGGCGGAAA AACTCCAAAT GCTGGAGGAA CTTCTGAATG GGGTGCCAAT GGAAGCAATT CTACCTGGGG CTCTTCCAGA GGTGGGGGTT CTACTTGGGG CTCTTCCAAC AGAGGAGGAA ATTCTACGTG GGGATCCTCC GGAGGTAACA CTAGTAACCC CAGCAATAAA AACAACAATA ACAATAGCAG TGGCAGTAAC TCTGCTTGGG GTGGCAACAA CTCTACGTGG GGTGGCCAAA ACAAAGGGAA TAGCAGCACT TGGGGAAGAC AATAAACTGC TGAAACATAC TATTTGTCTA CCAGCTAGCT TCCATTCTTT ACAGCTTTAT GTACTTTAGC CAGCTTATCT TGTATACTAA TAAAAAAGAA GATATG
|
Protein sequence | MSSEENVKRE QDNVLKHEHN EEDVYNVGER DQDEIDEEVE EDEEQKLDIQ KETGLKRTRS EAEIDVGDND ENINEKNNNE NEEGEDDDEE DDEDEDEDED EDDDGVSTGR RRKRRRAGNQ FIDIEAEVDD EEEDELDEDD EEAELLREQF IADDRHAGEN EAESHDDRLH RQYDRRRQEA EDQDAEVLAE TLKQRYRKTH TVYRGDTAAS GTVSQKLLMP SINDPSIYAI RCTPGREKDL VRKLYEKKRT LDRSNAPLDI LTVFQRDAFK GYIYIEAKRP DAIDKALVGM VNIYVRDKLL VPVKEYPDLL KQVKSSDVEL VPGIYVRITR GKYKNDLAIV DNLSENGLDV RCKLVPRLDY GKFDEFDKDG RRIRSKARPL PRLFSEQEAR QYDREFLQPG RGPRSYVYRG DEYIEGFLYK DFKLQFIQTK DVHPTLEELD RFQTGNNEEE GFDLAAIAAS LKNKKGEGKS TAFQPGDKVE IRRGEQAKTV GVVVEASLNE ITISVTDSGD PKFVNQKLTV PASDLRKIFN EGDHVRIVEG KHFDETGLVI KIDGDSVVLV SDQTREDVRV FANYLVKATD ASSNMDTINS KYDIKDLVEL NAASVGVIVK AEKNIFEVLT SDGRLLSVKP SGISSKLKMS RREQIATDRN GFTIKIGDTV KEVLGDKKRE GAILHIYKNS LFIKSNEIVE NLGIFVTNRM NVSTISTKDS MVSKNLGPDL TSMNPNLKLP NPSAGGFKPR AGGREKLLYK DVAVTSGSYK GLKGKVIETD DVYARIELHT KSKKIKVNKN NLNVLIRGEA IPYLRFIGAA PSESREFNKP NAPAFTSGER SSWNGNATPG VGANSAWGGA SSSWGGASSA WNGGKTPAYS GGNSTWGGAA STWNGGKTPN AGGTSEWGAN GSNSTWGSSR GGGSTWGSSN RGGNSTWGSS GGNTSNPSNK NNNNNSSGSN SAWGGNNSTW GGQNKGNSST WGRQ
|
| |