Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_63672 |
Symbol | |
ID | 4840575 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009047 |
Strand | + |
Start bp | 125454 |
End bp | 127703 |
Gene Length | 2250 bp |
Protein Length | 712 aa |
Translation table | 12 |
GC content | 45% |
IMG OID | 640391890 |
Product | predicted protein |
Protein accession | XP_001386041 |
Protein GI | 150866437 |
COG category | [A] RNA processing and modification [D] Cell cycle control, cell division, chromosome partitioning [K] Transcription |
COG ID | [COG5147] Myb superfamily proteins, including transcription factors and mRNA splicing factors |
TIGRFAM ID | [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | AAAAGAGTGG AACCTGTAGA TCCATTAGCG ATATCTGAAT CGCTAGGAGT TCAGACTTTT CGTAGAGAAA CCAGACGGCC GTTTACCAAA GAAGAGGATG ACCGTTTGAC CGAGCTTGTC AACCGTTATT ATGGTGATAA GGTCCATGAC TTAAACTTAG ATCTGGTGGA CTGGGAGTTT CTATCCAAGG AGTTGGAACC TAACGGTTCT AGGAAACCCA AGATGTGTCG TAAGAGATGG GCCAATTCGC TTGATCCCAA CTTAAAGAAA GGCAAATGGT CGCCTGAAGA AGATGAATTG CTTATACGAA CCTACCAGAA GTATGGCGCA ACCTGGCTAC GTGTAGCTTC AGAAATTCCT GGCAGAACAG ACGACCAGTG TGCCAAAAGA TATACCGAAG TGTTAGATCC AAGCACCAAG GACCGGTTGA AGTCTTGGAC ACAAGAGGAA GACTTGAAGT TGATCAGTTT GGTCAAGATT CATGGCACCA AATGGAGAAC TATATGCACA AAGATAGCTG GCCGCCCAGC ATTAACTTGC AGAAACAGAT GGAGAAAGCT TCTAACCGAC GTAGTACGAG GAAAGTCTAG CGACTTTATA AAACGACAGG TTCAGTCCAT AACAGATGGC CTTAAGCTCG AAGCCACTGC CTCCATTTCT AATGAAGAGG ATCAATTTAC AAAGCCAGCA GCTTCTGGTT CTATGGAAAA TTCAACTCTT TCACAGACTT CCAAGTCTAT ACAAGCTGGT TCGTCTTCCA TACTGAGCAC GATCGCTTCA GTCTTGTCTG CCAACTCTGT AGGCACAGTA TCGGCTTCTC CCTCTGTGTC TGCCTCCACG TCTGTTTCAG GTATTGCTGA ACAAGCAGGT GACCAAGTTG CTCTAGCTTC CAGATCGAAA CCAGAACAGC GTACATCGTC GGAATCTACA AGGGAGGTTG AATGGAGATA TACCATAATG GGAGGCGAAG ATGCCGATTT GCCTCACAAG CGATTGTTCA ACAGCGAGAA CGGTGGTGCC ATCAAGAATC AAGAAATGGT GCATTATCTA ATTTCGTATG CCAAAACTCA TGGATTGAAT ATAACTGTCC ACCAACATAT CCACCACCAC TACTCGCCTC CACAGCATGC GGCAGTAGAT GTGAATGGTT ACCAGCAGAC CTACCAGTCT TTATTATCGC CTGTTAATGC TATCAATCAG ACTGATGGCT CCCATAGAGA CACCATAAGT AATGCGTCTG GATCTCCAGC TGACGGCCGT TCTTCATCCG CATACTTACT TGAGCCTGAA ACTCAATTAA ATAGACATCA ACACTTCAAC TACTTGCCAC CCACGACTGA AGTTCCAAAG TTGAACTCTT CACAGAACTC CCCTCATGAC AATCCCTCGA CTCATCATCA CCACCACCAT CATCATCATC ATCATCACAA TTCATTATCG AAGAGGAATC GAATTTTGAA TGAAAGCGAA TCTGTTGCAA AGGAGTCTGA CTTGCTCAAG ATATTGAATC AGTCGCAAAT AGACATGACC GATACTGCTA ATGAAAGAGA AAAAACTCCT GGTGGACATC CCTTGACTCC TTTGACGCAA GCAGTGGAAA TGGCTGCGGC TGCTGAGGCC AATTCTAAAA AGAGAAAGAA CACAGACGAA ACGAGAAATA GCAAAAAACT TCACTACGAA GTACCAGACG AAGAGGGTTT GGATTTTTGG GAAACTATGA GAAACTTGAC CGATCTACCC AACCACCAGG TAATCCAGCA ATCGATGACA AGAAAGGATA AGTCTGAATA CATGGTTTCA GAGCCGCAAA GACAACATCG ACTGCATAGG CACCAAAATC AGCATCAAAG CCAACACCAG AGCCAGAACC ATCAGAGTCC ACATCAGCAG CAACAGCTAC ATATTCATCA GCAGTCGATT TATGGAGATG ATCACCAGAA GCAAGTGTCG CACCATCAGC AATACTTCTC CAACAACAGT AATGTTCAAA CTGGTGAACA AACTATCCCC GAGGGAAAGG ATAGTCATGA TATTCATAAG AACGGAGAGG CCGAAGCGAA ACATGAAGAG ATCGATGAAG ATGTGGATCC TGAAGTGTTG AACGCTTACG GGTTGTTCTA CAATGTATAC ACAAGAGAAG GTTCCGTGTT GCCGGAAGGA CAGCCACAGA ACCAGCAGCC TCCAGCTCCA GGGGCTGTGT ATGATGCCTG GGGTGGAGGT TTTGGAATCA TTCCTTTCAA TCCTTCATAG
|
Protein sequence | KRVEPVDPLA ISESLGVQTF RRETRRPFTK EEDDRLTELV NRYYGDKVHD LNLDSVDWEF LSKELEPNGS RKPKMCRKRW ANSLDPNLKK GKWSPEEDEL LIRTYQKYGA TWLRVASEIP GRTDDQCAKR YTEVLDPSTK DRLKSWTQEE DLKLISLVKI HGTKWRTICT KIAGRPALTC RNRWRKLLTD VVRGKSSDFI KRQVQSITDG LKLEATASIS NEEDQFTKPA ASGSMENSTL SQTSKSIQAG SSSISSTIAS VLSANSVGTV SASPSVSAST SVSGIAEQAG DQVALASRSK PEQRTSSEST REVEWRYTIM GGEDADLPHK RLFNSENGGA IKNQEMVHYL ISYAKTHGLN ITVHQHIHHH YSPPQHAAVD VNGYQQTYQS LLSPVNAINQ TDGSHRDTIS NASGSPADGR SSSAYLLEPE TQLNRHQHFN YLPPTTEVPK LNSSQNSPHD NPSTHHHHHH HHHHHHNSLS KRNRILNESE SVAKESDLLK ILNQSQIDMT DTANEREKTP GGHPLTPLTQ AVEMAAAAEA NSKKRKNTDE TRNSKKLHYE VPDEEGLDFW ETMRNLTDLP NHQSQNHQSP HQQQQLHIHQ QSIYGDDHQK QVSHHQQYFS NNSNVQTGEQ TIPEGKDSHD IHKNGEAEAK HEEIDEDVDP EVLNAYGLFY NVYTREGSVL PEGQPQNQQP PAPGAVYDAW GGGFGIIPFN PS
|
| |