Gene Information |
Locus tag | PICST_38031 |
Symbol | |
ID | 4851362 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009068 |
Strand | + |
Start bp | 1604985 |
End bp | 1606136 |
Gene Length | 1152 bp |
Protein Length | 383 aa |
Translation table | |
GC content | 43% |
IMG OID | 640393070 |
Product | predicted protein |
Protein accession | XP_001387539 |
Protein GI | 126274403 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG1676] tRNA splicing endonuclease |
TIGRFAM ID | [TIGR00324] tRNA intron endonuclease |
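The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon. Both assumptions are consistent with the values listed here but are not stated in the record itself. The function names are hypothetical and used only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return cds_length_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length_bp = gene_length(1604985, 1606136)
print(length_bp)                  # 1152, matching the Gene Length field
print(protein_length(length_bp))  # 383, matching the Protein Length field
```

Because the record marks the gene as not spliced, the genomic span and the coding length coincide; for a spliced gene the same check would need the exon coordinates instead.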
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCAAAGC GTAATAACAA GAAACTTCTC AACCAGATAT ATTCTCGCCC GCTTCCGATC GAGTTGTCTT CTGACAAGTA CGGGGTGGCG ATGCCGACGT TGTTTCCACA TAATCCTGTT TCGTGGCTCT TTTATTTAAC TAGATTGATT CAAGTAAATA CTTTGTATTC TGTCCCCCAG TCTCTGGAGA CACTAATAGA CGTTAGCTAC GACTCTGATG GCATATTTAA GGTGCTGGAT GAAGAGTCAA TGGCAAAATT GTGGAGATGC GGCTTTTTTG GGAAAGGTAC TCTCTCTAGA TCAGAACCAA CGTGGAAGGC ACGTACCATC AAGAGATTGA ACTTGGACTC GAACACAGCC AATGCTTTAT CGATGGAAGA AGTAACCAAC AAGAGACGAG ACGAGAGGAA GAAGTTCAAG GCTGAAAGGT CTAGGCTCCA GGAACTCGAG CTTAAGCAAC GTAAGGGCGA AATTTCGGAT CTAGAATCCT CACAGTTAGA ACAACTCAGA GAAACCCTAG CATCGCTCAG ATTGATCGAC TTCAAATTGT CAAAAGATTC TTTTGATAGA GAAACAGACT TGAGGTTCGA GGATTTGGAT CTAATAGAGT CCAACCAGCT TGGACGGAAC CTTGAATTCT TACAGTTGCA AGCCATAGAA ACGTTCTTCT TGAAGTTTGC TGTCAACGTT ATCCGTGTAA ACGACTTTTC CACAAAGCAA TTGTTTCTAG AATGTTGTCG TCAATCAGGA ATACTTAAGC CTACAAACAA GTTTGTTTTA GATTATGTTG TATATCACCA TTATCGTTCG CTTGGATGGT GTGTGAGATC TGGAGTCAAA TTCGGCTGTG ATATGTTGCT CTACAAAAGA GGTCCACCAT TTATCCATGC CGAGTATTGT ATTTTGGTTA TTTCCAATGA TGATAAGGCA AGATACGACT GGTTTGAAAT GGCTGCGAAA GCCAGAGTCA TCGGGACCGT GAAGAAAACC TTTGTGCTCG TGTACGTCGA TTCCCCGACA GAAGAAAGGT TCAACTCGAT ATTGAGCAGC GCATATTCTG ACGAAGGTAT ACTCTTCCAG GACTTGTTCA AGTTGTATAA AGTCACTGAA ATTCTTTACC GGAGATGGGC TCCTAGCAAG ACTAGGGACT GA
|
Protein sequence | MSKRNNKKLL NQIYSRPLPI ELSSDKYGVA MPTLFPHNPV SWLFYLTRLI QVNTLYSVPQ SLETLIDVSY DSDGIFKVLD EESMAKLWRC GFFGKGTLSR SEPTWKARTI KRLNLDSNTA NALSMEEVTN KRRDERKKFK AERSRLQELE LKQRKGEISD LESSQLEQLR ETLASLRLID FKLSKDSFDR ETDLRFEDLD LIESNQLGRN LEFLQLQAIE TFFLKFAVNV IRVNDFSTKQ LFLECCRQSG ILKPTNKFVL DYVVYHHYRS LGWCVRSGVK FGCDMLLYKR GPPFIHAEYC ILVISNDDKA RYDWFEMAAK ARVIGTVKKT FVLVYVDSPT EERFNSILSS AYSDEGILFQ DLFKLYKVTE ILYRRWAPSK TRD
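The sequences above are displayed in space-separated blocks of ten residues, so they need to be joined before any computation. As a minimal sketch, the following strips that grouping and computes the GC fraction of a DNA string. The helper names are hypothetical, and the example uses only the first three blocks of the gene sequence, so its result is not expected to equal the GC content reported for the whole gene.

```python
def clean_sequence(raw: str) -> str:
    """Join whitespace-separated blocks into one uppercase sequence string."""
    return "".join(raw.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    if not dna:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in dna if base in "GC")
    return gc_count / len(dna)


# First three ten-base blocks of the Gene sequence field.
raw = "ATGTCAAAGC GTAATAACAA GAAACTTCTC"
dna = clean_sequence(raw)
print(len(dna))                    # 30
print(round(gc_fraction(dna), 2))  # 0.33
```

Applying `clean_sequence` to the full Gene sequence field and passing the result to `gc_fraction` would give a value to compare against the GC content field.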