Gene PICST_38031 details


Gene Information

Locus tag: PICST_38031
Symbol:
ID: 4851362
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009068
Strand:
Start bp: 1604985
End bp: 1606136
Gene Length: 1152 bp
Protein Length: 383 aa
Translation table:
GC content: 43%
IMG OID: 640393070
Product: predicted protein
Protein accession: XP_001387539
Protein GI: 126274403
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG1676] tRNA splicing endonuclease
TIGRFAM ID: [TIGR00324] tRNA intron endonuclease
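
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS with one terminal stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


length_bp = gene_length(1604985, 1606136)
print(length_bp)                  # 1152, matching "Gene Length"
print(protein_length(length_bp))  # 383, matching "Protein Length"
```

Under these assumptions, the reported gene length (1152 bp) and protein length (383 aa) agree with the start and end coordinates.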


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCAAAGC GTAATAACAA GAAACTTCTC AACCAGATAT ATTCTCGCCC GCTTCCGATC 
GAGTTGTCTT CTGACAAGTA CGGGGTGGCG ATGCCGACGT TGTTTCCACA TAATCCTGTT
TCGTGGCTCT TTTATTTAAC TAGATTGATT CAAGTAAATA CTTTGTATTC TGTCCCCCAG
TCTCTGGAGA CACTAATAGA CGTTAGCTAC GACTCTGATG GCATATTTAA GGTGCTGGAT
GAAGAGTCAA TGGCAAAATT GTGGAGATGC GGCTTTTTTG GGAAAGGTAC TCTCTCTAGA
TCAGAACCAA CGTGGAAGGC ACGTACCATC AAGAGATTGA ACTTGGACTC GAACACAGCC
AATGCTTTAT CGATGGAAGA AGTAACCAAC AAGAGACGAG ACGAGAGGAA GAAGTTCAAG
GCTGAAAGGT CTAGGCTCCA GGAACTCGAG CTTAAGCAAC GTAAGGGCGA AATTTCGGAT
CTAGAATCCT CACAGTTAGA ACAACTCAGA GAAACCCTAG CATCGCTCAG ATTGATCGAC
TTCAAATTGT CAAAAGATTC TTTTGATAGA GAAACAGACT TGAGGTTCGA GGATTTGGAT
CTAATAGAGT CCAACCAGCT TGGACGGAAC CTTGAATTCT TACAGTTGCA AGCCATAGAA
ACGTTCTTCT TGAAGTTTGC TGTCAACGTT ATCCGTGTAA ACGACTTTTC CACAAAGCAA
TTGTTTCTAG AATGTTGTCG TCAATCAGGA ATACTTAAGC CTACAAACAA GTTTGTTTTA
GATTATGTTG TATATCACCA TTATCGTTCG CTTGGATGGT GTGTGAGATC TGGAGTCAAA
TTCGGCTGTG ATATGTTGCT CTACAAAAGA GGTCCACCAT TTATCCATGC CGAGTATTGT
ATTTTGGTTA TTTCCAATGA TGATAAGGCA AGATACGACT GGTTTGAAAT GGCTGCGAAA
GCCAGAGTCA TCGGGACCGT GAAGAAAACC TTTGTGCTCG TGTACGTCGA TTCCCCGACA
GAAGAAAGGT TCAACTCGAT ATTGAGCAGC GCATATTCTG ACGAAGGTAT ACTCTTCCAG
GACTTGTTCA AGTTGTATAA AGTCACTGAA ATTCTTTACC GGAGATGGGC TCCTAGCAAG
ACTAGGGACT GA
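
The gene sequence is displayed in space-separated blocks of ten bases, so the whitespace has to be removed before any computation. A minimal sketch of that cleanup, plus a GC-fraction calculation of the kind that would produce the "GC content" field, follows. The helper names are hypothetical, and the example runs on the first two blocks only rather than the full gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(raw.split()).upper()


def gc_fraction(raw: str) -> float:
    """Fraction of bases that are G or C in a whitespace-formatted sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First two display blocks of the gene sequence, used only as a small example.
first_blocks = "ATGTCAAAGC GTAATAACAA"
print(clean_sequence(first_blocks))  # ATGTCAAAGCGTAATAACAA
print(gc_fraction(first_blocks))     # 0.3
```

Pasting the complete gene sequence into `gc_fraction` gives a value that can be compared with the GC content reported in the record.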
 
Protein sequence
MSKRNNKKLL NQIYSRPLPI ELSSDKYGVA MPTLFPHNPV SWLFYLTRLI QVNTLYSVPQ 
SLETLIDVSY DSDGIFKVLD EESMAKLWRC GFFGKGTLSR SEPTWKARTI KRLNLDSNTA
NALSMEEVTN KRRDERKKFK AERSRLQELE LKQRKGEISD LESSQLEQLR ETLASLRLID
FKLSKDSFDR ETDLRFEDLD LIESNQLGRN LEFLQLQAIE TFFLKFAVNV IRVNDFSTKQ
LFLECCRQSG ILKPTNKFVL DYVVYHHYRS LGWCVRSGVK FGCDMLLYKR GPPFIHAEYC
ILVISNDDKA RYDWFEMAAK ARVIGTVKKT FVLVYVDSPT EERFNSILSS AYSDEGILFQ
DLFKLYKVTE ILYRRWAPSK TRD
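
The protein sequence can be compared against a translation of the gene sequence. The sketch below is an illustration and not the database's own method: the record leaves the "Translation table" field empty, so the code assumes the standard genetic code, and codons that are read differently under an alternative nuclear code would need a different table. All names are hypothetical.

```python
# Standard genetic code, with codons enumerated in T, C, A, G order at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(raw: str) -> str:
    """Remove display whitespace and upper-case the sequence."""
    return "".join(raw.split()).upper()


def translate(raw_dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    dna = clean(raw_dna)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three display blocks of the gene and the last twelve bases, as small checks.
print(translate("ATGTCAAAGC GTAATAACAA GAAACTTCTC"))  # MSKRNNKKLL
print(translate("ACTAGGGACT GA"))                     # TRD
```

These two fragments reproduce the first ten residues (MSKRNNKKLL) and the last three residues (TRD) of the listed protein. Translating the full gene sequence and comparing it with `clean()` of the protein sequence extends the check to the whole record.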