Gene PICST_46601 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_46601 
Symbol 
ID4839501 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009045 
Strand
Start bp1330392 
End bp1332434 
Gene Length2043 bp 
Protein Length680 aa 
Translation table12 
GC content42% 
IMG OID640390816 
Productpredicted protein 
Protein accessionXP_001385259 
Protein GI150865870 
COG category[S] Function unknown 
COG ID[COG0585] Uncharacterized conserved protein 
TIGRFAM ID[TIGR00094] tRNA pseudouridine synthase, TruD family 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.628004 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0180876 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCACAGA AGAGAGGTAT TGATTCAGAG CAGGTGCTTG CTTCTTCCAA GAAGTTCAAG 
GTGGAAACAC CAGAAGTTGC CAAGAAGTCT ATTCAGGAGC ATGAAGTAGG CATAACTTCT
TATATCAACA AGACCGAGAC CGGATTTACT GGTCTCATCA AGCTGCTCTA TTCAGATTTC
CAGGTCAACG AAATCGATGT CCTGGGTAAT GTCGTCCATT TGGTCGATGA AGGCATTGAT
GTAGGAAAGA GCAAGAAGGA AAGAAAGATG GAAAAGAGGG CCGAAGACAG AAAGGAGTTA
CAGGGCAAGA CTGAAGAAGA AATAGAAGCT ATCAAAGCCC AGAAAAAAGA AGAAGAAGAA
AACAAGTCCA AGTACGATTT ATCAGATGAA CATCGTCAAG AGCTTTTGAC TTACATCACC
AATGACGAAT TGGCCCAGAT AGAAGAGTTG TTCTCTACAG GAAACAACAT GGAAACAAAG
ACTTCCTTTG ATGATAAGCA ACAGCGTGGA AAATTGCACC AATTGCTTAG AGCAGCCTTC
CAGGGCAAGT TGGAGTCCAT TACTTCGCCA GAAAATACAT TCAAGATCGC CATAGCTAAG
AAAACTTCTA GAGGAAGACA GCATCCTCAA GAGAGCATGC ACCATGTTGA TGAGAACGGA
ATTTTGAACT ATGGATTGGG AGCTTTCAAG CCTTACTTGC ATTTCACTGT TTTCAAGGAA
AATAGAGAAA CTATGGAGGT AGCCTCCACC ATTTCCAAGT TCTTGAGAAT ACCTCACAAA
GCAATTAACT ATGCTGGTAC CAAAGACAGA AGAGGAGCCA CTTGTCAGAG ATTCAGTATT
CATAAAGGAA AGGTTGTTAG AGTCAACTCG TTGAACAAGG CCTTGAAGAA TACCGTTTTA
GGTGGATTCA CCTATGAAGA CTCTCCCTTG GGATTGGGTG ATTTGAAGGG TAACCAATTC
ACAATCACTA TCAGAGATGT TGAACCATTG GGCGGAGAGA ACCTTGCTGA AATTGTCGAC
AATAGTTTCA CATCGCTCAA GGAAAAGGGA TTTATTAACT ACTTTGGAAT GCAAAGGTTT
GGTTCTTTTT CGATCTCTAC ACATATGCTT GGTATTCATC TCTTGAAAGA AGAATGGAAA
GATGCCGCCG AGCTTATCTT GTCAGAACAG GATATCGTCG CACCAGACTC GATTGAAGCC
AGAAGAATCT GGGCGGAAAC CAAGAACCCA TCATTGACAC TTAAGAAGTT ACCTCACTAC
TTTTCTGCAG AAACAGCCAT CTTGAAGGTG TTAGACACTG AACAATTAGA CGAAAACGAA
GAGTATGGAA AGAATTCCTA TTTCAAGAGT ATCATGGCTA TTCCTAAGAA CTTGCGAATG
ATGTACTCGC ATGCCTACCA ATCGTACGTG TGGAACTTGG TTGCATCCAA GAGAATTGAA
TTGTTTGGTC TTGAGCTACA AGAAGGAGAT TTAGTCATTG ACGACGAAAT CAAGGTCAAA
TCTGACCAAG ATGACGATTT TGAAGAAGAT GTCAAAGTCA ACAGAGATGT CAAGGTTAAA
TCTATTACTA AGGAAGACGT GGAATCTGGA AAATATAGTA TCTACGACGT TGTTTTGCCA
ACTCCAGGAT TCAAGGTCCA GTATCCCACT AATGAGAAAT TGGCGCAAGT CTATGTTCAA
ACAATGGCAC AAGACGGCTT GGACCCATAT AAGATGGGCA GAAGAATCAA AGAGTTCTCG
TTAACTGGAT CTTACCGTAA CCTTATGACC AGGCCAGAGA ATTTGACCTA CAAGATTGTC
AAATACAGCG ACAACTCCGT TCCTTTGGTA AGAACAGATT TGGAGATCTT GAGACTCAAG
AAAGAAGGCA AGGATGTCGA AAGAATCATC AAGGTCGAAG GTGATGAAGC TACCAAGACT
GCTGTGGTTC TCACTATGCA ATTGGGAGTA AGCTCTTATG CCACTATGGC TATGAGAGAA
TTCATGAAGG CTGATACTTC CAGATGGAGC GAAAACATGA TGAAAAAAGA GGAGACGAAA
TAG
 
Protein sequence
MSQKRGIDSE QVLASSKKFK VETPEVAKKS IQEHEVGITS YINKTETGFT GLIKSLYSDF 
QVNEIDVSGN VVHLVDEGID VGKSKKERKM EKRAEDRKEL QGKTEEEIEA IKAQKKEEEE
NKSKYDLSDE HRQELLTYIT NDELAQIEEL FSTGNNMETK TSFDDKQQRG KLHQLLRAAF
QGKLESITSP ENTFKIAIAK KTSRGRQHPQ ESMHHVDENG ILNYGLGAFK PYLHFTVFKE
NRETMEVAST ISKFLRIPHK AINYAGTKDR RGATCQRFSI HKGKVVRVNS LNKALKNTVL
GGFTYEDSPL GLGDLKGNQF TITIRDVEPL GGENLAEIVD NSFTSLKEKG FINYFGMQRF
GSFSISTHML GIHLLKEEWK DAAELILSEQ DIVAPDSIEA RRIWAETKNP SLTLKKLPHY
FSAETAILKV LDTEQLDENE EYGKNSYFKS IMAIPKNLRM MYSHAYQSYV WNLVASKRIE
LFGLELQEGD LVIDDEIKVK SDQDDDFEED VKVNRDVKVK SITKEDVESG KYSIYDVVLP
TPGFKVQYPT NEKLAQVYVQ TMAQDGLDPY KMGRRIKEFS LTGSYRNLMT RPENLTYKIV
KYSDNSVPLV RTDLEILRLK KEGKDVERII KVEGDEATKT AVVLTMQLGV SSYATMAMRE
FMKADTSRWS ENMMKKEETK