Gene PICST_31452 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_31452 
Symbol 
ID4838976 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009044 
Strand
Start bp836762 
End bp838561 
Gene Length1800 bp 
Protein Length599 aa 
Translation table12 
GC content44% 
IMG OID640390291 
Productpredicted protein 
Protein accessionXP_001384462 
Protein GI150865306 
COG category[R] General function prediction only 
COG ID[COG0724] RNA-binding proteins (RRM domain) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.277299 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTGATT CATCGACTAA TGAAGAGTCG GTCTCGTCCA GCGGCGAATC CGTCTCCACT 
GATCTTATCA GCAAAGACGA AGACCTCAAG AATGCTGAAC AGAACCTGGC TACTGATTAC
AACGAGACTG TGGGCTCTTC CAAAAAGATC CACAACACAG CTATCGGCGA ACTATTGCAG
CCGTTGAAAG AGTCTGCCGA AGACGACAAC GATAAACAAG ACGCATCGGA GCCTCTTCCC
AGAGGCAGAC CATCTTCTTG TGTCTTTGTT GCCAGTTTAT GTTCTTCGCG ATCTGATGAT
GAATTGTGTG TCTCCGTCAC CAATCACTTT CTGAAGTGGG GCAAGTTGTC CACAGTCAAA
GTTCTACGTG ATACGTCTAA TCGTCCGTAT GCGTTTGTTC AGTATACAAA CGATCGTGAT
TCCAAATTGG CTATCAGGAT GGGCCACGAT TCTGAATTGG ATGGAAGAAA CATCCGGTGT
GAAGCAGCTA AAGTAAACAG AACTTTATTC ATAAGTTCAA ACGAATTAGT TCATGAGAAC
AAAATGAGTG AATTTCTCAG TGGGTTTGGT GAGATTGAGC AGATCCTCTC AAGCAACAGC
TCAGGACATC TTGTAAGATA CAAAGTGAGC AAGTCGTCTA AGTATTGGTA CTGTAAATTT
GTATATCGTG AAGACGCCAT TAGAGCATTT GCCAATATCA CGGAGTCTTG TGCCTATAAA
GTAGATTGGA CCCAGAATAT TGAGGAGGAT GTTCATTATG GACACAGAGA TGAGGTCTCC
CAGGAGACGC AAGTAACCTT TGATAAATTC TCCATATTTG TAGGCCAATT GCTGGCTGAA
GTCAGCGAAA AAGATCTTGA TAGTCGATTT TCACGTCACG GCACTATTGC TGATATCAAT
CTTATCCGCA AAGGATCCAA TCTGTTTGCC TTTGTCAGAT TTGAAAAGGA ATCCAGTGCA
GCAGGAGCCG TAGAAAGAGA AAACCATGCC ATGTTTAAAG GAAAGACTTT GCATGTACAA
TACAAGGAGA CTCATTCCTT AACAAAGTCT GTTAAAAGTA AGCCGGTTGG ATTGGCATTT
GCACCACCTC CTATTCATTT ATCACGTAAA GTTGAGAACT GTTACAGAAA AACTTCAGGT
AATAACTTTC CTCCGTTGAA ACCGAGATTT AATAACTATT CTTCGAATAA CGCCAATAGA
GGAAGCTATC AACTGTATAG AGGGAACTAC AAGCTGCATT ATTCTAATGT ACCTAATCGT
GTGAGAAGAT TTACATCTAT GAATACCACA CCTAGTGGTT CAGCCCATGG AAAAGGAGAA
ACTACCAGTG AGAGAAGTGC CTGGAGCCCT GAAGCCAAAG GAAGCTTGGA AGCAAATGGA
TCACCTCCTA CTACTCCAGT TCCAAAGCAG GCTGATATCC ATGCATGGAG AAATGTATAT
GGACATCAAG GAGGATCGAA CCCAGATAGA CCGGACCCAA GTAATCCCTA CCATAAATTG
GCAAGACCAG ATATGAGTGG GGGAGCTCCT CCCCCTAAGA ATGGAATTCC TTACTTTTAC
TATATTCCCA ATCCTGAAGT TTCCAACATG CATGGAAGCG GATTCATGGG AGCTCCAGTA
AGTTCAGGCA ATCCAGTGGG ACTAAGTCCC AACCCCAAGG GCTTCTATCA GCCATACTAT
GTACCATATG ATACGCATGA GTACGGGCCA GCGGCAGCAG CAGCAGCAGC AGCCGCGTAT
TCGATGCAAT ATCCAATGTA TTATCAGGCG AAAGAAAATA ACGCAAATGA TGAGAACTAA
 
Protein sequence
MSDSSTNEES VSSSGESVST DLISKDEDLK NAEQNSATDY NETVGSSKKI HNTAIGELLQ 
PLKESAEDDN DKQDASEPLP RGRPSSCVFV ASLCSSRSDD ELCVSVTNHF SKWGKLSTVK
VLRDTSNRPY AFVQYTNDRD SKLAIRMGHD SELDGRNIRC EAAKVNRTLF ISSNELVHEN
KMSEFLSGFG EIEQILSSNS SGHLVRYKVS KSSKYWYCKF VYREDAIRAF ANITESCAYK
VDWTQNIEED VHYGHRDEVS QETQVTFDKF SIFVGQLSAE VSEKDLDSRF SRHGTIADIN
LIRKGSNSFA FVRFEKESSA AGAVERENHA MFKGKTLHVQ YKETHSLTKS VKSKPVGLAF
APPPIHLSRK VENCYRKTSG NNFPPLKPRF NNYSSNNANR GSYQSYRGNY KSHYSNVPNR
VRRFTSMNTT PSGSAHGKGE TTSERSAWSP EAKGSLEANG SPPTTPVPKQ ADIHAWRNVY
GHQGGSNPDR PDPSNPYHKL ARPDMSGGAP PPKNGIPYFY YIPNPEVSNM HGSGFMGAPV
SSGNPVGLSP NPKGFYQPYY VPYDTHEYGP AAAAAAAAAY SMQYPMYYQA KENNANDEN