Gene PICST_39616 details

Gene Information

Locus tag: PICST_39616
Symbol:
ID: 4851697
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009068
Strand:
Start bp: 2576435
End bp: 2577683
Gene Length: 1249 bp
Protein Length: 269 aa
Translation table:
GC content: 45%
IMG OID: 640393405
Product: predicted protein
Protein accession: XP_001387063
Protein GI: 126275298
COG category: [R] General function prediction only
COG ID: [COG0724] RNA-binding proteins (RRM domain)
TIGRFAM ID:
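
The length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based inclusive coordinates and that the protein length excludes a stop codon; the function names `span_length` and `coding_length` are hypothetical helpers, not part of any database API.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def coding_length(protein_aa: int, include_stop: bool = False) -> int:
    """Nucleotides needed to encode a protein (3 per residue, optional stop codon)."""
    return 3 * (protein_aa + (1 if include_stop else 0))


gene_len = span_length(2576435, 2577683)   # values from the record above
cds_len = coding_length(269)               # 269 aa, stop codon not counted

print(gene_len)            # 1249
print(cds_len)             # 807
print(gene_len - cds_len)  # 442
```

Under these assumptions the gene span is longer than the coding length implied by the protein, which is what one would expect for a record marked as spliced.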


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.312591
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCTGAAA ACGGTACAGA GACAATTGCG CCCAAGGCTG TCATCGAGGA CAGAATCTAC 
GTGGGCAATG TTGACTTCAA GGCCTCCGAG GAAGAGTTGA AGCTGTTCTT TGAAGGTTTA
AATGTGTAAG TATAGTATGG AGTTTATTAG CGAAGATTGA ATCATGTGAT GTGCCACTAA
GAGCAGAGAA AAGAGTTAGA AAGAGTAGAG ACTCGGAGAA GTTAGAGAAT GCGAAGCGTT
CAATTGCGTG TCTGTCACGG CAATGGAATG AGAGGAAATG TTGGCTGCTT CGATCACGGT
GTATATGGTT AAGTTCTGTG AGGCTACATT CTCTGCACTT TTGCAATGGT ACCATTTATT
GGAAATCGTA CACAAAAATG TCCACCCTTT TGGCACCACA ATGGTATTTG ATGGGGCACG
ACCTGTGGAG GATGCAGTAG ATATCGATTA GCGATTTGCA ATCAAAATGA TTTTGCAGTC
GTTTAATCGA TTCTAGCGCT TCACAGTTGG GGCAATTGGT TTAATCTTCA CTAGGGTTCA
TTAGGTTTTT TTTACTAACA TTTTTAGCAC CGAAGTTGAT ATTCCTTTTA AGGAAAACGT
CCGTGGTGAC AAGGTATTCA AGAGACACTT GGGGTTTGCC TTTGTCCAAT TTGAAACCAA
GGAAGATGCA GACAAGGCTC TTGCCGACTA CAATGGCCAG AAGTTCCAGA GAAGAAACAT
CTTCATCAAG AAGGCTGTTC CTCCTCCTAC TGAAGAGGAA AAGAAGGTCA GAGTAGAGGC
CTATAGAGCC AAAAGGGAGG CCCTCTTAGC TGACAAGGTC AAGAAGAAGG CCGAGGCTGC
TGCTGCCAAA AAGGAAGCCG ATGGTGCAGC ACCAAAGTCC GGAGCCGTGG ACTCGTCCAG
CGACGACAAG ACTCCCGAAG GCAAGGCTTC GAAGGATACC ATTTTTGTGA CCAATCTCAA
CTATGACGTC ACTGTCAAGG ACTTGAACGT TTTGTTCAAG GACTTGAAGC CCAAATGGAT
CCATGTTCCT GCTAGAAAGG TCCCCTTGCA CGTGTTGAAG AAGAACCACG GTCGTAGAAA
GAATAAGGGT ATTGCTTTCG TTAAGTTTTC CAGCGAAGAA ACACAGAAGC AGGCTGTTTC
TGAGTTTAAC GGAAAGGAAA TCAACGGAAG AGAAATCATT GTAGATATTG CTATTGACGC
CAGAACTCCA AAGGAAGAAG ATGTAAAGCA GGCCCAAGAA GCCGAAGCT
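
A sequence block in this grouped layout (ten bases per group, several groups per line) needs its whitespace stripped before any computation. The sketch below shows one way to do that and to compute a GC fraction; `clean_sequence` and `gc_fraction` are hypothetical helper names, and only the first line of the gene sequence is used as sample input.

```python
def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks from a grouped sequence block; uppercase it."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First line of the gene sequence above, copied as displayed.
first_line = "ATGTCTGAAA ACGGTACAGA GACAATTGCG CCCAAGGCTG TCATCGAGGA CAGAATCTAC"
seq = clean_sequence(first_line)

print(len(seq))                     # 60
print(f"{gc_fraction(seq):.1%}")    # GC fraction of this line only
```

Passing the whole gene sequence block to the same two functions would give a value comparable to the GC content field, assuming that field is computed over the full gene span.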
 
Protein sequence
MSENGTETIA PKAVIEDRIY VGNVDFKASE EELKLFFEGL NVTEVDIPFK ENVRGDKVFK 
RHLGFAFVQF ETKEDADKAL ADYNGQKFQR RNIFIKKAVP PPTEEEKKVR VEAYRAKREA
LLADKVKKKA EAAAAKKEAD GAAPKSGAVD SSSDDKTPEG KASKDTIFVT NLNYDVTVKD
LNVLFKDLKP KWIHVPARKV PLHVLKKNHG RRKNKGIAFV KFSSEETQKQ AVSEFNGKEI
NGREIIVDIA IDARTPKEED VKQAQEAEA