Gene PICST_40963 details


Gene Information

Locus tag: PICST_40963
Symbol:
ID: 4837211
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009042
Strand:
Start bp: 699499
End bp: 700569
Gene Length: 1071 bp
Protein Length: 356 aa
Translation table: 12
GC content: 43%
IMG OID: 640388526
Product: predicted protein
Protein accession: XP_001382899
Protein GI: 126132748
COG category: [R] General function prediction only
COG ID: [COG1094] Predicted RNA-binding protein (contains KH domains)
TIGRFAM ID:
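
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, not part of the original record. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the unspliced CDS ends in a stop codon that is not counted in the protein length. The function names are hypothetical.

```python
# Values copied from the Gene Information fields above.
START_BP = 699499
END_BP = 700569
GENE_LENGTH_BP = 1071
PROTEIN_LENGTH_AA = 356


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both comparisons hold for this record.
print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
print(protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 700569 - 699499 + 1 gives 1071 bp, and 1071 / 3 - 1 gives 356 aa, matching the reported values.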


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGTTTCGA CTCATAACCG TGACAAGCCC TGGGATACTG CTGACATAGA TAAGTGGGCA 
CTTGAAGAAT TCAAGCCCGA GCACAATGCC TCAGGACAGC ATTTCACTGA GGAGTCAAGT
TTTATGACTC TTTTCCCTAA GTACAGAGAG CAATATTTAC GTAGTATATG GGCAGATGTC
ACAAAGTCTC TTGAGAAGCA TTTTATCAAG TGTGAGCTAG ACTTGGTGGA GGGTGCTATG
ACTGTAAAGA CCACTACCAA GACGTTTGAT CCGGCTATAA TTTTAAAAGC CAGAGACTTG
ATCAAATTGT TAGCACGTTC TGTGCCTTTT CCACAAGCTG TTAAGATTTT GCAAGATGAC
ATTGCCTGTG ATGTAATCAA GATCGGTAAC TTTGTAGCTA ACAAGGATCG TTTTATAAAA
AGAAGACAGA GATTGGTGGG ACCTAATGGG AACACCTTGA AAGCATTGGA ATTGCTTACG
AAGTGCTATA TTTTGGTCCA GGGAAATACT GTGAGTGCCA TGGGTCCATT CAAGGGTTTG
AAGGAAGTCA GAAGAGTAGT TGAGGATTGT ATGAGAAATG TGCATCCTAT CTATTACATC
AAAGAGCTTA TGATTAAGCA AGAGTTGAGC AAGAAGCCTG AGTTGGCCGA GGAAGACTGG
TCGAGATTCT TGCCTTCTTT CAAGAAGAGA AATGTTGCCC GTAAGAAGGC CAAGTCGTCC
AAGAGAGAGA AGAAGGTGTA CACTCCATTC CCACCAGCAC AAACTCCACG TAAGGTTGAT
TTGCAAGTGG AAAGTGGTGA GTACTTCTTG GGCAAGAAGG AAAAGGCAAT GAAGAAGTTG
AAGGAAAAGA GGGAAAAGCA AGAAGAAGCA TCTGTGGCTA GAAAGCAAGA GAGAGAGAAG
GATTACGTAG CCCCAGAAGA AGAAAAGTAC GAGAACAAGC TTGCCAAGAA GGAGAAGAAG
GAGAAGAAGG AAAAGAAGGA AAAGAAGGAA AAGAAGGAAA AGAAGGAAAA GAAGAAGAGA
TCCGCCAGCG AGGAAGAAGA AAGGGGTTCT AAGAAGTCCA AACATGCATA A
 
Protein sequence
MVSTHNRDKP WDTADIDKWA LEEFKPEHNA SGQHFTEESS FMTLFPKYRE QYLRSIWADV 
TKSLEKHFIK CELDLVEGAM TVKTTTKTFD PAIILKARDL IKLLARSVPF PQAVKILQDD
IACDVIKIGN FVANKDRFIK RRQRLVGPNG NTLKALELLT KCYILVQGNT VSAMGPFKGL
KEVRRVVEDC MRNVHPIYYI KELMIKQELS KKPELAEEDW SRFLPSFKKR NVARKKAKSS
KREKKVYTPF PPAQTPRKVD LQVESGEYFL GKKEKAMKKL KEKREKQEEA SVARKQEREK
DYVAPEEEKY ENKLAKKEKK EKKEKKEKKE KKEKKEKKKR SASEEEERGS KKSKHA
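
The record lists translation table 12, so the protein sequence should follow from the gene sequence under that genetic code. The sketch below is illustrative only and is not part of the original record. It assumes the amino-acid string for NCBI translation table 12 (alternative yeast nuclear code, with codons ordered over the bases T, C, A, G), which differs from the standard code in reading CTG as serine rather than leucine. The names `TABLE_12_AAS`, `CODON_TABLE`, and `translate` are hypothetical. It translates only the first line of the gene sequence above.

```python
BASES = "TCAG"

# Assumed amino-acid string for NCBI translation table 12, codons ordered
# TTT, TTC, TTA, TTG, TCT, ... over the bases T, C, A, G.
TABLE_12_AAS = "FFLLSSSSYY**CC*WLLLSPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

# Map each codon to its one-letter amino acid ('*' marks a stop codon).
CODON_TABLE = {
    a + b + c: TABLE_12_AAS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1, ignoring whitespace and stopping at a stop codon."""
    dna = "".join(dna.split()).upper()
    residues = []
    for pos in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[pos:pos + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First line of the gene sequence, copied from the record.
first_line = "ATGGTTTCGA CTCATAACCG TGACAAGCCC TGGGATACTG CTGACATAGA TAAGTGGGCA"
print(translate(first_line))
```

The output corresponds to the first 20 residues of the protein sequence (MVSTHNRDKP WDTADIDKWA, printed without the space). The same function can be applied to the full gene sequence to compare against the full protein sequence.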