Gene PICST_39888 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_39888 
Symbol 
ID4851790 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009068 
Strand
Start bp2835894 
End bp2837018 
Gene Length1125 bp 
Protein Length374 aa 
Translation table 
GC content42% 
IMG OID640393498 
Productpredicted protein 
Protein accessionXP_001386880 
Protein GI126275609 
COG category[R] General function prediction only 
COG ID[COG0218] Predicted GTPase 
TIGRFAM ID[TIGR00231] small GTP-binding protein domain 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.564657 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTTGTAA GGCCGTCTTC GAGGTATCTA CAGCCTTGTA ATCATGTTCA AAGACGATAT 
GCTTCCATCA ACTATTTGGT GGCTGCTTTG GCGCAAAATA AAGCCAAATT TCCTGAAATT
TCACCAGTGA AAAATTCATC CAAATCCAAA CCTATAGCTA AAGAGTTACC TCCCAAGAAC
CTTCTGAGAC TTGTGATAAC TCCCTCGGCT TTGACGGAGC ATTTTCGTCA GGCGGAGTAT
ACCAATTTTT CAGTCCAGCA AATCAGCGAA GCCCAGTCAT TCTTCAACCG TGCCAAAGTC
AGCTTGGAAT GGACATTAGC GGACTACGAA GAAATCCCAG ATATCAAATA TGAGCGATTG
CTTGAAAAGA GAATAACCAG TTTAAATGAA ATTGACCCCT ATTACAAGAC GAAGTACCAC
GAATCAATGC TTAATTCAAA GAAAACGTTC GGAATTCAGC CTGAGTTGTT GCGGCCACTT
CCTGAAGTTC TTCTTTTAGG ACACACCAAT GTAGGAAAAT CGTCGCTTCT CAACAATTTG
ATTGTCAACT CCGATGCTTC TGAATATGCC TATGTTTCCC AGAGAGCAGG ATATACTAAA
ACTATCAATT GCTACAATAT TGGAAGAAAA TTGAGGGTGA TAGATAGTCC TGGGTATGGC
CAATTTGGTG AAGCTAAACA GGGCAAGGTC GTTCTAGACT ATATCAGTAA ACGCCATTTG
CTTAGAAGGG TATTCATTTT GATTGATAGT GTAGAAGGGT TCAGGGTGGA AGATATGCAG
ATGATGGACC ATCTCATTTC TGAGGGAGTT CCATTTGAAG TTGTCTTCAC CAAGACAGAT
GCCGTCATTG GCAAGTATAT GCCCAAGAAA GGCATTTTCA ACTCCCAGAA CAACAAAAAA
AGTGATCCAC AGAAGAGGGC AGAAATGAGT GATATGATAG CTAAAAGCAA CGAACAGGTT
ATTGCATACT ACACCAGAAT AATTCACGAA GCCAAGTTAC ACGAGGTGGT GACTGTGCCA
AGGTTACTCT TCAATAATGC CGCTACCAAC CGATACATAG ACAAGACCCA TGGGTTCAAG
GAAGTACGAA CCACCATCAT GGAAAGCTGT GGGTTGTTGA AATAG
 
Protein sequence
MLVRPSSRYL QPCNHVQRRY ASINYLVAAL AQNKAKFPEI SPVKNSSKSK PIAKELPPKN 
LLRLVITPSA LTEHFRQAEY TNFSVQQISE AQSFFNRAKV SLEWTLADYE EIPDIKYERL
LEKRITSLNE IDPYYKTKYH ESMLNSKKTF GIQPELLRPL PEVLLLGHTN VGKSSLLNNL
IVNSDASEYA YVSQRAGYTK TINCYNIGRK LRVIDSPGYG QFGEAKQGKV VLDYISKRHL
LRRVFILIDS VEGFRVEDMQ MMDHLISEGV PFEVVFTKTD AVIGKYMPKK GIFNSQNNKK
SDPQKRAEMS DMIAKSNEQV IAYYTRIIHE AKLHEVVTVP RLLFNNAATN RYIDKTHGFK
EVRTTIMESC GLLK