Gene PICST_42420 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_42420 
Symbol 
ID4837567 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009042 
Strand
Start bp853175 
End bp854281 
Gene Length1107 bp 
Protein Length368 aa 
Translation table12 
GC content42% 
IMG OID640388882 
Productpredicted protein 
Protein accessionXP_001382937 
Protein GI150864205 
COG category[R] General function prediction only 
COG ID[COG1163] Predicted GTPase 
TIGRFAM ID[TIGR00231] small GTP-binding protein domain 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.257266 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.210698 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGTATCC TTGAGAAGAT CGCTCAGATC GAACAGGAGT TGGCGAGAAC TCAAAAGAAC 
AAAGCAACTG AGTACCATAT TGGTCTTTTG AAAGGGAAAC TTGCCAGATA CAGAAGAGAA
CTTTTAGAAC CACAACCAGG ACAAGGTGGT GGTGGAGGAG GTCAAGGATT TGAAGTTGCT
AAAGCTGGTG ATGCCCGTGT TTCGTTAATT GGGTTTCCCT CGGTAGGAAA ATCTTCTTTT
TTGTCGAAAG TGACCAACAC AAAGTCAGAG GCTGCGAACT ATGAGTTCAC AACTTTGACA
TCTGTAGGAG GAATTCTTGA GTACAATGGT GCTGAGGTAC AAATTGTAGA TTTACCTGGT
ATTATCAAAG CTGCTGCCAA AGGTAAAGGT AGAGGTAGAC AAGTCATTGC CGTTTCTAGA
ACGTCGGACT TGATTATGAT GGTATTGGAT GCTACCAAAG GTGGTGACCA GAGACTGATT
TTGGAGAATG AATTGGAATC TATGGGAATT AGATTGAATA AGCAAAAGCC CAATATTTCT
CTCAAGTATA AGAAGACTGG TGGAGTCAAG ATGAACCTGA TAACGCCTCC CAAGTATTTG
GATGAAAAAC TTGTGTCGTC CATATTGAAA GACTACAAGA TCCACAATGC GGATGTACTC
ATCCGAGACG AAAATGTGAC TATTGACGAT TTTATCGATG TGATTAACGA GCAGCATATT
TCGTATATCA AGTGTCTTTA TGTGTACAAC AAAATCGATG CTGTGTCGTT GGAAGAGTGT
GACCGTTTGG CCAGAGAACC CAACACTGTG GTGATGTCGT GTGAACTAGA TCTCGGAATT
GAGGATCTCA AGGAAGAAAT ATGGAGAAAG TTGGATCTTC TCAGATTGTA TACCAAGAGA
AGAGGTGTGG AGCCTAACTT AGATGATCCC ATGGTTGTCA GAAGCAATTC AACTGTCAAG
GAAGTCTGTG ACGCCATTCA CAGAGACATG AAGAATCAGT TCAAGTATGC CAATGTCTGG
GGATCCAGTG CTAAGCATTC ACCACAGAAG TGTGGATTGA GCCATCCTGT TAACGACGAA
GATGTAGTGG AGATAGTCAC GAAGTAA
 
Protein sequence
MGILEKIAQI EQELARTQKN KATEYHIGLL KGKLARYRRE LLEPQPGQGG GGGGQGFEVA 
KAGDARVSLI GFPSVGKSSF LSKVTNTKSE AANYEFTTLT SVGGILEYNG AEVQIVDLPG
IIKAAAKGKG RGRQVIAVSR TSDLIMMVLD ATKGGDQRSI LENELESMGI RLNKQKPNIS
LKYKKTGGVK MNSITPPKYL DEKLVSSILK DYKIHNADVL IRDENVTIDD FIDVINEQHI
SYIKCLYVYN KIDAVSLEEC DRLAREPNTV VMSCELDLGI EDLKEEIWRK LDLLRLYTKR
RGVEPNLDDP MVVRSNSTVK EVCDAIHRDM KNQFKYANVW GSSAKHSPQK CGLSHPVNDE
DVVEIVTK