Gene PICST_46516 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_46516 
Symbol 
ID4839103 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009045 
Strand
Start bp627169 
End bp628224 
Gene Length1056 bp 
Protein Length351 aa 
Translation table12 
GC content38% 
IMG OID640390418 
Productpredicted protein 
Protein accessionXP_001385128 
Protein GI150865781 
COG category[R] General function prediction only 
COG ID[COG1100] GTPase SAR1 and related small G proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.85819 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTCGGGC AAGTAGTTAT TGGACCTCCA GGATCAGGAA AGTCTACATA TTGCTATGGA 
ATGCACCAAT TCATGTCTGC AATAGGGAGA AAACTGTGTA TAATCAATCT TGATCCCGCT
AATGACCGCT TACCATATCC AGATTGTGCA TTGGATATAC GTGACTTTAT AACTTTGGAA
GAAGTCATGG AGGAATTAAA ATTAGGCCCA AATGGAGGTC TTATGTACGC ATTAGAAAGT
TTGGATGAAA CAGGAATAGA CCACTTTATC GACATGATAA CTGAGTTGGT TGAAGATCAG
AACTACTTAA TTTTTGATAG TCCAGGACAA GTAGAATTAT TCACCCATCA CAATTCGATC
TATAAGATAT TTAAGAGGTT AACAAATACA AAAAGGCTTC GGTTATGTGT GGTGTTGCTT
GTTGATTCCT TATACCTTAC CAGTCCTTCA CAATACATTT CAATTTTACT ACTAACTTTA
AGATCTATGT TGCAACTAGA TTTTCCTCAA GTGAATGTGA TATCAAAGAT AGACATGTTG
AAAAATTACG GGGAATTGCC GTTTAGACTC GACTACTATG CTGAAGCTCA AGATTTGGAA
CAACTAACTC CTTACTTGGA GAAAGAGTCT AATTCAGTTC TAGGTAGAAA CTATGTTAGA
TTAACTAAAA TGATTGGAGA GTTGGTAGAA GATTTCAACC TTGTGTCATT TGAAGTTTTG
TCCGTGGAGA ATAAGCAAAG TATGATAAAT TTACTCAGCG TAATAGATAA AGCAAATGGC
TACAGCTTTG GAAGTGAGAT TGGAGGAGAC TCTATCTGGA GCGAAGCTAC GAGGCAAGGT
GGGGCTTCAG GATATGCGGC GGTAGATATC CATGAACGTT GGATTGAATA TAAAGATCAA
TATGACCAAG AAGAAAGAAA GTCAGAAGAA AGACTTGAAC AAAGTGATAA TGAAGGTGAT
GCTACACAGC CTTCTATGAC AGAGGATGAA GAATGGGAAC TTGCCGTTCA TGAATGGGAG
AAGAATAGAG GAGCAAGTGG TCCTCTTTCT AGATGA
 
Protein sequence
MFGQVVIGPP GSGKSTYCYG MHQFMSAIGR KSCIINLDPA NDRLPYPDCA LDIRDFITLE 
EVMEELKLGP NGGLMYALES LDETGIDHFI DMITELVEDQ NYLIFDSPGQ VELFTHHNSI
YKIFKRLTNT KRLRLCVVLL VDSLYLTSPS QYISILLLTL RSMLQLDFPQ VNVISKIDML
KNYGELPFRL DYYAEAQDLE QLTPYLEKES NSVLGRNYVR LTKMIGELVE DFNLVSFEVL
SVENKQSMIN LLSVIDKANG YSFGSEIGGD SIWSEATRQG GASGYAAVDI HERWIEYKDQ
YDQEERKSEE RLEQSDNEGD ATQPSMTEDE EWELAVHEWE KNRGASGPLS R