Gene PICST_21178 details


Gene Information

Locus tag: PICST_21178
Symbol:
ID: 4839094
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009045
Strand:
Start bp: 1596282
End bp: 1597442
Gene Length: 1161 bp
Protein Length: 261 aa
Translation table: 12
GC content: 42%
IMG OID: 640390409
Product: predicted protein
Protein accession: XP_001385001
Protein GI: 150865680
COG category:
COG ID:
TIGRFAM ID:
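
The coordinates and lengths above can be cross-checked with a little arithmetic. The following is a minimal Python sketch, assuming the start and end positions are 1-based and inclusive (an assumption, though it agrees with the stated gene length). The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 1596282
end_bp = 1597442
protein_length_aa = 261

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length_bp = end_bp - start_bp + 1

# Each amino acid is encoded by one codon of three bases.
coding_bp = protein_length_aa * 3

# Bases in the span that the protein length does not account for.
remaining_bp = gene_length_bp - coding_bp

print(gene_length_bp)  # 1161
print(coding_bp)       # 783
print(remaining_bp)    # 378
```

The 378 bp that the protein does not account for is consistent with the gene being marked as spliced: part of the genomic span is presumably intronic.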


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.132821
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCGTTTG AGATCAATTG GGAGAACCTC ACTTCTGACA GCTCGATTAA CGAGTCGCTC 
AAAGAGTTTC TCGATCGCCA ATTCCAGAAT ATTTCACTCC CTTCGTATAT AGCTAATCTA
TCAGTGACCA ACTTCTCAGT TGGCGATATT CCACCAGAGA TCACCATACG ACACATTGGA
GACCCGTTTG ACGAGTTTTA TGAAGACGAA AACGACGAAG GGCTGAGCGG TCCAGAACGC
GTCTCTTCCA ATTCGAATAT GAACACAAAA GAGACTAACT ACATGTCTAG TGATGATGAA
GACGACGATG AGGATAATGA TCTTTCAACT ATAGCAGAGG ATTCACACCT CAACAGTTTT
AGTCATAGCA GCACACTTTA CCATTCACAC GAGCAAAGTC CTCCTCCGGG ACCAGCCCCA
ACTCCGCCCC TTCTTCTGCG TTCTAGAACA TCACTGGATC CCATTTCATA CATTATGGCC
AACACTAGTC TCAACTACTT ACACAACTAT AATATCAACA ATATTGGATT GGGACATGCT
CCTAGCGGAA CTGAGACACC GACAACTATT CTCAATCAAA ATGCCTTGAC CAACGCCAAA
AATTCAAGAG TCATATCCAG TCTTCAAAAA ACTACCAGAG GAGAAAATGA CATACAAATC
ATAGCCGAAA TAGAATATAG TGGCAATCTC CATGTAGACT TGATAGTGAA TCTTTTGGTA
AACTACCCTT CTCCTAACTT CATTTCGTTG CCTATCAAGT TGCACATTAC TGATATTGTC
ATACATTCGA TTGCTACTAT TGCCTACTTG AAGAAGGCGG TGTACTTTTC ATTTCTCTGT
GACATCAACG AATCTACACC AGACTACTTT TCCACTTCTT CGTCCAGCTC TGTCTCGACT
TCTACTGCAG CACCAGCTAC ACCAACGACA TATAATTCTG GTGGGAATTT TGTCGATTAT
ATTGCTGATC CCAACAACCG TGAGAGAATC GATATCGTAA AGAAGATCAA AATCGAGTCG
GAGATAGGAG AACTCGAGAA CAACGTCTTG AGGAATGTTG GTAAAGTAGA AAAGTTTCTT
ATTGAACAGC TAAGAAATAT CATTCGTGAA GAATTGGCAT GGCCTAGTTG GATTTGTATA
GACATGAGTG AAGATGAAGA C
 
Protein sequence
MSFEINWENL TSDSSINESL KEFLDRQFQN ISLPSYIANL SVTNFSVGDI PPEITIRHIG 
DPFDEFYEDE NDEGSSGPER VSSNSNMNTK ETNYMSSDDE DDDEDNDLST IAEDSHLNRE
NDIQIIAEIE YSGNLHVDLI VNLLVNYPSP NFISLPIKLH ITDIVIHSIA TIAYLKKAVY
FSFLCDINES TPDYFSTSSS SSRIDIVKKI KIESEIGELE NNVLRNVGKV EKFLIEQLRN
IIREELAWPS WICIDMSEDE D
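
The record lists translation table 12, the alternative yeast nuclear code, in which the codon CTG is read as serine rather than leucine. The sketch below illustrates that difference on two short fragments copied from the gene sequence above. It is a hand-rolled illustration, not the database's own pipeline: `build_table`, `translate`, and the table constants are hypothetical helper names, and the codon table is the standard code with a single CTG override.

```python
from itertools import product

# Standard genetic code, with codons ordered by base T, C, A, G at each position.
BASES = "TCAG"
STANDARD_AA = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def build_table(overrides=None):
    """Return a codon -> amino acid dict, optionally overriding some codons."""
    codons = ("".join(c) for c in product(BASES, repeat=3))
    table = dict(zip(codons, STANDARD_AA))
    table.update(overrides or {})
    return table


STANDARD_TABLE = build_table()
# Translation table 12 differs from the standard code in reading CTG as Ser.
TABLE_12 = build_table({"CTG": "S"})


def translate(dna, table):
    """Translate DNA in frame 1, ignoring whitespace and any trailing partial codon."""
    dna = "".join(dna.split()).upper()
    usable = len(dna) - len(dna) % 3
    return "".join(table[dna[i:i + 3]] for i in range(0, usable, 3))


# First 30 bases of the gene sequence (spacing as shown in the record).
first_30 = "ATGTCGTTTG AGATCAATTG GGAGAACCTC"
print(translate(first_30, TABLE_12))  # MSFEINWENL

# An in-frame fragment from the fourth sequence line that contains a CTG codon.
ctg_fragment = "GACGAAGGGCTGAGCGGT"
print(translate(ctg_fragment, TABLE_12))        # DEGSSG
print(translate(ctg_fragment, STANDARD_TABLE))  # DEGLSG
```

Because the gene is marked as spliced, translating the full 1161 bp gene sequence in one pass would not reproduce the 261 aa protein. The intronic portions would need to be removed first, and their boundaries are not given in this record.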