Gene PICST_5333 details

Gene Information

Locus tag: PICST_5333
Symbol:
ID: 4851849
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009068
Strand:
Start bp: 3002081
End bp: 3003187
Gene Length: 1107 bp
Protein Length: 369 aa
Translation table:
GC content: 43%
IMG OID: 640393557
Product: predicted protein
Protein accession: XP_001387141
Protein GI: 126275780
COG category:
COG ID:
TIGRFAM ID:
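
The gene and protein lengths listed above follow from the start and end coordinates. A minimal Python sketch of that arithmetic, assuming the coordinates are 1-based and inclusive (the record does not state this; the variable names are illustrative only):

```python
# Coordinates copied from the record above.
start_bp = 3002081
end_bp = 3003187

# Assumption: both coordinates are 1-based and inclusive,
# so the span covers end - start + 1 bases.
gene_length = end_bp - start_bp + 1

# Each codon is three bases; a remainder of zero means the span
# holds a whole number of codons.
codon_count, leftover_bases = divmod(gene_length, 3)

print(gene_length, codon_count, leftover_bases)
```

Under that assumption the span is 1107 bp and holds 369 whole codons, which matches the listed protein length of 369 aa. Every codon in the span is therefore translated, so the listed coordinates leave no room for a terminal stop codon.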


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.684837
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GATCCAAAAG TATTTGCTCC AAAGGCAGTA GCAAACCCAC CAGCAGAACT TGGCTTTAGA 
TTGAAGCTAA TAGTACAACT TGTAAAGGCT TATAGACAGC ACCATCCTGA TATCCAAACT
CCTATAACAC GGGCCATAGA GGAAGAATAC AAAGTAGCTA AAGTTTCATC TAGAACGACT
TACTCAGCAT CGATCAAGAA AGTGATCTAT GCGGCTCTTC ATCCAGAGAA AGCTAAGACG
CCCAAGGAGA ACGGGCCCAC AGAAGAGCAA TACAAAAGGC TATTGTCCGA ACTCGTCATC
CCCGTGGAGA AACTTGAGAA GTTCGGCTTT ATAATGAGAT CTCCAGAGAC TATTACTCCA
AGTAGAATTC GCACATGCCA TAGATGTGGT GCGGAGTTCA CCCGTGACGA ACAGCTTCTG
CCGGTTCAAT GTCAGTACCA TGCGGGCAGG GTGAGAAAGA CAGATTTTGG CAGAGTTTAC
GAATGCTGTC AGTCCGAAGT CAGTCTGGGT GATACCCATC CGTGTACAGT ATCCAATATG
CATGTGTTTT ACTGGCAGAA TAAGGAAGAA ATGGAGTGGT CTATTCCTTT CCAGAATACA
GATAGACTCT TTGGTGAGAG TAAAGGGTCC TTATTTGCAA TTGGTATAGA TTGTGAGATG
GGGTACACCA CCAGAGGACT GGAGCTCTTG AGAGTGACAG CAGTAGACTT CTTCTCTGGC
AAAGACGTTT TGGATATCTT TGTAAGACCG TACGGAGAAG TAGTAGACTT AAATACGCGT
TATTCTGGTG TATCTGAAAT AAAGCCCGAG GCAGTATCTT TCCATGAGAT GCTCAATCAA
TTGGGCCATA TCATGGACAA GAACACGATT CTAGTCGGCC ATGGACTTGA GAACGATATG
AATGCCATGA GACTTATCCA TAATAGAATT ATCGATACGT CTATCTTGTA TCCTAAACAC
AAGGCCACTC CTACCTTCAA ATTCAGTTTA AAAGACCTCG CATTCCAGTA TCTCAGCCGT
GTAATCCAAA CAGGAGAACA CGACAGTAGT GAAGATTCGC TAGCAGCCAT TGACATTGTA
AAATATTTTA TCAAAAAGGA TATTCAG
 
Protein sequence
DPKVFAPKAV ANPPAELGFR LKLIVQLVKA YRQHHPDIQT PITRAIEEEY KVAKVSSRTT 
YSASIKKVIY AALHPEKAKT PKENGPTEEQ YKRLLSELVI PVEKLEKFGF IMRSPETITP
SRIRTCHRCG AEFTRDEQLL PVQCQYHAGR VRKTDFGRVY ECCQSEVSLG DTHPCTVSNM
HVFYWQNKEE MEWSIPFQNT DRLFGESKGS LFAIGIDCEM GYTTRGLELL RVTAVDFFSG
KDVLDIFVRP YGEVVDLNTR YSGVSEIKPE AVSFHEMLNQ LGHIMDKNTI LVGHGLENDM
NAMRLIHNRI IDTSILYPKH KATPTFKFSL KDLAFQYLSR VIQTGEHDSS EDSLAAIDIV
KYFIKKDIQ
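
The protein sequence can be spot-checked against the gene sequence by translating the first line of the gene (60 nt) in reading frame 1. The sketch below is a hedged illustration, not the database's own method. It assumes the standard genetic code, because the "Translation table" field in the record is blank. The names `CODON_TABLE` and `translate` are hypothetical helpers introduced here.

```python
from itertools import product

# Standard genetic code (an assumption; the record leaves the
# translation table unspecified). Codons are enumerated in T, C, A, G
# order, with the third base varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a DNA string in frame 1, ignoring whitespace."""
    dna = "".join(dna.split()).upper()
    usable = len(dna) - len(dna) % 3  # drop any trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First line of the gene sequence, copied from the record.
first_60_nt = "GATCCAAAAG TATTTGCTCC AAAGGCAGTA GCAAACCCAC CAGCAGAACT TGGCTTTAGA"
first_20_aa = translate(first_60_nt)
print(first_20_aa)
```

The result, DPKVFAPKAVANPPAELGFR, agrees with the first 20 residues of the protein sequence listed above.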