Gene PICST_31564 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_31564 
Symbol 
ID4838651 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009044 
Strand
Start bp1118160 
End bp1119380 
Gene Length1221 bp 
Protein Length378 aa 
Translation table12 
GC content49% 
IMG OID640389966 
Productpredicted protein 
Protein accessionXP_001384514 
Protein GI150865340 
COG category[D] Cell cycle control, cell division, chromosome partitioning
[R] General function prediction only
[T] Signal transduction mechanisms
[Z] Cytoskeleton 
COG ID[COG5126] Ca2+-binding protein (EF-Hand superfamily) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCAGTCG ATGATTTGCC GACATTTCCT ACTCCACAAC AAGCAGCCTA TGTTGTGGAT 
ACCCATCCCA AAAAGACCTC CAAACCACCT TCCAATAACT ACCGTCCACC AGTAAAGTCT
GCTAGCTCAT ATCAACAGCA GGGCTACCCG CCCCAGCACC AGGGCTATCA GAATGGCCAT
TCGACTTCAC CATACCAGCA CTCACAGTAT GGAGCTCCCC CACCACCACA ACAGTACTAT
CCACAGGCTC CTCCGCAGCC ACCCGTACAA CACTACCAGG CTCCGCCACC ACAGCAGCAC
CACCAGCAAC CTCCACCACA TCAGTATGCC CCACCACCGC AACAACAGTA CTACCAGCCT
CAGCCAGTGC ACCATCAGGC TCAACCACAG GTCTATCAAC CTCCCCCACA ACAGCAGACA
CATCACCAGC AACCTCCACA ACACCAGCAA CCTCCGCAGC ACCAGCAGCC TATCTACAAT
CAGGATTACG GCCAGGAGTA TGGCCAGAGC CAAAGTCAGA GCCAAAAATA CAATACGCTC
GGCACATCTC AGGCCAACAC GCCTTCATCT GTTCACAAGA GACCCCCGCC TGTAAGCAGC
AACAGCTCGC GGACAGTCGA GAAGTCTAAT CGGTCAAGAG AATCTGTAGC GACTGAAGGA
ACGGTGAAGA CCCTGAAGCA GAAGCTTGAG AGTGAGTTAC GTTCCGTTTT TGAAAAAGTG
GATACGAATC GCTCAGGAAG AATTTCGGCC AAGGAGCTTT CTTTAGCGTT ATTGAACTTT
GACAATACTC GATTCCAATC TTCTACAGTA ATGTTGATGA TCAAACTTTT CAGCAACCCT
GATGCTCCGT CCAAGAGTTT GAACTTCGAC CAGTTTGTGT CGTTGTGGAA GTACCTTTCA
GCATACAAGA AACTATTCAT ACAGGCCGAC TCCAACAAGT CGGGTGACAT TTCCTTTGGT
GAGTTCCAGA AGATCTTACT TGAGATCGGC TACAAACTAG AGATCGACGT GGTATTGCAT
TTGTTCCTGA GATTTAGCTA CAAGGAGGGC AACTACGATA GCGGCACAGG TGTAGGAAAG
CTCAAGTTTG ATGCGTTCAT CGAGTTGCTT GTCTACTTGA AAAAGTTGAC CGATGTCTTT
AAGAGATACG ATAAGAACCT TTCTGGCGAA GCCACCATCA GCTTCTCCAA CTTCTTGTTT
GAAGTGAGTA ACCTCTCGTG A
 
Protein sequence
MPVDDLPTFP TPQQAAYVVD THPKKTSKPP SNNYRPPVKS ASSYQQQGYP PQHQGYQNGH 
STSPYQHSQY GAPPPPQQYY PQAPPQPPVQ HYQAPPPQQH HQQPPPHQYA PPPQQQYYQP
QPVHHQAQPQ VYQPPPQQQT HHQQPPQHQQ PPQHQQPIYN QDYGQEPPPV SSNSSRTVEK
SNRSRESVAT EGTVKTSKQK LESELRSVFE KVDTNRSGRI SAKELSLALL NFDNTRFQSS
TVMLMIKLFS NPDAPSKSLN FDQFVSLWKY LSAYKKLFIQ ADSNKSGDIS FGEFQKILLE
IGYKLEIDVV LHLFSRFSYK EGNYDSGTGV GKLKFDAFIE LLVYLKKLTD VFKRYDKNLS
GEATISFSNF LFEVSNLS