Gene PICST_32340 details


Gene Information

Locus tag: PICST_32340
Symbol:
ID: 4839336
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009045
Strand:
Start bp: 1259819
End bp: 1260845
Gene Length: 1027 bp
Protein Length: 314 aa
Translation table: 12
GC content: 44%
IMG OID: 640390651
Product: predicted protein
Protein accession: XP_001385249
Protein GI: 150865863
COG category: [R] General function prediction only
COG ID: [COG0724] RNA-binding proteins (RRM domain)
TIGRFAM ID:
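
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: the helper names `span_length` and `coding_length` are hypothetical, and it assumes that coordinates are 1-based and inclusive and that the gene span includes the stop codon. Under those assumptions, the gap between the gene length and the coding length would be consistent with non-coding (intronic) sequence, which fits the "Is gene spliced: Yes" field.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a 1-based, inclusive coordinate span (assumed convention)."""
    return end_bp - start_bp + 1


def coding_length(protein_aa: int) -> int:
    """Nucleotides needed to encode a protein plus one stop codon."""
    return (protein_aa + 1) * 3


# Values copied from the record above.
gene_len = span_length(1259819, 1260845)
cds_len = coding_length(314)

print(gene_len)            # 1027, matches the "Gene Length" field
print(cds_len)             # 945
print(gene_len - cds_len)  # 82 bp not accounted for by coding sequence
```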


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.46857
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.27086
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGTGCGTT CAACATGGCC TTATTTGGTG AATAGCGTTC GGATTTTTCA TCTTTTTTCG 
ATCAACATCT ACTAACATAA TTTAGACTGA TAATACTGAG AAGTATCCTC CGGATATCCA
GAAGCTTTTT GCACCCAAAC CGCCGCTCTT GTATTTGCTG TCTTCAGATT TTGCAGCTGG
ACAGAGAGCC ACAGCATCCA TAACACCCGT TTCAGCCTGG AGATCTGAAA TTGACAAATA
CACCGTTCAG TTGAAGGAGC AGGATTCTTC CAGCATAAAG AAACAACCAA CAAAACACCA
ATTACAAGAA GAAGCTGCTC GTGAAAAACA GCTTCTCAAG CGAGAATCGT TCAAACGACA
ATTGCGCGAA TGGAATGATC CCGAAATATT GCATCAAAAT GAGAAAGAAT TCATGAAAGA
TCCATATAGA ACCATCTTTG TCTCTCGTTT AGACTTCAGC TTAACCGAGC TTGATATTTC
TAAGCATTTC AGCAAGTATG GCGTGATTGA GTCTGTGCGT ATTATACGTG ACTCTGTAAC
CGGTAAATCT CGAGGATACG GCTTCATAGT GTTTGAACGA GAGTGGGATG CCCAGAGCTG
TATCAGTGAA GTGGCGAGAA CAGGTGTAAG ACTTCCACAA GCAAAGAGAA CTATTTTGGT
AGATATAGAG AGGGGCCGTA TAGTGCTGAA CTGGCGTCCG CGCAGATTAG GAGGAGGTCT
AGGAGGTAGA CACTATACGA GACCCGATCC CCGTTTCAAT AGTACAGCTT CAGCTGCAGC
CAGTGGTAGA AGTATTAATA TTGCTAACAA CCCACATATA CCGTCTGGTC ATAGTGGCCA
TCGCCAGCAG CCGTCATATT ATCCTCCAAC ACAGAGTACG TTCAAGAGCT ATCCTAAAGA
AACCGAAAAG AAACCGGAAA AGTCTGTCAA GGACAAGTAT GCCAAGTATG CTGCTGTTCT
GGAGTCGTCT GGTGGTTACC GTTCTGTGGG AGAGACCCGG TCAATCAGAA GTATAAGGCA
GGGGTAA
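
The sequence above is displayed in space-separated blocks of ten bases, so it needs cleaning before any calculation such as the GC content field. The following is a minimal sketch, not the database's own method; `clean_sequence` and `gc_fraction` are hypothetical helper names, and only the first display line is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(text: str) -> float:
    """Fraction of G and C bases in a sequence (0.0 for empty input)."""
    seq = clean_sequence(text)
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First display line of the gene sequence, copied from the record.
first_line = "ATGGTGCGTT CAACATGGCC TTATTTGGTG AATAGCGTTC GGATTTTTCA TCTTTTTTCG"
print(len(clean_sequence(first_line)))
print(round(gc_fraction(first_line) * 100))
```

Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the reported GC content.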
 
Protein sequence
MTDNTEKYPP DIQKLFAPKP PLLYLSSSDF AAGQRATASI TPVSAWRSEI DKYTVQLKEQ 
DSSSIKKQPT KHQLQEEAAR EKQLLKRESF KRQLREWNDP EILHQNEKEF MKDPYRTIFV
SRLDFSLTEL DISKHFSKYG VIESVRIIRD SVTGKSRGYG FIVFEREWDA QSCISEVART
GVRLPQAKRT ILVDIERGRI VSNWRPRRLG GGLGGRHYTR PDPRFNSTAS AAASGRSINI
ANNPHIPSGH SGHRQQPSYY PPTQSTFKSY PKETEKKPEK SVKDKYAKYA AVSESSGGYR
SVGETRSIRS IRQG