Gene PICST_32230 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_32230 
Symbol 
ID4839175 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009045 
Strand
Start bp998997 
End bp1000353 
Gene Length1357 bp 
Protein Length433 aa 
Translation table12 
GC content42% 
IMG OID640390490 
Productpredicted protein 
Protein accessionXP_001384861 
Protein GI150865586 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTATTC ACTTGCTTTC AAGACTCCCA GGGGAGTTAC CCGTTTCAAG AAACCTGCTT 
AATGCCATGG GGAGCCTTTT ATCCCCAGAA TTTGAGTATC TTAACGAGTT TTTGCCATCT
CATCTAGATT CGAAATCAAA TTCTTTGGAT CTAGATGTTG GACTGATTTC TTCTAAAGTT
GAATCAAACC TTCCAGTTAG TGATGAATTC GATAGCTTGT TCAGGTATCT TCTAACCCCA
ACACTTGTTG GACCATCTCC AAATTCATTT TACTACGATT ATGTTGCCTG CGCCGACAAC
AATGAGTTCT TGGGTATATC TGAACCAATA GCTAGTCAAC CTAGAGAATT CAGTCTTTCT
GTGGACGGAA TCAAAGTAAC AACCTCTGAA GACGATACTT TCTTCCAAAA TCTCCTGTCT
CCCAATAGGT CTGATGCTGT GAGTACTCGT GAACCAGGTG AAACAGTTGT GCAACACCAA
CTGGAATTAG CAACAAGGCT GCAGAGTCAT CGGGAGGTTG TAGCACACCA AAAAGCTAGA
ACTACGACTG GATCTAAGCC CAGCTCCACA TACAAGGTAG TGAAGCATAG GCCAAAGGGA
TCTAAACAAG CTTGTGTTCC AATTAAAATT TCATACGAGA AACTTAAATT GACAACAAAG
CTTGGAGCTG AACTCTCGGA TTCGTTTGTC GAAACTGTTG AATCCAGTAT GTCAGCGTCG
GTCCGTGCAA TGTTAGCCGA AAGAAAATTA CCAGAGGAGC TTGAAAATGG TGCCTCAAGG
TGTAAGATTG ATAGACAAGT CTATGAAAGA CCTCTATTGA TTGAAGAAAT GGAAAAGTTC
TGTGGCCATC CAAAGGTGAG ATATATTCGA AACTCAAACT TTGGACGGAC TCCCTACGAA
GCAGAGTACT ACCTGTACCA GGTGGACAAT AAGGGTCAAC TGATCAACCA TACAAGACAT
GGTTTGTGTC CATATTGTCC AGAAGTCCTG TTCTTCAAGT TGAAGAATTC TGCCTACGGG
AACCATTTAG GCAACATTCA CGGCATCCGC ACGAACGGGT CCCTTTTTCC AGATCCAATT
CTCCCAGGAA TCTACTTGAT GGCCAAGAGT GAATTTGTAG AAACTGAAAG AAAAACTCTA
GCCAAAGAGA GAGCTACAGC TGGGGTGGTA TGTCAAGCAT GTTATACAAT TCAAGAGATG
CAGTGCACCC TGAGAAGCAC AGATTTGGGA CACTATCTTC GACATTACCG AGACAACCAT
GTCAAATGCA AAAGCAGAAG CAAAGGTTGT CGCTCTGGTC GGAGTAATTT GGAATATAAT
TAGAAAGACT TTGGTGTTTT TGGTTATCGT CAACTAG
 
Protein sequence
MSIHLLSRLP GELPVSRNSL NAMGSLLSPE FEYLNEFLPS HLDSKSNSLD LDVGSISSKV 
ESNLPVSDEF DSLFRYLLTP TLVGPSPNSF YYDYVACADN NEFLGISEPI ASQPREFSLS
VDGIKVTTSE DDTFFQNLSS PNRSDAVSTR EPGETVVQHQ SELATRSQSH REVVAHQKAR
TTTGSKPSST YKVVKHRPKG SKQACVPIKI SYEKLKLTTK LGAELSDSFV ETVESSMSAS
VRAMLAERKL PEELENGASR CKIDRQVYER PLLIEEMEKF CGHPKVRYIR NSNFGRTPYE
AEYYSYQVDN KGQSINHTRH GLCPYCPEVS FFKLKNSAYG NHLGNIHGIR TNGSLFPDPI
LPGIYLMAKS EFVETERKTL AKERATAGVI WDTIFDITET TMSNAKAEAK VVASVGVIWN
IIRKTLVFLV IVN