Gene PICST_32879 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_32879 
Symbol 
ID4840061 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009046 
Strand
Start bp852198 
End bp854054 
Gene Length1857 bp 
Protein Length618 aa 
Translation table12 
GC content40% 
IMG OID640391376 
Productpredicted protein 
Protein accessionXP_001385862 
Protein GI150866313 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.134484 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00551311 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGGACAATA TTACAGATAC CATAGCAGCT CCCTCGTTGG AGAATATCAC CTTTGCGTTT 
AGCAACTTGT CCTATATCGA TCAGCTTGCT TACATCATTC CCCACAAGAT ATATTTGAAG
GACTCGCCAA TCATTCCTGA AGTGATCAAT GGAGCTCTAA TTTTGCTTGT TAGTATAGCG
ATCGTAATAG TAGGATCGTA TTCTACAGTA TCACGACCTT CGAACAGCGA AGACCCTCGT
TTGGACAGAA AATCTCCGTA CTGGGATCCA TCGGATGTGG ATAACACCGA ACACTTTGTA
GCCAACAAAA TGCAATTGTA CACATTGGGA GGCCAGAACT TGGGTTTAGT ACATGTATTG
CTTATGCCTC TATCCACAGC AGGAACGCTA TACTTCTTGA ACTATGTCAT GAACAACTGG
AATATCGACG ATATCAACTT GTGGTTGAAC AGATACATTT TAGCAGTGCT GTTGTTCTCC
ATATACGGTA GTTTGGAATA CGTTCTTGTT GCTTCTTCAA GAAAATTGTC CAAGCTTCTT
GGTGGACCCC TTAGCAATTC TAGCAATCTT TTCTGCAGAT ACCGTTTGAC ATTGACTGCC
GACAAGGACG ACAACTTTCC ATTGGGCAGA TTGGAGAACT TTGATGAAGA CGAATTCGTA
GAGAAGGAAT TGAAGAAAGA TCCAAAATGG AATGAAGCCT TTCAAAAGTA TTTGAGCGAG
GAGAAGATCG AAATTTTGCG TCCTACTTCA GTGAGAATAA TTCCTAAAGT AACTGAAAAT
ACCAATTGGA TCTTTGACTT GAAACCAGCA GTAATCCTTC CTTTAACAAT TGGCCTAATT
TACCTGTTCT ACAAATACAA CCCAATATTG AATTCTGAAT ATAACATGAA TGATATCAAC
TGGTTAGTTC TTGATTCCAT GGCTATTAAT TTTGCAATAT TTGGTATTCA AAAGATCAAA
TTTGGTCAAT TCAAGTATGG GTTCCTTTTG TTGTCTGGTC TTTTCTTCTA TGACATTTAC
TTTGTCTTTG GAACAGAGAT AATGGAAAAG GTTGCCACAG GATTGAATAT ACCAATGAAG
ATATTGCTTC CTCATCCAGG TAGCAGCTGG GGCGAGCCAT TGAAGTTCAG TTTGCTTGGA
TTGGGAGATA TCATTGTCCC AGGTACGGTT GCCTCTTTAT CGTTAAGATT TGACGTCTAC
CGTCACCACC AGAAGAATCC ATCTACAGCA TTCCACTACT TGACTCCAAT CGCAAAGCCT
TATTTTACTG CAGCAATTGT CTCTTATTTC ATTGGTCTTG CAGCTACGCT TGTTATGCTC
AATATTTTCC GCGTAGGCCA GCCAGCTTTG CTATATATAG TTCCTTCTCT TTTGGGAGGA
ATAACAATCA CTGGTCTTGC AAGAAGAGAA TTCACTGAAT TATGGGAATT TAAAGACGAG
ATCAAGCAGT TTGACGAGAA GGACTTCGAA AATGAAAATG AAAACTACAT AGAAGAGGAG
GATGAAGATT ACATTTTGAA CGAAGACGAA GCCTCATTTG ATGACTGGGT TGACCAAGTT
GAATTGGAGA GGGCCGGATC AGAAGATGAA ACTGATTTGG ATGAATTCAG AAAGTTTGCA
CCCAAAAGAT ACACAGCAGA AGATTTCGGC CCGGACGATG AAGAGGAAGA CGACGATACA
TTTGTGATTG GAGAAGGCAG CGACGACGAA CTTGACGACG ATGACGATAT CGAAGAAGAA
GAAGTCGAAT ACGAGGAAGA TGACGACGAA GCTGTAATCG AGGTTCTCGA GGAATTGCAA
GTTATTAGAG AGGATTTGAA CAGACAGCCA CAAAGATGGT ACAGTGACGA AGAGTAA
 
Protein sequence
MDNITDTIAA PSLENITFAF SNLSYIDQLA YIIPHKIYLK DSPIIPEVIN GALILLVSIA 
IVIVGSYSTV SRPSNSEDPR LDRKSPYWDP SDVDNTEHFV ANKMQLYTLG GQNLGLVHVL
LMPLSTAGTL YFLNYVMNNW NIDDINLWLN RYILAVSLFS IYGSLEYVLV ASSRKLSKLL
GGPLSNSSNL FCRYRLTLTA DKDDNFPLGR LENFDEDEFV EKELKKDPKW NEAFQKYLSE
EKIEILRPTS VRIIPKVTEN TNWIFDLKPA VILPLTIGLI YSFYKYNPIL NSEYNMNDIN
WLVLDSMAIN FAIFGIQKIK FGQFKYGFLL LSGLFFYDIY FVFGTEIMEK VATGLNIPMK
ILLPHPGSSW GEPLKFSLLG LGDIIVPGTV ASLSLRFDVY RHHQKNPSTA FHYLTPIAKP
YFTAAIVSYF IGLAATLVML NIFRVGQPAL LYIVPSLLGG ITITGLARRE FTELWEFKDE
IKQFDEKDFE NENENYIEEE DEDYILNEDE ASFDDWVDQV ELERAGSEDE TDLDEFRKFA
PKRYTAEDFG PDDEEEDDDT FVIGEGSDDE LDDDDDIEEE EVEYEEDDDE AVIEVLEELQ
VIREDLNRQP QRWYSDEE