Gene PICST_32081 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_32081 
Symbol 
ID4839723 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009045 
Strand
Start bp638241 
End bp639398 
Gene Length1158 bp 
Protein Length385 aa 
Translation table12 
GC content41% 
IMG OID640391038 
Productpredicted protein 
Protein accessionXP_001384791 
Protein GI150865534 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.555096 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.0738823 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATGCCT ATGCAAATCC CCCATTGACA GACGGAGCGA TACGAACGTC ATTGTCTGAT 
GATAATGGGA GTTTTGAGGA CGAAGAATTT GATGACACAG GAATGGAATT TGAGTCTCCA
AGCAAAGACA TAGTTCAGCT GAAATGTTCT ACAGATGATC GGACTAATGC AGAATTGACC
AATGGGCCCG AAATCAGCGA ACGCATCTCC TCTTTACCAA AAACTAGGGA GAGTCGTTTA
ATGTTCCAGG CGAAGAGTAT GACAACACAC AGACGTCAGC CTATGAATTC ATCCTCAAAC
AGTAATTACT TCGGTTACTC GGACGAAGAG CAGAGAACCA TTATTCGACA GAATGAATTG
GTTTCTTCTC CGCGAGACAG ACCTAATGGG TTCGCCATTA GGGGAAGGCA ACCTTCTCAT
TCACATGTTT CACCTTCTTC GAGATTGAAA TTTAACAACA TTTTGGACAC TCTAGCAAGT
TCTCCTAGCA AAATAAAGAG CAACAATAGA TTGGAGAAGC TGTCTTGGAA CACTGAAAGG
ATGCCGAATA GTGAAATTGA AGGTGGACGT GATACGAGTC CACAAAAAGC TTCACATGAA
TTGATCGCCA CAAAATGGAA CACGTCAGTA GATCTGCCAT ATCGTGTAAA AGTCCAGCCT
ACAAGGCTGA TAATCAATGA TGAAATATCA AGAGATAGTA TTATTGAGAA AGTTAACCTT
ACGTTGGATT CTCTCAGCAC TTCAATTAGA AAAACGAAGA CGATTAACCC TGATATTACT
TCTACGCCCA AATCAAATTC TAAGGATATT AAGCTGGCCG CCCCATCGAA CAGTTTACAA
TCTGAATTTG TCGACCATTT CTTGAGGAGT CCAGAACCCA CGTTGTTCAA GTCTAATTCC
GGAAAAAACC ATGAATTCGA ATCAAATACT TGCGGAAGCG GTACATGGCC CAGTGATAAA
TGGTTGAAGT TGCGAAAGAT AGTAAAACTG AGGTCAATTA CAAGACTGGA AGCTATTGGA
AGTACTTTTC TTCTTCAGGA GCTTGATTGT TCAAAAAAAG AACTTACATT AAGGTATGAC
TTCTTGCAAC AGCTTCCGAA AAAGAAATCC CGAAGTAAAA GAATGAGTAG AGTAGAAAAG
GAGACTCTTT ACAAATAG
 
Protein sequence
MNAYANPPLT DGAIRTSLSD DNGSFEDEEF DDTGMEFESP SKDIVQSKCS TDDRTNAELT 
NGPEISERIS SLPKTRESRL MFQAKSMTTH RRQPMNSSSN SNYFGYSDEE QRTIIRQNEL
VSSPRDRPNG FAIRGRQPSH SHVSPSSRLK FNNILDTLAS SPSKIKSNNR LEKSSWNTER
MPNSEIEGGR DTSPQKASHE LIATKWNTSV DSPYRVKVQP TRSIINDEIS RDSIIEKVNL
TLDSLSTSIR KTKTINPDIT STPKSNSKDI KSAAPSNSLQ SEFVDHFLRS PEPTLFKSNS
GKNHEFESNT CGSGTWPSDK WLKLRKIVKS RSITRSEAIG STFLLQELDC SKKELTLRYD
FLQQLPKKKS RSKRMSRVEK ETLYK