Gene PICST_33758 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_33758 
Symbol 
ID4840874 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009048 
Strand
Start bp409161 
End bp410360 
Gene Length1200 bp 
Protein Length399 aa 
Translation table12 
GC content45% 
IMG OID640392189 
Productpredicted protein 
Protein accessionXP_001386471 
Protein GI150866767 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTGATC CGTCATTAGC CTCTTTCCCG AAAAGAGAGT TGGAATTGAC TCCATTCTGT 
TCTCAGTTGG ATTTCATCAA CAAGGCCTTG CTTTCGCTTT CAAAACAGGA CTCATCAGTC
AACAACTACA CGAACACGTC TCGTAGGTAC ACCAATCAAA ACCAGCAGAA CACGCATGCA
AACTACAACA ACAACCAGCA AAACAAATAC GCTTTGGCTG CTGTTGCTGC CGCCGCGGCT
GTCCAGCACC AACAGCTGAA CTACCTTCTT CAACAACATC AGCAAAATCA ACAAAATCAA
AACCAACAAT ACCAAAGTCA AAACCAGCAG AATCAGCAGC AGCAGCTTCG CAATAACCAA
TATTCACAGC AGTCACATCC TCAGTTCCAA CTGAATCAAC ACCAGCAACA ATATCATCAA
CCACAGTATC AGCCGCAGCA ATATCAGCAA CAACGTCTGA ATCAACAACA ATTCCAAAAT
CAACAGAATC AACAAAACCA GCAAAATCAA CAAAATTACC AACAGCTGCA ACAATTACAA
CAGAACCAGC AAAGTCTTCA GAACTTCCAG CACCAGAATC TTAATAATTT TCAAAATCAG
CAACAGCCTC CACAGGTATT TGGTTCTGAA CTCTCTGCCT CTTCTTCTGT TGCCTCTGAC
TCTCAATCCA ACTCTACCAG TGCTGGAACA CCATTGACTG GCATTCCTCC TGCTCTTGAT
TTGAAAAGCA ACTTCTTGAT GTTCAACGAA AGCTCGCACT CAATATCTTC TCCTACTTAT
TCTAGCGCAG CTCAGCAACA GAAACCTCCG CAACTCCACT CCAAGTTATT TGACTCTCTG
TCCAGCAACT CGTCTCCGAT TTTGCTTCCA CCGAACTCAA GCGCTTCCAA CAAGAACTCA
CCTCCCAACT TGTTTCTCTC TACGCCTGTG ACTACACCTT TGAATGCTAC ACATTCTACT
GCAACCGGCT CTCCATTGAC CAAGCCTACC CTTCCAGAAA CGTCTGGTGG ATTTCTCAAC
TCACCACTGT TGCTTCCTAC TCTGACACCA ACCTCTGCGG GAGGTGCCAA CTGGAGCACT
ACCAATTCTA GTGCTTCAAT CTGGGGTAAC ACTACCAATG GAGTGTCTGC TCTGTCTGCT
AACTCTTCTA AGGGTAACAG CCATCTAATC TCCAGCTTCG GCAGCTCAGG AATGTGGTGA
 
Protein sequence
MSDPSLASFP KRELELTPFC SQLDFINKAL LSLSKQDSSV NNYTNTSRRY TNQNQQNTHA 
NYNNNQQNKY ALAAVAAAAA VQHQQSNYLL QQHQQNQQNQ NQQYQSQNQQ NQQQQLRNNQ
YSQQSHPQFQ SNQHQQQYHQ PQYQPQQYQQ QRSNQQQFQN QQNQQNQQNQ QNYQQSQQLQ
QNQQSLQNFQ HQNLNNFQNQ QQPPQVFGSE LSASSSVASD SQSNSTSAGT PLTGIPPALD
LKSNFLMFNE SSHSISSPTY SSAAQQQKPP QLHSKLFDSS SSNSSPILLP PNSSASNKNS
PPNLFLSTPV TTPLNATHST ATGSPLTKPT LPETSGGFLN SPSLLPTSTP TSAGGANWST
TNSSASIWGN TTNGVSASSA NSSKGNSHLI SSFGSSGMW