Gene PICST_33452 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_33452 
Symbol 
ID4840613 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009047 
Strand
Start bp676529 
End bp677959 
Gene Length1431 bp 
Protein Length476 aa 
Translation table12 
GC content40% 
IMG OID640391928 
ProductUncharacterized conserved protein 
Protein accessionXP_001386144 
Protein GI150866514 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.242751 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACTCAG ACTCCAAGCC CAATGCAATT GAATTGCACG TGCCGACTTT GCTCGACACA 
ATATTCAAAG ACAATGTTCA AGAGCTACTT CCCCCACACA GCATTAAAGC TCCCGAATAC
ACAGAAATAC TTCTTCAGAA GACAACCAAA GCCAACAAGT ATTACGGATC TTTCCAACGG
TATACACGGC AATTCATACC AGATGATTCA GCAGCAAACA CGGCCATTGT GCGATGGACC
AATTCGTCTA TAGAGTCTAC ACAAAGACTC ATAATAGATT CGTGGATAGG CTCAGATTAC
AAAGGAAATC AGGTCAGAAC CGAAAATCTG CCACATCAGA ATGTTCAGAC GAAAACTACT
GCAAATTCAT TATTCTCATG GTCCTCTGGT GATGCTTACA TTGAACAGAG AAAGAAAGAA
CTTCAGTTAG CCAAGGGCAA GGAAAAGAGT AGAAAGGAGA ACCAGAAAAC ACATAGTCAT
CAAAAAGAAG TTCTGCCCGA GGAGACAGCA TCTAATAATG TTACAGAATC AGAAAGCAAG
AGGGAAGCAC AAGAGAAGAG ACTCAAGAAG TTGAATCTCA CGGTTCCGAA AGAGTCTATT
ACCAACAGAC TTAATCGTAT TATAGAGAAA GAATCCAATA CCTTCATTCA TGCCCGTATC
CAGCGTATTA AGACACATCA CACCGAGCAG ATCACAAAGC AGATCCATGA GAGAAAGAGA
AAAGACCACG AATTGTACTT GAAGAAGTTA CGACTCAAGG AAGAAGAATA CGACAAAGCA
TTAAAGGCAG CAACCACAGA AACACAGAAA CAGGGATTTT TCGGTAACTT GTTTGGATTC
AATTCGACAA CTACGAATTC TCGTTCAGAA TTGGATTTTC AACTCAAAGA TTCATTTGAT
AACGAATCAG TTGCTAGCCA TACATCAACT ACAGCATCTA AGACGAACAA CAGTAATAGC
AAGAGATTTT CGTTTCTTCC TGTAGGAGGG CTCTGGAGCT CTGGTTCTGG AGGGAAAGAG
CTGAACAAGA ATGACTTGTC TGAAGTTTCT CCTAGTAAGA AAAGTGAATC TTCTAATGAT
CATATACATC CTGCCGAAGT TGTCCTTCGC GAGGAAATCG AATTACAAGA AGAAAATGAA
GTAGCAGAAA TAGAAGAGAA AGAAGAAGTA CACGAAGGTG AAAATGAAAA TGGGTATGAC
TTTGATGATG AAGAATTCGA CGATTTCACA TCGGCAGAAC CATTGCCTAA AACTTCAACT
ACCATAACAG CAATGGCTCC TTTAAAACCA TCTACTTTTC TGCCTCCTCC TAGTTCTACA
ACGAATGAAT TCAATCTCAC TACTCAACAC AACATCAACG CGGACCTTTT GGACATATTT
GGTCCTGAAA AGAAGAGTAA CGAAGCTAAC GAAAATCTAC TCAATATATA A
 
Protein sequence
MDSDSKPNAI ELHVPTLLDT IFKDNVQELL PPHSIKAPEY TEILLQKTTK ANKYYGSFQR 
YTRQFIPDDS AANTAIVRWT NSSIESTQRL IIDSWIGSDY KGNQVRTENS PHQNVQTKTT
ANSLFSWSSG DAYIEQRKKE LQLAKGKEKS RKENQKTHSH QKEVSPEETA SNNVTESESK
REAQEKRLKK LNLTVPKESI TNRLNRIIEK ESNTFIHARI QRIKTHHTEQ ITKQIHERKR
KDHELYLKKL RLKEEEYDKA LKAATTETQK QGFFGNLFGF NSTTTNSRSE LDFQLKDSFD
NESVASHTST TASKTNNSNS KRFSFLPVGG LWSSGSGGKE SNKNDLSEVS PSKKSESSND
HIHPAEVVLR EEIELQEENE VAEIEEKEEV HEGENENGYD FDDEEFDDFT SAEPLPKTST
TITAMAPLKP STFSPPPSST TNEFNLTTQH NINADLLDIF GPEKKSNEAN ENLLNI