Gene PICST_32001 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_32001 
Symbol 
ID4839212 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009045 
Strand
Start bp439395 
End bp441869 
Gene Length2475 bp 
Protein Length824 aa 
Translation table12 
GC content42% 
IMG OID640390527 
Producthypothetical protein 
Protein accessionXP_001384748 
Protein GI150865504 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTACTG TTAAAAGTCT AGAGGATTTA CCCTCACTTA ATCTGGATAC GAGCTCGTAT 
TCATTTGATA CTCCAGTGAG GGTTCTACGA ATTTACTCTC CCCAAGATGC CACAGAAAGC
AACTACTACC ATATCTATGT CACAGACTAT TCGGCGATAG GGCCCCCAAC TTGGATGGCA
GACGAAGTGG AACGCGTCAA AAGTCCCCAC ATTGCAGATA CATCCATACT GCGCGAGTGG
AAGTTTCTCA TTGTCGCCAG CAGAACCCAA TTGCAAACGC TTATAGAGGA GAAATTCAAG
AAAACTCCGA AAGATGTCGT TTTGAAACAG TTGCCAGTTT TGGCCCAGGC TCGCTTTGTA
GTTAAAAGAC ATGTAAATGG CATATACGGC TCATTAATAT CCTTGGATTT CAACACAGCA
GGTATCATTG CTGGTTTAGA ACTTAGAGAA GATAAAGGAT ATTATTATGC CTATGCTCGA
CTTTTCGCTC GTTTCAAGAT TCTTGCTACT CAAAATCAAA TTTATCTTGC TGAACAAGTA
TATAATTTCA CAGCCAAGTT CGGCAAATTG CTTGATGAGA TTCTTGAACC GACGAAGCAG
AGAAAGAAGG GACGCCAGCA CAAACTACCT GCAAACACAT CTGAACAAAA TGCAGTTATG
AAACAAAAAG ATAAAGAAAA GCAAAAGGAC AATTCAGATG CCAACCACAA GAGGCATAGT
ATCGCGAACA GAGATACGAC AATTCAAATA AAAGTCAGCG TAGAAAACGA CTACACCAAA
AGTACAAAAA AAACTGTAAA TATCAACTCC AACAGTAACG AAACTAATGA CACTACTAAT
ACCGATATCA ATAACGCTGC AGCTGTCAAT ACCATCAACG ATACCGTCAT TAATGATAGC
AACTCAGCTG TCTCTGGAGA CTCCGTAGAT AAAGAACAAA CTTCAGAAGC TCCAGATTCA
CAAGATATTC GTCAAGTCTC AGACAAATTC AAGGTTGATG CAGAACCTTC AGTATCAGAT
AAAGCAGCAG ATGTACGAAT GGGGACATAT CGAGGTGAGA CACAACCACA AACTCCTATA
GTTGGTAACA AGTTCTCAAC TTCATCGCCT CGAAGTGAAC AATTAGAGCA TTCGTCGCCA
CTCAGAACGA AAGAATCACT GGAAAATTCC ATTATAAGAG CCCAACAACA GATACCTTCG
GCTTCAGCGA GTCTGGATGA CCATTCACCC ATTGCCCTAC GTGCAGAATC ACTTCTGTCA
TCGATTCAAG AGCAACGAAA TGTAAATTCT TCGAAGGACA AGTTCAGTAA ACAAGAAGAC
CATTCGCTTA CCTCTGTAAA GAGACTAGCT TCACTATCAT CTTCTCCTCT GAAGAAACAG
CGTAAAACCG AGTCATTAGC ACGTACAGAA TGGGCACCTG GAGGTCATCT GATTTTAAAC
TTGGATAATA CTAAGTTTAA GAAGCTGCCA CCGACCGGAG CTGATCTTCT TTTTCATAAC
CAAGACGACG AGGACTCTCA GGCTCCATTA TTTCCACCAA CACAAGCCAC GGCACCGAAC
TACCTAACCT TTGAAGCGAT GCTTAACAAA CAATCACAAA TAGAGATTGT AGAAGAAGAG
AACGACTCCA CTGCTTATCA TAGCTTGCCC CCAGAAACTA AAAGCACTCA GTATGATACG
TCTTCGTCAA ACATGCCACC TCCGCCAAGT TCCCGAGTGA TATTGCCGAA TTCGAGCCAG
CCTCCATTGG CACAAGGCTC TCTTCCTGTT CCCCAGACCC CCAAAGCATT ACAGGGTGTC
AAACGTCAAT CCAGCTCTGT ACCTAACCAT CCCGATAATG TGCATAGTCA AATTTCAGAG
AATGCAACGG TTGTATCACC AACATTAATG CCAAGTCAGC GTGCTGAAAT AGAGATAGAC
ATCGATATTT CTCCTCCTCC TACGCAGCCA CGCCCTGAAG ACAGTACTGG CAGTGCAGTA
AGTTCCAGAG AGAGCCAGGA CAAGGATAGC ATCGGAATTA TAGGAAAAGC TCCTAGGACC
TACATTTCAA AGACTATGTC CATTGCACAA TTGTTAAAAG TACCATTGAC GGTCGAGGAG
ATTTCTAAGA AACAGATCTA TGAAGTACGA GGTGCTTTTA TCAAAGGATT GCAACCATTC
AGGCCATTCA TAGTAAAACC TTTCAAAAGA ACAATAAAAG TTGCCAACTT CAACATCGTC
TTAACCGATA GAAGCAGAGA TCTAGTGGTC GAGTTCCACA ACGAAATTGA AATCTGCCAC
TTTCTCGGAG TCGACGAAGT GGAAGAAGTC TACAATCATC TCTCGACTAT AGAAGACGAT
ATTATCAAGC TAGCTAAACT AGAACACCCT ACCACCAACA ATATCCGTCT TGTTAGAAAG
ACCAAAACTG CACGAGATAA TATGTTGTAT CCATACTGGG CATGTCTCAG CACCTTGGAA
GACCTCATAG CGTAG
 
Protein sequence
MSTVKSLEDL PSLNSDTSSY SFDTPVRVLR IYSPQDATES NYYHIYVTDY SAIGPPTWMA 
DEVERVKSPH IADTSISREW KFLIVASRTQ LQTLIEEKFK KTPKDVVLKQ LPVLAQARFV
VKRHVNGIYG SLISLDFNTA GIIAGLELRE DKGYYYAYAR LFARFKILAT QNQIYLAEQV
YNFTAKFGKL LDEILEPTKQ RKKGRQHKLP ANTSEQNAVM KQKDKEKQKD NSDANHKRHS
IANRDTTIQI KVSVENDYTK STKKTVNINS NSNETNDTTN TDINNAAAVN TINDTVINDS
NSAVSGDSVD KEQTSEAPDS QDIRQVSDKF KVDAEPSVSD KAADVRMGTY RGETQPQTPI
VGNKFSTSSP RSEQLEHSSP LRTKESSENS IIRAQQQIPS ASASSDDHSP IALRAESLSS
SIQEQRNVNS SKDKFSKQED HSLTSVKRLA SLSSSPSKKQ RKTESLARTE WAPGGHSILN
LDNTKFKKSP PTGADLLFHN QDDEDSQAPL FPPTQATAPN YLTFEAMLNK QSQIEIVEEE
NDSTAYHSLP PETKSTQYDT SSSNMPPPPS SRVILPNSSQ PPLAQGSLPV PQTPKALQGV
KRQSSSVPNH PDNVHSQISE NATVVSPTLM PSQRAEIEID IDISPPPTQP RPEDSTGSAV
SSRESQDKDS IGIIGKAPRT YISKTMSIAQ LLKVPLTVEE ISKKQIYEVR GAFIKGLQPF
RPFIVKPFKR TIKVANFNIV LTDRSRDLVV EFHNEIEICH FLGVDEVEEV YNHLSTIEDD
IIKLAKLEHP TTNNIRLVRK TKTARDNMLY PYWACLSTLE DLIA