Gene PICST_65774 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_65774 
Symbol 
ID4839113 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009045 
Strand
Start bp836045 
End bp839135 
Gene Length3091 bp 
Protein Length961 aa 
Translation table12 
GC content43% 
IMG OID640390428 
Productpredicted protein 
Protein accessionXP_001384829 
Protein GI150865562 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.169462 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.537048 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTTCTGAGCT AGCAAAGCTT CCAGAAGACA CAGACACGTC ACTGTCGTAC CCCACCAACC 
ATCTCCTAAA AGCTCATAGT CGCAAGCATT GCCATGGTCG CTGGAGATAA CTTACAACAG
CTAAAGCTGG CATTGGAGAC TATGTACTCC AATGCTAACC AAAACGATAA AATCAACGCT
ACCCATTTTC TCGAGACATT CCAAAAATCC CAAGATGCTT GGGAAATCGT CCATACCATT
CTCAACGACG CGCATTTAGA TATCCATATT CAGCTCTTTG CCGCCCAGAC GTTGCGTTCA
AAAGTGACAT ACGATTTGTC TCAATTGCCA GAGCAGAACT TTGCAACCTT GAAAAATTCC
ATTATTCAAT TACTAACGGT GTTTACCGCC AACAACCAAC GTCTTGTGCG TACGCAGCTC
TGTGTTGCGT TAGCACAGCT TGCATTGCAA TATTTGACGT GGCAGGATGC TGTGTCGGAA
ATTGTTACTA AGTTATCGCT GACGGCAACG TACTTGCCCT GTTTGTTGGA CTTCTTGAAG
ATCTTGCCCG AAGAGTTGTC TGACGTCAAA AAGACATCTT TGTCCGACGA TGAGTTCAAC
ACGAGAACTA GAGAATTGAT AGAGAACAAC GTAGAACAGG TGTTACTCTT GTTGAAGAAC
TTGACCGACA CTAACTCCAG TAACTCGTCA CAAGACTCAA TGGTTCTCGA CTGTTTGAAC
TCGTGGATCA AGGAATGTCC CATAGAAAGC ATTCTCCGTA TTGATTCTTT AACTTCACTT
ATCTTCCGTA GTTTAGCCAG CGAAGAAACC TTTGATAAGT CTATTGAATG TCTCTGTACG
ATTATTAGAG AAACAAGAGA CATCGACAAC CATGAGCTCA TCGAAGCCTT ATACAAACAG
ATCATCGAGT TGAACTCGTT TATGCATGCT AACCCTGATA GACTTGAAGA TCCCGAAACA
TTTGACGGTT TGTCACGTTT GTATGTGGAA GCCGGCGAGT CGTGGCATGT TCTTATCGCC
AAGAACCCGA AGCACTTCAA GCCGTTGGTG TTAATCCTTT TGGAAATCTG TAAATACCAA
GACGACTTGG ACATCGTCAA GTACACATTC TATTTCTGGC ACTTGTTGAA GCAGTTGCTC
ACCATTTCCA AGTTCCAGGA ATCAAAGGAA GAGTTGGCAG ATATCTTTGC CAATTTAATC
ACCATCATAA TAAAGCATTT AACCTACCCC ATAACTGGAA ATGACCACGA CCTTTTCAAT
GGTGATAGAG AACAGGAAGA CAAGTTCAAA GAGTTCCGTT ACGAAATGGG GGACGTTCTC
AAAGACTGCT GTGCAGTAGT AGGACCCTCG AAGGCTTTGA GCATTCCCTT CCACCAGATC
CAGACCATTT TATCCTCAAA CATGCCTTCG ACCAACTGGC AGCACTTAGA GGCACCTTTG
TTTTCCATGA GAGCCATGGC TAAGGAAGTT TCTACCAAAG AGAAAGTCAT GTTGCCTACT
ATCATGTCAT TTCTTGTGCA GTTGCCGGAA CATCCAAAGG TCAGGTATGC AGCTACATTA
GTATTGGGAC GGTATACCGA ATGGACAGCC AAGAATCCGG GATTTTTGGA ACCACAATTG
AACTACATTA TCAAGGGCTT TGAGATTGTC AGCTCCAACA GCGCAGACGA ACAGGGAAAA
CACGACATTA TCATTGCTGC TTCTCGAGCC TTGATGTATT TTTGTCAGGA TTGTTCCGAA
TTGTTGGTCA GTTATTTGGA ACAGTTGTAC ATGTTGTATG GGCAAGTTCG TGACCAACTT
GACTTGGAAT CAACGTACGA ACTAGTTGAT GGTTTGGCCC ATGTAATTTT GAAGTTACCA
ACGGAAAACT TGTACACCAC TACAGAAATG TTCATTTCGC CAACTTTGCA GACTTTAAAT
CAATTGCTCG TAGCTGGTGA AAATGAAGCG AACTCCAAGT CTGTTGCTGA TCAAATTGAG
GTTTTGACAA AGTTCATATA TGTCTTGAAG GCTAACAATT TCAGTAAGCC TGATAGCCCT
ATTGCACGTT TATTCATAGA AAAGATATGG CCAGCTATTT CTCAATTGTT GGCTGCATAT
GGTAAGTCGG TCATTGCAAG TGAGAGAATT TTGAAGTTAG TCAAGTCAGG AATTCAATCC
CAGAGCACAT ATTTGAACAG TCTTTTGCCC GAAATGGCTA CCTTGTTGAT TCAGGGCTTC
CAGCAGTCAC ACTATGGGTG CTATCTTTGG GTATCTGGGG TTTTGATCAG AGAATACGGT
GATGAGTATA CCTCGGAAGA TATCAAGGAT GCTGTCTACA GATTTGGTAT GGAACAATGC
TCGTATTTCT TTAACCTATT GTTCAATACC AATGAAGAGG GAGTTCGTGC CATGTCGGAC
GTTGTAGAGG ATTACTTCCG TATGATGAAC GACTTGCTTA TGTTTTACCC GTTTAAGGTG
ATAGCCAACC AGGACTTATT AAAGTCTACT CTCAAGGCAT CGTTATTGAC TTTGAATCTG
ATCAACGAGT TCAACCCAAT CATTTCATGT ATACACTTCC TTGTAGACTT GGTATCATGG
GGATTGCCTA GCCCTCCAAT TTCGTTCTTT GATGAGAGCG ACTTGACTAT TCCCAGACAC
GGCATGCAAC AGTTTCTCGT TAGCGAGAAT AACGGAGGAG AGTTGTTGAG AGTGGTGTTG
AATGGCTTGA TTTTTAAGTT CAACAACGAT ATTCAGCAGG ACACCAACGA CTTGATTCTC
AAGATCTTGG TAGCTGTTCC AGATAAAAAT ATTTCTATAG GCTGGTTGCA TGAAGTGGTG
AAGGCTTTAC CCAACGTCAA CCAGAAAGAG ATCAGTAAGC TTATGGATAC AGTTTCAGTG
GCATTGCCAA ACAAGGACAA TAGAAGAGTG AGGTCTGCGC TTCGTGACTT TGTCAACTGG
TACAGCCGTA AGAACGTGAC ACCCAGAAGT GAATTCTAGG TGGAAATCTA CAGACTAAAA
GGGAATCATA CGTCAAAAGT TATAGTTAAT GGATACTAGA TAGAGAAGAT GTAGAGGAGA
GAGAGGTTGT GAATAGAATG AGCCACCACG G
 
Protein sequence
MVAGDNLQQL KSALETMYSN ANQNDKINAT HFLETFQKSQ DAWEIVHTIL NDAHLDIHIQ 
LFAAQTLRSK VTYDLSQLPE QNFATLKNSI IQLLTVFTAN NQRLVRTQLC VALAQLALQY
LTWQDAVSEI VTKLSSTATY LPCLLDFLKI LPEELSDVKK TSLSDDEFNT RTRELIENNV
EQVLLLLKNL TDTNSSNSSQ DSMVLDCLNS WIKECPIESI LRIDSLTSLI FRSLASEETF
DKSIECLCTI IRETRDIDNH ELIEALYKQI IELNSFMHAN PDRLEDPETF DGLSRLYVEA
GESWHVLIAK NPKHFKPLVL ILLEICKYQD DLDIVKYTFY FWHLLKQLLT ISKFQESKEE
LADIFANLIT IIIKHLTYPI TGNDHDLFNG DREQEDKFKE FRYEMGDVLK DCCAVVGPSK
ALSIPFHQIQ TILSSNMPST NWQHLEAPLF SMRAMAKEVS TKEKVMLPTI MSFLVQLPEH
PKVRYAATLV LGRYTEWTAK NPGFLEPQLN YIIKGFEIVS SNSADEQGKH DIIIAASRAL
MYFCQDCSEL LVSYLEQLYM LYGQVRDQLD LESTYELVDG LAHVILKLPT ENLYTTTEMF
ISPTLQTLNQ LLVAGENEAN SKSVADQIEV LTKFIYVLKA NNFSKPDSPI ARLFIEKIWP
AISQLLAAYG KSVIASERIL KLVKSGIQSQ STYLNSLLPE MATLLIQGFQ QSHYGCYLWV
SGVLIREYGD EYTSEDIKDA VYRFGMEQCS YFFNLLFNTN EEGVRAMSDV VEDYFRMMND
LLMFYPFKVI ANQDLLKSTL KASLLTLNSI NEFNPIISCI HFLVDLVSWG LPSPPISFFD
ESDLTIPRHG MQQFLVSENN GGELLRVVLN GLIFKFNNDI QQDTNDLILK ILVAVPDKNI
SIGWLHEVVK ALPNVNQKEI SKLMDTVSVA LPNKDNRRVR SALRDFVNWY SRKNVTPRSE
F