Gene PICST_82707 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_82707 
Symbol 
ID4838222 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009043 
Strand
Start bp319023 
End bp322054 
Gene Length3032 bp 
Protein Length901 aa 
Translation table12 
GC content44% 
IMG OID640389537 
Productpredicted protein 
Protein accessionXP_001383691 
Protein GI150864733 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAGAAGT CCAAGATCAA GTCCATCTTG CCCAAGCCGT CGCCACTGCA TCCCCAGCAG 
ACGCCCTCGC CAAAATTGTC TGGCTCCGCA ACTCCACTTT CTGGAAAAAG ACGGTCTGTG
GCCTCAGGAC TAGCAGACTC TAAGAAGCGA AGATCTCTTC CTACCAACAT CAGCTCGAAC
ATGAGCTCAG TTCCAGCCAT AGTTCCTATA GCCCCTGCTC CTGTTTCTAT CAGCATGGCG
GAAGACGATA GTCTGGAAAA GTCGAAGCAA ACCGGCCACA GACCTGTGAC TTCGTGCACC
TTCTGTCGCC AGCACAAGAT CAAATGTAAT GCTCTGGAAA ACTACCCCAA TCCATGTCAA
CGCTGTGAAA GAATGGGTCT TAAGTGCGAA ATCGACCCTC AGTTCAGACC CAAGAAAGGA
TCCCAGATCC AATCGCTCAA AAGCGATGTA GACGAACTCC GGGCCAAGAT CGAAATCCTC
ACAAAGAATG AATCGCTTTT GACCCAGGCA CTAAATCAAC ATAACATCTT GCAACAGCAG
CAACAACAGC AGCAGCAGCA GCTGTATACA CCTAGGGCAC AGTCAACCCA TTCTACAAAC
TCGCCTGTAA ACTTCCAGTC GCCGCAGCTC TATCCTGCTG GAACTTACCA ATCCTCCCCG
AACTCTATAT CGTTACCATC GGGCCACCTC AATGACTCTG TAAATAGCAG CACCAACCCT
AACCAACTCG CCCATGTAAT TCAGGAAGGC TCCGATACTT CGCCGTCTAC AAACAACACG
CCTAACTCTC AACATTTGAA TAGAGAAGAA GTGCAATATG TCTCAGAGTT CATACTTGGA
GAAGTACATC TTCCTCTAGA CAAGGCGAAC GACTTGCACC ATATCTTCAT GACAAAGCAT
CTTCCTTTCT TGCCCATTAT CACCTCTCGA TCGGCGACAG AATTGTACCA TAAATCGCAA
CTCCTTTTCT GGACAGTAAT ACTAACTGCA TCTCTCTCGG AACCTGATCC AACGTTGTAC
ATGTCGTTAG CTTCGTTGAT CAAGCAGTTG GCTATTGAAA CCTGCTGGAT CAGAACACCC
AGATCGACCC ATGTGATTCA GGCTCTCATC ATCCTCTCCA TCTGGCCCTT GCCCAACGAA
AAGGTGTTGG ATGACTGCTC GTATCGTTTT ATTGGTCTTG CTAAAAATTT ATCGCTTCAG
TTAGGTTTAC ACAGAGGTGG TGAGTTCATT CAAGAATTTA GTAGAACCCA AGTAAGTCTT
GGCCCCGATG CGGAACGTTG GAGAACCAGA TCATGGATTG CCGTTTTCTT CTGTGAGCAG
TTCTGGTCCT CGGTCTTAGG TTTACCTCCT TCCATCAACA CCACGGACTA TTTACTAGAA
AATGCCAGGG TGGACCAGAC ATTGCCCAAA GACTTTAGGT GCTTGATTTC GTTGTCAATT
TTCCAGTGTA AACTCGTAAA TGTCATGGGT ATTTCTGTAA CAAGACCCGA TGGTTTATTG
GAGCCTCTGA ACAGAGCTGG CTCGCTAAAC ATCTTGGATC GTGAGTTGGA AAGATTGAAG
TTCAAATTGA ACATTGTAGA TGGATCTGCA ATTGAAATCT ACTATCTCTA TGTGAAGTTG
ATGATCTGCT GCTTTGCCTT TTTGCCAGGC ACGCCAATCG AAGACCAGGT TAAATACGTC
AGCTCAGCTT ACTTATCAGC TACCAGGGTT GTCACGGTGT CTTCGCAAAT GTTGAAGGAC
AATATTCTGT TGATAGAATT GCCCATATAT GTCAGGCAGG CGATGACGTA CTCAGTGTTG
CTTCTTTTCA AGTTGCACTT GTCGCGTTAC TTAATTGACA AGTATGTGGA CAGCTCCAGG
CAGCTGATAG TCACGGTACA CAGATTGTTA AGAAACACTT TGTCGTCATG GAAAGACTTG
AAAAACGATA TATCCAGAAC AGCAAAGGTA TTGGAAAACT TGAACATCGT TCTCTATACC
TACCCTGACA TCTTGTTGAA CGATAATTTA GAAGCTGGTG GTAGTATTAT CAACAGGATG
AGATCACACC TAACTGCATC CTTGTTCTAC GACTTGGTGT GGTGTGTCCA CGAAGGTAGA
AGAAGAACTA TGATTGACAA GTCGAAAAAG TCAGAGTCTT TGGAAGACAC GAAAATTCCA
CCCAACTCCA CTACATCTAC TTCTGTCAGT AAAAGACCCG CACCTTTGCC GTTCTATAAC
CAGATCACCA AGGACGACTT CAAAACCATC ACCACCACTA CACCCAACGG GACTACCATT
ACTACTTTGG TTCCTACTGA TCAAGCTATG AACCAGGCTA GAAACGCATC TGGAAACAAA
CCTTTGGAAA TCAATGGTAT ACCTTTGGCT ATGTTAGAAG CCACAGGTAG TATCAAGGAT
ACTATCCGAG AGCTGCAGAC TCCAGCTCCA GAGGTAGACA ATCCAACCAC AAATGCTCCT
ATACTTCCAT CTACAGTCAA AATCAAACTG GAGTACGACA ATGTTGTTTC TACACCTCAA
CAGCCGTTAT TGGCTCACCA ATCACAGGCG CTTCAGCACC AGTCGTTTGC TATGCTTGGA
CACCAACCTG TAGATACTAC GCCTAACCAA CCGATGTTCA TAAATAGTGA CTCGATGCAA
ATCCAGCAAC CCAGCTTGGT GGATCCAGCA AGCGCACAAG CTACACCTAT CCAGTATATT
GGTGCTCCCA TCAATGGAGT CGCTGATCAG ATGGATAACT TCTTCCAGCA GCAGTCTAAC
GGTTGGTTGA ATAACGATAA CTACCAAGAT GATGATTTCC TCGGTTGGTT CGACGTGAAC
ATGCGTTCCG ATCAATAAAC CAATTGTCCC TAAAATAATG AAATGACTTG TTCCTATTAT
GTCCCTCTAT TCTCTGTCTA CTCTCTGAGT TTAATAATCT TATTATTATG TATATATTTT
CCATCTATAT TCATTATTTG TTGATTGAAT TTGTTATGAT TGCTAAAAGA GTACTATATA
CTATTAACTG CTTGTTAATA TAAATTTTCA AA
 
Protein sequence
MEKSKIKSIL PKPSPSHPQQ TPSPKLSGSA TPLSGKRRSV ASGLADSKKR RSLPTNISSN 
MSSVPAISKQ TGHRPVTSCT FCRQHKIKCN ASENYPNPCQ RCERMGLKCE IDPQFRPKKG
SQIQSLKSDV DELRAKIEIL TKNESLLTQA LNQHNILQQQ QQQQQQQSYT PRAQSTHSTN
SPVNFQSPQL YPAGTYQSSP NSISLPSGHL NDSVNSSTNP NQLAHVIQEG SDTSPSTNNT
PNSQHLNREE VQYVSEFILG EVHLPLDKAN DLHHIFMTKH LPFLPIITSR SATELYHKSQ
LLFWTVILTA SLSEPDPTLY MSLASLIKQL AIETCWIRTP RSTHVIQALI ILSIWPLPNE
KVLDDCSYRF IGLAKNLSLQ LGLHRGGEFI QEFSRTQVSL GPDAERWRTR SWIAVFFCEQ
FWSSVLGLPP SINTTDYLLE NARVDQTLPK DFRCLISLSI FQCKLVNVMG ISVTRPDGLL
EPSNRAGSLN ILDRELERLK FKLNIVDGSA IEIYYLYVKL MICCFAFLPG TPIEDQVKYV
SSAYLSATRV VTVSSQMLKD NISLIELPIY VRQAMTYSVL LLFKLHLSRY LIDKYVDSSR
QSIVTVHRLL RNTLSSWKDL KNDISRTAKV LENLNIVLYT YPDILLNDNL EAGGSIINRM
RSHLTASLFY DLVWCVHEGR RRTMIDKSKK SESLEDTKIP PNSTTSTSVS KRPAPLPFYN
QITKDDFKTI TTTTPNGTTI TTLVPTDQAM NQARNASGNK PLEINGIPLA MLEATGSIKD
TIRESQTPAP EVDNPTTNAP ILPSTALQHQ SFAMLGHQPV DTTPNQPMFI NSDSMQIQQP
SLVDPASAQA TPIQYIGAPI NGVADQMDNF FQQQSNGWLN NDNYQDDDFL GWFDVNMRSD
Q