Gene PICST_53391 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_53391 
Symbol 
ID4851627 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009068 
Strand
Start bp2377181 
End bp2380069 
Gene Length2889 bp 
Protein Length941 aa 
Translation table 
GC content38% 
IMG OID640393335 
Productpredicted protein 
Protein accessionXP_001387028 
Protein GI126275088 
COG category[L] Replication, recombination and repair 
COG ID[COG1111] ERCC4-like helicases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0564935 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
CTCGGTCCAA CTCATCACAA GATTGACTAC GGTAACTTGG ATAAATATGT CTATCCCTCA 
AACTTCGAAG TTCGAGATTA TCAATTCAAC ATTGTGCAAA GAGCCTTTTA CCATAATCTT
CTTGTGGCTC TTCCAACTGG TTTGGGAAAG ACTTTTATTG CATCCACGGT TATGTTGAAC
TTTCTCCGGT GGTTTCCGGA ATCAAAGATG ATTTTTGTGG CACCCACAAA GCCTTTGGTT
GCACAGCAAA TCAAAGCTTG CTGTTCTATC ACGGGTATTC CTAGCTCCAA AGTAGCTATT
CTTCTAGATA AAACGAGAAA GAATAGAGGC GAGATTTGGG ATGAAAAGCA GGTCTTTTTC
ACAACACCAC AAGTTGTTGA AAACGATTTG GCCTCTGGTT TAGTGGATCC CAAGACAATA
GCACTTCTAG TTATCGATGA AGCTCATCGT GCAAAAGGGA ACTATGCCTA CAACAATATT
GTCAAGTTCA TGGATAGATT TACCAACTCC TATAGAATTT TAGCATTGAC GGCTACTCCA
GCTTCAGATG TAGATGGAGT CCAAGAAATC ATCGACAACT TGAACATATC TAAGGTGGAG
GTACGTTCTG AAGAAAGTAT TGATATCATC AAGTATATGA AGAGGAAACG TATCATCCGA
CGAAACATAT ATCAATCTGA TGAGATCAAG GAATGTATTG ATTTACTATG TACTGCAATT
GCTCCAGTAT TAAAAGTAGC AAATGGGAAA GGAATACTAG AAATTACAGA CCCTCTGAGA
ATCAACTTCT TCCAATGTAT GGATGCTTCA CGTAAAATCG TAGCTAATCC CACAATTCCT
GAGGGCACAA AGTGGTCCAA TTTCTTTACA TTGCAATTGT TGGGAGTTGT CGGACAATGC
TTCCGAAGAC TAAATGTATA TGGATTACGT TCTTTCTTCA GTTACTTCAA TGAGAAGTAT
ACAGAGTTTA TGGCTAAGCA TAGCAAGAAG AAATCATCGA ACAAGTTAAA CGCCGATTTC
TACTTCAGTG AGCCAATCAA ACAATTAATG AAAAGAATAA GGACAATGAT AGACGATCCT
AAAGTCTTCA GTCATCCTAA AATAGAAGCC ATGATGGAAG AGTTAGATGA GTTCTTCACT
ATTAACAACG CCACTGACTC GAAAGTAATT ATCTTTACAG AATTCAGAGA ATCTGCTCTT
GAGATTGTTC GGTTTATCGA GAAGGTAGGC AAGAATTTGA AACCACACAT ATTTATTGGA
CAAGCCAAAG AAAGAGACAA ATTTGACGAG TCAAATTTTG GCAAAAAAAG TAAAGGTAAA
AGAGTTGGCA AGAAACAACA AGATGATTCA AAGTCGAGCT CTGAAAATGC TCAAATTAAC
GGTATGAATC AGAAACTTCA GAAAGAAATC ATTAAGAATT TTAAACAAGG AACGTATAAC
ATTTTGGTGG CAACTTCTAT TGGTGAAGAA GGCTTGGATA TTGGAGAAGT TGATTTGATC
ATCTGCTATG ACTCTACTAG CTCACCTATA AAAAATATTC AAAGGATGGG AAGAACTGGA
CGTAAACGTG ATGGTAAAGT AGTTCTCCTT TTTTCTAGTA ACGAAGAATC CAAATTTGAC
AAAGCTATGA ACGGTTACGA GTATATTCAG CAACATATCA TGAAGGGACA ACTTATCGAT
TTAAAAGAAC AGAATAGAAT GATTCCTAAA GACTGGGAAC CAAAGGTGGA AATGAGGTTC
ATAGAAATCC CTGAAGAAAA CCATGAGCTA CAGGTGGTAG ATGATGAAGA TGAAATCATC
AGAATTGCTA CTCAGTATAT GATGGGAGGT AAACTGAAAA AGAAGAAAGC AGCAGCAAGC
AAAAAAGGCA AAACAAAAGA AAAACGAGCC AAGCAGTTTT TCATGCCTGA TAATGTAGAG
ATTGGTTTCA GAAGTGTAAC CAGCATGGTA AGGGCAGTTG GATCTAGTAA GTCCTTGGAA
GAAGAAAAGA AGGAAGAAAA AGTAAGGGAT GTGTTAGATA GAATAGTAGA TTCCGATAGT
GACGAAGAAA TTCCACTCGG TTCAATACCT ATACCAAGAA GTGAAGTGAT TGCGCATAAA
CAATCCACCA CAGATGAACA ATTACTTGAA AGAGATTGCC AATCTGGTTC TAATATTTCA
GATCGTACAC TTGACCAACA CCATTCAGCG AGCGAAGAGA GGGGCATTAA TTCTAACTTC
AGTCATGAAA GTAACCTTCC TACTCCTCCT GAAAATTCTC CACCAAAAAG AAAGCTGATT
GTACTAGAAG AGGCACGCAT TGCAAAAAAG AAACACAAGA AAAGTTTGGG AATTCGAAAG
CCGACAATAC GTCCTCCCAG CATCATAGAC CAATTGAAGA AGCAGAAAAG CAAAATTATA
CGTCCTGACT CAGCAAATGA AACTATTTGT CTCGACGAAG ACGACATACT TCTTCCGGAA
TATACAGTCA CAGGCTTCTA TGAGACTTCA GCGTCTAAGA ATGAAAATCC AACAGATGAA
ATACTGCAGG AGAATATTAC TGAAAAAGAG GTAACAGTCC AAGAAGATAG AAGAGAAATA
GAACACGATG ATGACAGTGA GATTTTTGAT GATGGGTTAG ACGAACAATT GGCAATGATA
GATGATATGA ATACAACTAA ATCATTTGTG GAGCCCACAA GAATAGATTT TAAAGATGAA
GTATTCAAGA ATGACTTCGA TGAACATGAA GGATTCTTGA ACAACGATGA GCTTATGGAA
CTTCATACCT CGTATTTCAC AGCCATAGAT CCTATGGACA AGGTATTTTA TTATGATCCC
TCATCGAGTG TTCATGTTGA CGGAGCCAAT CGGGAATACG CTTTCTATGG TAAGATTGGC
CACAGCAAA
 
Protein sequence
LGPTHHKIDY GNLDKYVYPS NFEVRDYQFN IVQRAFYHNL LVALPTGLGK TFIASTVMLN 
FLRWFPESKM IFVAPTKPLV AQQIKACCSI TGIPSSKVAI LLDKTRKNRG EIWDEKQVFF
TTPQVVENDL ASGLVDPKTI ALLVIDEAHR AKGNYAYNNI VKFMDRFTNS YRILALTATP
ASDVDGVQEI IDNLNISKVE VRSEESIDII KYMKRKRIIR RNIYQSDEIK ECIDLLCTAI
APVLKVANGK GILEITDPLR INFFQCMDAS RKIVANPTIP EGTKWSNFFT LQLLGVVGQC
FRRLNVYGLR SFFSYFNEKY TEFMAKHSKK KSSNKLNADF YFSEPIKQLM KRIRTMIDDP
KVFSHPKIEA MMEELDEFFT INNATDSKVI IFTEFRESAL EIVRFIEKVG KNLKPHIFIG
QAKERDKFDE SNFGKKSKGK RVGKKQQDDS KSSSENAQIN GMNQKLQKEI IKNFKQGTYN
ILVATSIGEE GLDIGEVDLI ICYDSTSSPI KNIQRMGRTG RKRDGKVVLL FSSNEESKFD
KAMNGYEYIQ QHIMKGQLID LKEQNRMIPK DWEPKVEMRF IEIPEENHEL QVVDDEDEII
RIATQYMMGG KLKKKKAAAS KKGKTKEKRA KQFFMPDNVE IGFRSVTSMV RAVGSSKSLE
EEKKEEKVRD VLDRIVDSDS DEEIPLGSIP IPRSEVIAHK QSTTDEQLLE RDCQSGSNIS
DRTLDQHHSA SEERGINSNF SHESNLPTPP ENSPPKRKLI VLEEARIAKK KHKKSLGIRK
PTIRPPSIID QLKKQKSKII RPDSANETIC LDEDDILLPE YTNITEKEVT VQEDRREIEH
DDDSEIFDDG LDEQLAMIDD MNTTKSFVEP TRIDFKDEVF KNDFDEHEGF LNNDELMELH
TSYFTAIDPM DKVFYYDPSS SVHVDGANRE YAFYGKIGHS K