Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_53391 |
Symbol | |
ID | 4851627 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009068 |
Strand | - |
Start bp | 2377181 |
End bp | 2380069 |
Gene Length | 2889 bp |
Protein Length | 941 aa |
Translation table | |
GC content | 38% |
IMG OID | 640393335 |
Product | predicted protein |
Protein accession | XP_001387028 |
Protein GI | 126275088 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1111] ERCC4-like helicases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.0564935 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | CTCGGTCCAA CTCATCACAA GATTGACTAC GGTAACTTGG ATAAATATGT CTATCCCTCA AACTTCGAAG TTCGAGATTA TCAATTCAAC ATTGTGCAAA GAGCCTTTTA CCATAATCTT CTTGTGGCTC TTCCAACTGG TTTGGGAAAG ACTTTTATTG CATCCACGGT TATGTTGAAC TTTCTCCGGT GGTTTCCGGA ATCAAAGATG ATTTTTGTGG CACCCACAAA GCCTTTGGTT GCACAGCAAA TCAAAGCTTG CTGTTCTATC ACGGGTATTC CTAGCTCCAA AGTAGCTATT CTTCTAGATA AAACGAGAAA GAATAGAGGC GAGATTTGGG ATGAAAAGCA GGTCTTTTTC ACAACACCAC AAGTTGTTGA AAACGATTTG GCCTCTGGTT TAGTGGATCC CAAGACAATA GCACTTCTAG TTATCGATGA AGCTCATCGT GCAAAAGGGA ACTATGCCTA CAACAATATT GTCAAGTTCA TGGATAGATT TACCAACTCC TATAGAATTT TAGCATTGAC GGCTACTCCA GCTTCAGATG TAGATGGAGT CCAAGAAATC ATCGACAACT TGAACATATC TAAGGTGGAG GTACGTTCTG AAGAAAGTAT TGATATCATC AAGTATATGA AGAGGAAACG TATCATCCGA CGAAACATAT ATCAATCTGA TGAGATCAAG GAATGTATTG ATTTACTATG TACTGCAATT GCTCCAGTAT TAAAAGTAGC AAATGGGAAA GGAATACTAG AAATTACAGA CCCTCTGAGA ATCAACTTCT TCCAATGTAT GGATGCTTCA CGTAAAATCG TAGCTAATCC CACAATTCCT GAGGGCACAA AGTGGTCCAA TTTCTTTACA TTGCAATTGT TGGGAGTTGT CGGACAATGC TTCCGAAGAC TAAATGTATA TGGATTACGT TCTTTCTTCA GTTACTTCAA TGAGAAGTAT ACAGAGTTTA TGGCTAAGCA TAGCAAGAAG AAATCATCGA ACAAGTTAAA CGCCGATTTC TACTTCAGTG AGCCAATCAA ACAATTAATG AAAAGAATAA GGACAATGAT AGACGATCCT AAAGTCTTCA GTCATCCTAA AATAGAAGCC ATGATGGAAG AGTTAGATGA GTTCTTCACT ATTAACAACG CCACTGACTC GAAAGTAATT ATCTTTACAG AATTCAGAGA ATCTGCTCTT GAGATTGTTC GGTTTATCGA GAAGGTAGGC AAGAATTTGA AACCACACAT ATTTATTGGA CAAGCCAAAG AAAGAGACAA ATTTGACGAG TCAAATTTTG GCAAAAAAAG TAAAGGTAAA AGAGTTGGCA AGAAACAACA AGATGATTCA AAGTCGAGCT CTGAAAATGC TCAAATTAAC GGTATGAATC AGAAACTTCA GAAAGAAATC ATTAAGAATT TTAAACAAGG AACGTATAAC ATTTTGGTGG CAACTTCTAT TGGTGAAGAA GGCTTGGATA TTGGAGAAGT TGATTTGATC ATCTGCTATG ACTCTACTAG CTCACCTATA AAAAATATTC AAAGGATGGG AAGAACTGGA CGTAAACGTG ATGGTAAAGT AGTTCTCCTT TTTTCTAGTA ACGAAGAATC CAAATTTGAC AAAGCTATGA ACGGTTACGA GTATATTCAG CAACATATCA TGAAGGGACA ACTTATCGAT TTAAAAGAAC AGAATAGAAT GATTCCTAAA GACTGGGAAC CAAAGGTGGA AATGAGGTTC ATAGAAATCC CTGAAGAAAA CCATGAGCTA CAGGTGGTAG ATGATGAAGA TGAAATCATC AGAATTGCTA CTCAGTATAT GATGGGAGGT AAACTGAAAA AGAAGAAAGC AGCAGCAAGC AAAAAAGGCA AAACAAAAGA AAAACGAGCC AAGCAGTTTT TCATGCCTGA TAATGTAGAG ATTGGTTTCA GAAGTGTAAC CAGCATGGTA AGGGCAGTTG GATCTAGTAA GTCCTTGGAA GAAGAAAAGA AGGAAGAAAA AGTAAGGGAT GTGTTAGATA GAATAGTAGA TTCCGATAGT GACGAAGAAA TTCCACTCGG TTCAATACCT ATACCAAGAA GTGAAGTGAT TGCGCATAAA CAATCCACCA CAGATGAACA ATTACTTGAA AGAGATTGCC AATCTGGTTC TAATATTTCA GATCGTACAC TTGACCAACA CCATTCAGCG AGCGAAGAGA GGGGCATTAA TTCTAACTTC AGTCATGAAA GTAACCTTCC TACTCCTCCT GAAAATTCTC CACCAAAAAG AAAGCTGATT GTACTAGAAG AGGCACGCAT TGCAAAAAAG AAACACAAGA AAAGTTTGGG AATTCGAAAG CCGACAATAC GTCCTCCCAG CATCATAGAC CAATTGAAGA AGCAGAAAAG CAAAATTATA CGTCCTGACT CAGCAAATGA AACTATTTGT CTCGACGAAG ACGACATACT TCTTCCGGAA TATACAGTCA CAGGCTTCTA TGAGACTTCA GCGTCTAAGA ATGAAAATCC AACAGATGAA ATACTGCAGG AGAATATTAC TGAAAAAGAG GTAACAGTCC AAGAAGATAG AAGAGAAATA GAACACGATG ATGACAGTGA GATTTTTGAT GATGGGTTAG ACGAACAATT GGCAATGATA GATGATATGA ATACAACTAA ATCATTTGTG GAGCCCACAA GAATAGATTT TAAAGATGAA GTATTCAAGA ATGACTTCGA TGAACATGAA GGATTCTTGA ACAACGATGA GCTTATGGAA CTTCATACCT CGTATTTCAC AGCCATAGAT CCTATGGACA AGGTATTTTA TTATGATCCC TCATCGAGTG TTCATGTTGA CGGAGCCAAT CGGGAATACG CTTTCTATGG TAAGATTGGC CACAGCAAA
|
Protein sequence | LGPTHHKIDY GNLDKYVYPS NFEVRDYQFN IVQRAFYHNL LVALPTGLGK TFIASTVMLN FLRWFPESKM IFVAPTKPLV AQQIKACCSI TGIPSSKVAI LLDKTRKNRG EIWDEKQVFF TTPQVVENDL ASGLVDPKTI ALLVIDEAHR AKGNYAYNNI VKFMDRFTNS YRILALTATP ASDVDGVQEI IDNLNISKVE VRSEESIDII KYMKRKRIIR RNIYQSDEIK ECIDLLCTAI APVLKVANGK GILEITDPLR INFFQCMDAS RKIVANPTIP EGTKWSNFFT LQLLGVVGQC FRRLNVYGLR SFFSYFNEKY TEFMAKHSKK KSSNKLNADF YFSEPIKQLM KRIRTMIDDP KVFSHPKIEA MMEELDEFFT INNATDSKVI IFTEFRESAL EIVRFIEKVG KNLKPHIFIG QAKERDKFDE SNFGKKSKGK RVGKKQQDDS KSSSENAQIN GMNQKLQKEI IKNFKQGTYN ILVATSIGEE GLDIGEVDLI ICYDSTSSPI KNIQRMGRTG RKRDGKVVLL FSSNEESKFD KAMNGYEYIQ QHIMKGQLID LKEQNRMIPK DWEPKVEMRF IEIPEENHEL QVVDDEDEII RIATQYMMGG KLKKKKAAAS KKGKTKEKRA KQFFMPDNVE IGFRSVTSMV RAVGSSKSLE EEKKEEKVRD VLDRIVDSDS DEEIPLGSIP IPRSEVIAHK QSTTDEQLLE RDCQSGSNIS DRTLDQHHSA SEERGINSNF SHESNLPTPP ENSPPKRKLI VLEEARIAKK KHKKSLGIRK PTIRPPSIID QLKKQKSKII RPDSANETIC LDEDDILLPE YTNITEKEVT VQEDRREIEH DDDSEIFDDG LDEQLAMIDD MNTTKSFVEP TRIDFKDEVF KNDFDEHEGF LNNDELMELH TSYFTAIDPM DKVFYYDPSS SVHVDGANRE YAFYGKIGHS K
|
| |