Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_42423 |
Symbol | |
ID | 4837459 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009042 |
Strand | - |
Start bp | 633953 |
End bp | 636295 |
Gene Length | 2343 bp |
Protein Length | 726 aa |
Translation table | 12 |
GC content | 38% |
IMG OID | 640388774 |
Product | predicted protein |
Protein accession | XP_001382884 |
Protein GI | 150864166 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.00348392 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGTTGGACG AAGAACTCAT AGACGACAAG ATGAGGCTAG CCATCTTGTA CTTCAAAGCC GCGGACTTTG AACGAGCGCT CAATTTGTAC AATGAATTGG TCGAGATGGT GGCTTCTATC CTGGCTGTAG AAGCGCAAAA AATCAGAAAA CACGTCTACA ACTTGGCAGA AAAGCCCATT GTGGGAGCAT GTGTACATCC GAAGTTGGGA CTGATTCTCG ACCAACGAGC TGCCACTCAC GAGAAGTTGG ACCAGTTCTC TAAAGCACTA GAAGACTCTC GTAGAATAGT CAAATTAGAG CCGATTAGTT GTAAAGGATA TCTCAGGGTT GGTAAATTGC TTCTCAAGTT GAAGCAGGAC GTAGAAGCGT ACAAAACCTA CCAAAGAGGT GTATACATTA TCGAAAAAGT TATAAAAGAG CATCTGGTTT CAGTACCAGA GAAGCTCTTC TCGCAATTGA AAACTCAATA TAAGCTGTTG AACAAGACGC TCAAGACTAA AAGACAAAAC AAATCTCAAG AACTGCAATA TCTATCAAGA GAAGATTCGC ATGAACTGGA TAGTTCTCTA AGAATGAAGT TTTCTCAGGG AAGCAGCAAT GCCAAGGGCA TAACATTGTC TCAGGAAAGT GGTCAGGCAC TGAGACTCTC TAGTATCAAT GGCTTTACTA CTGCATTGGA ACATATGCTT CCTTTGAAAA GAACAGCTTC CACCTCTCAA ATATCAGCCC CAACAACTAG TTCCAAGAGA GCCAAGCGTA TGGGTTCAGT TTCAAAACCG ATAGCAGACC CCTTCCAGCA TCTCCCGCTA GAAATCATAG AGCTCATATT CCAAAACTTG GCCTTTAAAC AACTCCTATC TTGTCATCTA GTCAACAAAT TGTGGTACTA TAATTTGACT AAGATTCCCA ATTTGTATTG TACTCGTGTC AATTTAAAGT CGAACATCAC ACTTTCTGAA TTCACCCATG GGATTCGATT GGTTAAAAGA GTAGCCCACA AGTCGTATTC ACAACAAATC AAAGCCCTTA AAATGCGATC CGCAATCAAT GCTTCACAAT TGCAAAAAAT CATAGAAATA ATCATTAATG AACCAAACTT TTCCATACGA TCGTTGGATA TGTTTGACAA ATATCTTAGT TTTCAGCTTC TATTGAATAA GCTCAGCAAG TTCAGTTGGA AATTGAACAA CTTGTCCAAT TTGGAGTACC TTCGCTTAGG AGTGAATTCA AGTCTTCGAT ATGAGAATAT CATATTGGAA TTATTCAAGA AGTTGAAGAC TCTCTATATT GTAATCCTAT ACTCCGACAT GAGTGGACCT AATAACCAAA TTCTACCTAC TACAGAAAAG TATTTCAATC GTTTGCACGA GAAATCAGTC AATAATTCAG ATGATTACGA ATCTATGCTG AATTTGACTC TTGTAAACCA TCCGAAGTTA CTACCAGGAG AAAGTCAAGT AGCTCCGAAG TTTGAAACTT ACAATCCATA CCCAATTTTC CTTGATAGAA GCTTTTCAAA TTTGGTAGAA TTGAGTTTGG TCTCTTTTGA TTTCTTTAAT AGGTTGCCTC TTTTGGGAGA ATTTTTCTGT AAATGCAAAC TGTTGACCAA ATTGATGCTT GAAAACAATT TTAACTTTCG TATGCTTGAT TTGTTTCAGA TGCTCCAAAA TTATAACCCA AGTTTCCGGT TGGAGAAATT ACTATTCAGA GAGCCCAAAA TCATTAGTAC GACCACCATG AATGAATTCT CAACCGATGA CTTACCTCAG TTGAACAGCT TGCAACTGTT GGATGTGTAC GGCTCATCAT TGACCACAAA GGGGTTGATG AAATTGCTTA GAATTTGCAA CAAAGGAAAT AAGGAACTTA CCACTTTAGT AATGGGAAAT TCCACATACT TGCATTTCAA GACCGATGCT TTTCAGACTA GCAATCGGAA CCTATTTTCA TTTGTACAAA TGCTTCGAAT TGCACCCAAC TTAGAGAATC TATACCTCAA CGAATTAGAT ATAGACAATC AAACTATGAA GCAGCTTAAT AAGGATATAG AATCTATTGG TTATGTCAAT TGCAAGCTTA AAGTGCTTGA CTTGAGTTTC TGCAACAGAG TTGAGGGTAT TGGACTAATT GATTTGTTCA AAGCGTTTCC TTCTAATATT AAACAGATTA ACGAGAATAG TGGAAGTGCA TTTCGGATCC GAGAATTAAT CATAGACGGT ATAGAGATCA ACATAGCTAC ATTGCGTTTA CTACAGAAGC ACAATTTTGT CAGCACCATC AAAAATGATC CCAACAAGAA GAGATGGAGA CAGTACGGGG TGAATACTTT GGTTCCAGTT TGA
|
Protein sequence | MLDEELIDDK MRLAILYFKA ADFERALNLY NELVEMVASI SAVEAQKIRK HVYNLAEKPI VGACVHPKLG SILDQRAATH EKLDQFSKAL EDSRRIVKLE PISCKGYLRV GKLLLKLKQD VEAYKTYQRG VYIIEKVIKE HSVSVPEKLF SQLKTQYKSL NKTLKTKRQN KSQESQYLSR EDSHESDSSL RMNSKRAKRM GSVSKPIADP FQHLPLEIIE LIFQNLAFKQ LLSCHLVNKL WYYNLTKIPN LYCTRVNLKS NITLSEFTHG IRLVKRVAHK SYSQQIKALK MRSAINASQL QKIIEIIINE PNFSIRSLDM FDKYLSFQLL LNKLSKFSWK LNNLSNLEYL RLGVNSSLRY ENIILELFKK LKTLYIVILY SDMSGPNNQI LPTTEKYFNR LHEKSVNNSD DYESMSNLTL VNHPKLLPGE SQVAPKFETY NPYPIFLDRS FSNLVELSLV SFDFFNRLPL LGEFFCKCKS LTKLMLENNF NFRMLDLFQM LQNYNPSFRL EKLLFREPKI ISTTTMNEFS TDDLPQLNSL QSLDVYGSSL TTKGLMKLLR ICNKGNKELT TLVMGNSTYL HFKTDAFQTS NRNLFSFVQM LRIAPNLENL YLNELDIDNQ TMKQLNKDIE SIGYVNCKLK VLDLSFCNRV EGIGLIDLFK AFPSNIKQIN ENSGSAFRIR ELIIDGIEIN IATLRLLQKH NFVSTIKNDP NKKRWRQYGV NTLVPV
|
| |