Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_90139 |
Symbol | |
ID | 4839825 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009046 |
Strand | + |
Start bp | 1195775 |
End bp | 1197819 |
Gene Length | 2045 bp |
Protein Length | 510 aa |
Translation table | 12 |
GC content | 42% |
IMG OID | 640391140 |
Product | predicted protein |
Protein accession | XP_001385581 |
Protein GI | 150866098 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.405101 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | CATTCGTGAC AATTATTTAG AACAAAACTC ACAAATACTA GTGACATAAA GTTATATACT GAGAGGTATT ACACAGATTA GTAAAACACC ATTGTTCTCT TTTGGCTCTC GTACCCTCAT CCATGAATTA ACTGTACCAG ATCATCACAG AAACACCAGA AACCCAACAT TCGCGTCTTG CCTCATCTCA CGACAACCCT ATACCTGCTA CGGGAAGAAT ATAGAGAATC TTGTTGATAT TGTCAGGCTT TGGATACTTA TAGATCTTCT ACTACATTAA ACTGACTTCT GGAATCAAAT TTCATTAACT AAACACATCC TAGACATATT TCTAGATCTA ACTAAAGTCG TTTACCATTC GTATACATTT GCATTTATTG AAACTGTCAC CATAAGTCAG TATTCACTTT ACCCACTTTT CCATATTCAT TCACATCCTC ATAATTTATT CCAGGTCAGC ATGCTCAAAG AGAACGATAT CATATCAAAG GTGCTTCTCA CCATCGACCA CTGCAAAGCT AACATGGCTC TGTTCAAGAC AGCCATCACC AGCAACGAGT ATACTCCGCT TAACTCCAGC TGTTTCAGCC GAACCCTTGT CTGGAAAGCA TGTCTCATCA CCGATAGCTT GAAAATCCAC ACCTGGGAGT CGAAGCTCAG CGATTCTCGC GTAGTTTACC ACCAGCTCAC AAAGAGAGAC GATATGGCGG TGCCGTGGTG GCATCTTGAA TCTGATAGCA GTTTTTATTC TAGTAGAGAA ATGTCCCGTA AGCCAAGCTT GAAGAACTCC AATTCTGCTG CCAAAAGAAG CCGGAGCTTG GGTAAAGTCC CTCTCACCCG GGTTTCCAAC GTCGAAGACC CTCTTTCCAG TCACAGTAGA TCCAGAAGCA GCACGCCTAC GATTCCCTAC GAATACACGG AAGAAGATCT CGAGCTATTA CAAACCATCA TTCTCGATAT TGACCGTCTT TTCCCTGGGG AAGAATTTTT CCATTCATCC AATGCCACTT CTGTTGTAGC CAAGAAACAG ATGATCGAAA TTTTGTATGT TTGGGCCAAA TGCAATCCAC AAGTGGGCTA CAAACAGGGT ATCCACGAGA TCTTGGGTTT ATTGTACATC AATCTCAGTA AAGAAGCTGT GACAATACCT ATTTCAAACA CTATATCAGC AGATGACTTG AAAATCCTCA CAATGTTCGA TATTCACTAC TTGTCGCATG ATTTGTTTAC CATTTTCAAC AAACTTATGC TACAGAGTGG AGTCGTGACC CGCTTTTACG AAAACGAAAA TGTGCTCTGG CAGTCCATCG AGAAATTCAA CGTATACTTG ATGAAAGTGG ATCAATTAAT TCACTACAAT TTGATTCAGA AACTAAGATT AGAATCTCAA CTCTGGATTA TTCGCTATTT ACGTCTTCTA TTATTGCGTG AATTGGGAAA CGACTTGGAA ACAACCATCT TACTCTGGGA CAAGTTGGTA GCATCACAAT TCTCTCACCA TAACGGAAAT ACAATCACTG CCATCCCCGA GCTCATCATG TTCATGATCA TCACACTTTT GATCCAGTTG AAGACCCCAT TGATTACATG CGACTTCTCT GAAGGTTTGT CTTTACTTTT GCATTATCCC GTTCCATCGG GATTGAAATC TAGTGCTTCA AGAAGCGACT TCATCGCTGC TCTCTATAAG GACGCCGCTA GACTTTATGA GCGTAGAGAT AATGATCTCA AGTTGTACGA GTACGGTCTC AAACTAAACA ATACATACAA TCGCAATCTC AAAATCACCA TGAGCTACTC TGGGAGCGCT AGAAATAGCA CCGACTCAGC CGGTAGCGGA CGTGGTACTT CTATAAGCCC GTCACCTACT CCTCCCACAT CTCAGTTGCC ACCAGCTCCT CCTGGTGGTT CAAAAGAAGA ACAGATGAGG TTTGAGAAGA TGCGCCTTGA GATGCGCTTG AAGAAGAAAG CGCAACTGAT GTTACGAAAC TAAATAATTG CATTCTAGAA ACGATAAGTA ATACTTCTAA TAGATCATCA CAGTT
|
Protein sequence | MLKENDIISK VLLTIDHCKA NMASFKTAIT SNEYTPLNSS CFSRTLVWKA CLITDSLKIH TWESKLSDSR VVYHQLTKRD DMAVPWWHLE SDSSFYSSRE MSRKPSLKNS NSAAKRSRSL GKVPLTRVSN VEDPLSSHSR SRSSTPTIPY EYTEEDLELL QTIILDIDRL FPGEEFFHSS NATSVVAKKQ MIEILYVWAK CNPQVGYKQG IHEILGLLYI NLSKEAVTIP ISNTISADDL KILTMFDIHY LSHDLFTIFN KLMLQSGVVT RFYENENVLW QSIEKFNVYL MKVDQLIHYN LIQKLRLESQ LWIIRYLRLL LLRELGNDLE TTILLWDKLV ASQFSHHNGN TITAIPELIM FMIITLLIQL KTPLITCDFS EGLSLLLHYP VPSGLKSSAS RSDFIAALYK DAARLYERRD NDLKLYEYGL KLNNTYNRNL KITMSYSGSA RNSTDSAGSG RGTSISPSPT PPTSQLPPAP PGGSKEEQMR FEKMRLEMRL KKKAQSMLRN
|
| |