Gene Information |
Locus tag | PICST_31732 |
Symbol | |
ID | 4838827 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009044 |
Strand | - |
Start bp | 1542725 |
End bp | 1545934 |
Gene Length | 3210 bp |
Protein Length | 1069 aa |
Translation table | 12 |
GC content | 36% |
IMG OID | 640390142 |
Product | predicted protein |
Protein accession | XP_001384601 |
Protein GI | 150865400 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
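The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions and that the CDS ends in a stop codon that is not counted in the protein length. The record does not state either assumption, but both are consistent with its numbers. The function names are illustrative only.

```python
# Values copied from the Gene Information table.
START_BP = 1542725
END_BP = 1545934


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 3210 1069
```

Both results match the Gene Length (3210 bp) and Protein Length (1069 aa) fields of the record.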
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACAGACA TTCTACCTAT CTCTTGCATC TCCTGCAGAA GGAAGAAGAT AAAGTGCAAC AAGTTGAAAC CTTGCAACCA ATGCATTAAG AGATCCATAC CGTGCGAATT TCCTCCTACA TTTAGAAATA TCAAAATCAA TGAAGAAGAA TTAGATTTCG CAGCCACTTC GAGCAACATG AGTAGGGCCC TAGAAAATTT TCCTCAACAG GGATTTGCAG CTCCACAAAC TGGAACGATC GAACATAATC TGGATGCTAC CGAAGCTAGT TTTGTCAGCC TAAGAAGTGA ATTGGACTTG TTGAAAAACG AAAATTTGCA ATTATTGCAG GAGAATTTAC GGTTGAACCA ACAGATTTCC CGAACAGGCA TCCCTAATAA TCCAAGTATG TTAAAATATC AACAGCCCGT GCCTATCGTT CAAGAGGGAA GGTCAGAAGA CTTGCATGCT GCTCCGATCT CGATCCCACC AGATCAATCT GAGACAGATG AAAAATTCTT CTTTCCGCAA TCTGACATTT ACGGGATTGA GGCTCAATTT GCAAAACAAC GAAGAGCATC CCAAGCAAAA GAAAATAACA CAGATGGAAT TTCAAAGCTA CAAAATGTTT CTTCAAGCAC TGGAAGTGAT ATTACTTCTC ACCAAGCTTT GAAAAAGGCA AGGATGATGT TTGCAAATAC AGACGTATTA GATAATTCTT CTACTTCTTC CGAATCGAAA ATTTCAGCTC AATGGGAAAA ATTAAATGAC GACTCAAGTA AAGAATCAAA GGGAAACAAT TTAATCCAAC AAAGTTTGCT CAAGAAAAAA TTACCAACTT TAATTCTTGC TTTACTAAAA TATGATGATC CAGATTTTAG CTCTGATTCG GACACATTTC AAGATTTGCA GAGATGGAAT TACAATGTTA TAATTAGGCT AGTTGAACTA TTCTTCGAAA AGAATAATTA TTACGGCACA TTTATTTCCC AACTGAAAGT TTTTGAATTT TTAAAGGCAT ACCCAAATTT AAAGGATAAG GAATGGGAGT ACGATGATGA TTTGGTTTTG CTTTATTTGA TTTTGATACT TTCTGTTCAA AAATTGACAC CCAAAGAATT CTTGGATCTT GAATTACTTC CAGCATCTTC TTCTAAAAAC ATCCGAAAAT TCAGAAACTA CTTATCTAAG AACATATTGT ACAACAGTTT TGAAAAGCTT CGGCATAATT TGATTAATGA GTCTCTGCTC ACTATTCAGG CTTACATATT ATGTACCGAA TGGTTATTTC TTGAACATAA GTTTGAAGAA TGTTGGTCGA TGATGTTTCA TACTTGTTCA ATTGCTTATT CTATCGGATT GCATGTAATT GGACAAGTGA AGAGAACTCC TTCTTTGAGT ATTCCTGAGA GAATGTCCGC TGATAAAATA AATGAATCAG ATGAAACAGA CAATGAAGAA GACGATGACA GAAGAAAAAT TTGGTTTGCT TTGAAGAATC TAACAGCTCA AATATGTTCT GTCCTAGGAA GGCCAAATCC AATATCAATT CAGGTGAATG GTCAAATTTC TGAACTTGGA AATCAAAAAC TTTCGGATAA AATAAACTAC ACAACATTAA AAATTGGATT AAGTGAGTGT TTAAGGCTAT CAAATTTGAT GTTGATTGAA AACTTCATGA TTGACTTCAC TATTGAGGAA GTTCTACCTT TAGAAGAAAA ATTTAGGGAA GAGATAGCAA AATTGGAACT GGTGCTTAAT GATGATATAT TGAGGACAAA TAAAACACAA GGAGAGTCAG AGTGGAATAA AGTAGACCAA 
ACCAATCTTT TAATGGATTT GATAACTCTC TACATAAACA GAGCCAAGTT ACTTGAACCA TTTCTTAAGA AATTTGAAGA CCAAGAAAAG CACAACATTA TTATTGAAAG ATTTGTCAAT TCCATACTTC AAAGTTTTGA ATTGTTAAAT TATTTTGTCG AGATTTTTTT GAACCAGTTT TTTGAAAAGA ATACAATCCC TCCAAGGAAG AGATCAAGTG GTAATATACT CGCTCCAAAA GATGAGAAGA TAGAGAATCA AAGTTCCCAA CAAGATGTGA ATTCTCTAAT AAAATTTGAA AGGGTATTCC GTGTGTACTT TCCGTTTCTT ACGTCCTTCA TCTACCAAGG AATCATAGTG ATTTTCACAT TTCTTCATTG CAAATTTAAA TTGTTTGTTA ATAATAATCC ATCTTCTTTA TTAAACAACG AATTGCTCAA GCATATTGAG ATTAATTTGA ATACTTTGAC GAACTTTGAT AGCAGGATCT CAACCAGATT GAACTGTATT TCTAAACTTT GGTCTGCCAA CATCAGATAT TTGATTGATC GAGTTCTAAT TTACATCAAA ATGATTTACG AAAGACAAGA GGACAAGTTT CTGCAGTTAT CAGAAAAGAA GAGAAGAAGG GTGCTGCAAA ATAATCTACT TTCAGGGGAC CAAGATACAA ATCTTAAAGT ATTTGATTTC AATACTGGGA GACAAAGTGA GCAACCATTA GAAAATACGG AGAATCCTTT GGAAAGTCCA GAGCTTGAAT ATTTGTATGG ATTCCAATTC AACGATCCTT TTTGGTTAAC AAATCCAGAG AATTTGCCCT ATTACCTTAG TTCTCCAAGT GATGACGATA AATACAACAA CAAGCTTACT CCTACAAAGA GATCACAACC ATCCACAGAC TCAGCTATAT CTTATGGCGT TGGGTCTATG ACAGTTGAAC CATCGTTAGC CAAGCCAATC TCCAGTCAGA TGCCGGTCCC AATACATGGA GATGGTATGT ATAGCGCACC TAATCTTTCT ACTCAGCAGC AAAACTATGG AAACCTCTGG CCGAATAGTA GTGCCCCTCT GATTTCACAG AATGTCATTC AAATTCTGCA ATCACAAGGA CAATCTTTCA CAGTTGGTTT CAACCAACAA CTCTCGTTGT TGTTCAATCA GCCTACATTT GGTAGTCATA ATTCGGTATA CAGTGGCTCT ACAGCTGGGC ATATTCCTTC CCAACAAGTC TCAGGTCAAA ACCCCCAAGT ACAACTTGCT CATCAACAAG TTATCCACCC ACCAATTGCC GATCCCCAGA TTCAGGAATA CAGAATTCAG GATCAGCACC AAGAACCAGT CCAGTCCAGA AATCCATTCC GGAACACTCT GCGAAACATG GGCTCCTCTG GATCCGACGA TTCTAGTTGA
|
Protein sequence | MTDILPISCI SCRRKKIKCN KLKPCNQCIK RSIPCEFPPT FRNIKINEEE LDFAATSSNM SRALENFPQQ GFAAPQTGTI EHNSDATEAS FVSLRSELDL LKNENLQLLQ ENLRLNQQIS RTGIPNNPSM LKYQQPVPIV QEGRSEDLHA APISIPPDQS ETDEKFFFPQ SDIYGIEAQF AKQRRASQAK ENNTDGISKL QNVSSSTGSD ITSHQALKKA RMMFANTDVL DNSSTSSESK ISAQWEKLND DSSKESKGNN LIQQSLLKKK LPTLILALLK YDDPDFSSDS DTFQDLQRWN YNVIIRLVEL FFEKNNYYGT FISQSKVFEF LKAYPNLKDK EWEYDDDLVL LYLILILSVQ KLTPKEFLDL ELLPASSSKN IRKFRNYLSK NILYNSFEKL RHNLINESSL TIQAYILCTE WLFLEHKFEE CWSMMFHTCS IAYSIGLHVI GQVKRTPSLS IPERMSADKI NESDETDNEE DDDRRKIWFA LKNLTAQICS VLGRPNPISI QVNGQISELG NQKLSDKINY TTLKIGLSEC LRLSNLMLIE NFMIDFTIEE VLPLEEKFRE EIAKLESVLN DDILRTNKTQ GESEWNKVDQ TNLLMDLITL YINRAKLLEP FLKKFEDQEK HNIIIERFVN SILQSFELLN YFVEIFLNQF FEKNTIPPRK RSSGNILAPK DEKIENQSSQ QDVNSLIKFE RVFRVYFPFL TSFIYQGIIV IFTFLHCKFK LFVNNNPSSL LNNELLKHIE INLNTLTNFD SRISTRLNCI SKLWSANIRY LIDRVLIYIK MIYERQEDKF SQLSEKKRRR VSQNNLLSGD QDTNLKVFDF NTGRQSEQPL ENTENPLESP ELEYLYGFQF NDPFWLTNPE NLPYYLSSPS DDDKYNNKLT PTKRSQPSTD SAISYGVGSM TVEPSLAKPI SSQMPVPIHG DGMYSAPNLS TQQQNYGNLW PNSSAPSISQ NVIQISQSQG QSFTVGFNQQ LSLLFNQPTF GSHNSVYSGS TAGHIPSQQV SGQNPQVQLA HQQVIHPPIA DPQIQEYRIQ DQHQEPVQSR NPFRNTSRNM GSSGSDDSS
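The record lists translation table 12 (the alternative yeast nuclear code), in which the codon CTG is read as serine rather than leucine. The sketch below is a minimal illustration, not the database's own pipeline. It builds a codon table from the standard code, applies that one substitution, and translates the first 60 bases of the gene sequence shown above. The helper names `build_codon_table` and `translate` are hypothetical.

```python
from itertools import product
from typing import Dict

BASES = "TCAG"
# Standard genetic code (NCBI table 1), codons ordered TTT, TTC, TTA, TTG, TCT, ...
STANDARD_AA = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def build_codon_table(ctg_as_serine: bool) -> Dict[str, str]:
    """Return a codon -> amino acid map; optionally apply the table 12 change."""
    codons = ["".join(c) for c in product(BASES, repeat=3)]
    table = dict(zip(codons, STANDARD_AA))
    if ctg_as_serine:
        table["CTG"] = "S"  # table 12: CTG encodes serine instead of leucine
    return table


def translate(dna: str, table: Dict[str, str]) -> str:
    """Translate DNA in frame 1, ignoring whitespace and any trailing partial codon."""
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3
    return "".join(table[seq[i:i + 3]] for i in range(0, usable, 3))


TABLE_1 = build_codon_table(ctg_as_serine=False)
TABLE_12 = build_codon_table(ctg_as_serine=True)

# First 60 bases of the gene sequence, copied from the record.
FIRST_60 = "ATGACAGACA TTCTACCTAT CTCTTGCATC TCCTGCAGAA GGAAGAAGAT AAAGTGCAAC"

print(translate(FIRST_60, TABLE_12))  # MTDILPISCISCRRKKIKCN
print(translate("CTG", TABLE_1), translate("CTG", TABLE_12))  # L S
```

The translated prefix matches the first 20 residues of the protein sequence in the record. The full gene sequence can be passed to `translate` in the same way; the final `*` in the output marks the stop codon.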