Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_58854 |
Symbol | |
ID | 4838381 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009044 |
Strand | - |
Start bp | 425674 |
End bp | 427218 |
Gene Length | 1545 bp |
Protein Length | 502 aa |
Translation table | 12 |
GC content | 43% |
IMG OID | 640389696 |
Product | predicted protein |
Protein accession | XP_001384378 |
Protein GI | 150865242 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG3579] Aminopeptidase C |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGCTCCA ACACATCGAA GGAAGTCAAG ACTACTACCG AGGAAACCTT TAACGAAAAG GTTTCGTACT CGTCTTCTCT GTCGCAGTAC GATTCGTATT CACACTTATT GAGTAGAATG GATGTCTCAA GAGAAGACGT CGAAGCCAAA GAAGATGAAG ACAATGGTGA TGGCAACGGT ATTTGGCCAG GTATTTCCGT CGGATTGCTT GATGGTTGGA AAGGTGACGT CTTGAGAGAC GATAAGAACA AATTGGTTCA GAACTCTCTT GCTATCAATC CTATCCAGCT GATTATCGCC AAGTCAGACG TCGAAACTGT CTTAAAGGAC CAGTACTTCT TCAACGTCAC AGTCAAGACA ATTGGATCAC CTTCCTATTT CAACAACCAA AAACTGTCCG GTAGATGTTG GATCTTTGCT GCTTCCAATG TGTTCAGAAC TCTGGTTATC AAGAATTATA ACTTGAAGGA TGATCTGTTC CAGTTGTCGC AAGCTTATCT TTTCTTCTAT GACAAATTGG AAAAATCTCA TTTCTTCTTG GATAACATCG CCGACACTGC TGATCACGAC TTGGACTCAA GATTAGTTCA GTATCTTCTT TCCAGTCCTG TTGGTGACGG TGGTCAATGG GATATGATTG TCAACTTGGT AGAGAAATAC GGTCTTGTTC CACACCAAGT GTTCCCAGAT AATGCCCAAG CCTCAAACTC TTCTCCTTTG AACTATTTGG TCACCGAGAA GTTGAGGGAA GCTGCTTTGA TCATCAGAAG ATTGTACCAA GAAAAGGCAC CTCAGCCTGT CATTGAAATT CTTAAGGGTG CAACCGTCTA TACTGTGTTC AAGATTCTTT CTTTGGCTTT GGGTTCTCCA CCAAATGCTG ATGAACCTTT CACTTGGGAA TACATCGACA AGGATGGCAA GTACAAGTCC TATCAAACGA ATCCTAGAGA CTTCTACAGG GACCATGTCA GACTTGATGC CGCTAAACAC TTCTCGTTGA TCCACGACCC TAGAAACGAC TATGATAAGT TGTATACTGT GGACAGATTA AACAACATTT TAGGCGGTAA GAAGATCGAA TACGTCAATA CTGAAATTGA CGAGATCAAG CTGGTTGCTA TCAAGATGTT GAAGGACGAT GAGCCCATCT TCTTTGGTTC TGATGTAGGC AAGTTTGGTG ACAGGTCTTC TGGTGTTTTG GACGTTACAG CATACGACTA CAAGTTGGCT TTCAATATCT CCTTGGGTTT GGACAAGGCT GAAAGATTGA GAACCGGCTC ATCTCAAATG ACCCATGCTA TGGTGATTAC TGGTGTTCAC CTTGATCCTG TAACTCAGCT TCCTGTCAGA TGGAAGATCG AGAATTCGTG GGGTGATGCC GTCGGCGACA AGGGTTACTT TGTTATGTCG GACGAATGGT TCAGTGAATA CGTGTTCCAG ATTGTCACCA ACAAGAAGTA TGCCTCTAAG AAGACATATG ATACTTGGAA GGGTAAAGAC TTCACTGTCT TGCCTTATTA TGATCCTATG GGCTCATTAG CTTAA
|
Protein sequence | MGSNTSKEVK TTTEETFNEK VSYSSSLMDV SREDVEAKED EDNGDGNGIW PGISVGLLDG WKGDVLRDDK NKLVQNSLAI NPIQSIIAKS DVETVLKDQY FFNVTVKTIG SPSYFNNQKS SGRCWIFAAS NVFRTSVIKN YNLKDDSFQL SQAYLFFYDK LEKSHFFLDN IADTADHDLD SRLVQYLLSS PVGDGGQWDM IVNLVEKYGL VPHQVFPDNA QASNSSPLNY LVTEKLREAA LIIRRLYQEK APQPVIEILK GATVYTVFKI LSLALGSPPN ADEPFTWEYI DKDGKYKSYQ TNPRDFYRDH VRLDAAKHFS LIHDPRNDYD KLYTVDRLNN ILGGKKIEYV NTEIDEIKSV AIKMLKDDEP IFFGSDVGKF GDRSSGVLDV TAYDYKLAFN ISLGLDKAER LRTGSSQMTH AMVITGVHLD PVTQLPVRWK IENSWGDAVG DKGYFVMSDE WFSEYVFQIV TNKKYASKKT YDTWKGKDFT VLPYYDPMGS LA
|
| |