Gene Information |
Locus tag | PICST_58795 |
Symbol | |
ID | 4838910 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009044 |
Strand | + |
Start bp | 627998 |
End bp | 632281 |
Gene Length | 4284 bp |
Protein Length | 1408 aa |
Translation table | 12 |
GC content | 40% |
IMG OID | 640390225 |
Product | predicted protein |
Protein accession | XP_001384087 |
Protein GI | 150865037 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0308] Aminopeptidase N |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.658129 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGAAATA TTCTGAGCCA ACCGTTCAAA GTAAGCCATC AGAGAGTTAA CATTGATGTA GATATGCTGC GGAATCGTAT TGATGGTTTC ACGGAATTAA CTCTAGTTCC ATTTACCAAC ACTTTAAAAG TAGTGAGATT AGATTGTAGA GAAATGAAGA TCACCAGAGT CACCATCAAC AACATGAAGC CATGCAATTA CATACATAAC GACATTTTGT ACATCAATGA TGGCAAGTAC TTCGACGAAG AAATCGTACT GGCCTATGAT GTCAACTTGT TTGACTTGTA TTCAGATGAA GTGTCTATTC ATCAACATCA TATGATCAAG CACAAGCTAG GGTATATCTT TGGGGAGAGC AATTACGATC CCAGAGATCC CCATGCCGAT GTATTCTCAA CTATTGTCAA TACTGAGGAG CTTTCAGTGA TGCTTCCTGA TAATTTGCGA CTTGAATTGA CGGACATCAA CTCGATCCAT ACTCCTGGGA GCCAACCGGG AACACTTACT CCTTTGCATT TGAAACTGAA AGCCACAAAC AGCGACATTT ATACACCTAT ACAGCTTCGG ATCGAGTACG AGCTTGTGAA TCCCAAGAAT GGAGTCAACT TTGTCTCTGA TAGTATAGAG AAGCGTAACT GGCACTCGTA CACAACTAAT AACACGTACA ATCTTTCGAC TTCGTCATGG GTTCCTTGCA TAGACAATTT ATGGGACAGA AGTACGTGGT CTCTTGAAGT AATCATCCCC AGAACAGTGC GAGATATAGG GAATCCTCGT ATCATAGGCT CAGAAGAAGC TATGCGAGGG TCGCGCAACC AGAAGAAGAA ACGTAGACTA AATAGAAACG ATGATTCAGA CATCGAGGAC GATGAAGACA ACGAAGATGA TGATAACGAA AACCATGATC TAGTGGTTTG CTCAGGAGAT TTCAACAATG TTAAGGAAAC ACCGCATTCG ATCGATTTGT CAAAGAAAGT AGTCTCATGG TCCATCTTTA ATCCTGTTTG TGCACATCAT GTAGGCTGGG CTCTCGGATG TTTTGAGAAC TTTGTCATTT CCGAGTCGAC AGAGTCAAGA GAAGTAGACG ACGAGATCAA AGAGAACTTT GAAGATATAG ACAAGGATGG AACCAGTTCT CCTATAACCA TTTACTGCCT TCCCGGTCAG ATTGAACAGG CTAGAAACAC TTGTATTTTC ACTATGAGAG CTATGGACTT TTTTCTGAAG GAATTTGGTT CTTTCCCGTT TAGTTCCTAC GGAATAGTTT TTGTTCAGGA CAGTGTAGTC GACACCAACA ACTTTGCTGG TTTATCGATA TTTTCAGATT CCATTTTGTA TCCTTCTGAT ATCATAGAGC CCATGTTTAC CAGCACGGAG GTGATTCTTG AAGCCATTTC CAGTCAGTGG TCGGGAATCA GCATCACTCC TCAAACAGTT AACGACATCT GGTGCACCAT AGGTATAGCA AAGTTCATGG TCTTTCAGTA CTTGAAGGAT TTGATGGGAA CCAACGAATA CAGATTCAAG ATCAAGAAGA TGATGAATAG AATTGTTGAA GAAGATAGAG GAAAGAAACC ATTGGGATAC CATTATTTCA GATTTCCTGT CTCTGACTCA GATTTAGACT TCATCAAGTT AAAGGCTCCG ATTGTGTTGT TCATTTTGGA TAAGCGTATG ACCAAGACCG ACAAATCATT TGGTTTATCA AGAGTGTTGC CCAAGTTGTT TCTTCAGGCG ATGTCTGGAG ATCTTCAGAA CGGTACCCTT TCTACGCAGC ATTTTCAGTA TGTTTGTGAG 
AAAGTCAATA GAAACAAACT AGAAAACTTC TTTAAGCAAT GGGTCTATGG AGTCGGAGCT CCAATTTTTA ATATCACGCA AAGATTTAAC AAGAAGCGTG GAGTAATCGA AATGAGCATT CGTCAAATTC AACATCAAGT TACTAGGAAA AGTGGAACAA ACGCTGAATC GTTTATAAAC GATTCCATAG CATATTTGGA AGACGAGCCT ACTTTTCCAG TTCAGTCTAT TTTCACTGGA CCTATGACTA TCAGAGTGCA TGAAGCTGAT GGGACTCCTT ATGAGCATAT AGTAGATTTA AAAGAGGGAA ACACAAAGCT TGATGTTCAA TACAATTCAA AATTTAGACG TATGAAGAAG AATCGTGATG AAACAAGTGA AAATGCAGTG ACTTTCAGCA GACTTGGAGA TGTTTTAGAA TCAGAAAAGG AGATGGAAGA ATGGAACTTG GCTGACTGGG CAAAAGTGGA TGATGATCCT ATGAATATAG AAGCGTTTGA GTGGATCAGA GTGGATGTAG ATTTCGAGTG GATTGCGAGA TTTGATGTCA AACAACCTGA CTACATGTTT GGTTCACAAT TACAACATGA CAGAGACGTT GAAGCCCAAT TTGACGCCGT CCGCTACTTG GGAAACATAG AAAAACCTTC TACGATCCAC TGCACTGCAC TAACTCGTAC GGTAGTTGAT GAAAGGTATT ACTATGGGGT AAGGATTGCT GCTGCTGAGG CATTGGCAAA CTTCTCCAAT TCTGTGACAA ACTTCATTGG TGTTCCTTAC TTGGTCAAAA TTTATAGAGA GCTCTACTGT TTTCCAGGCA GTTCCATTCC TTTGAGTAAT GATTTCAACG ACTTTGGTAG ATTCTTTTTA CAGAAAGAAA TCCCTAAGCA ATTGTGTAAA ATTAGAGATA GTGATGATGA GGTTCCCGTT GTCATCAGAA ATTTGATACT CAATTTGATC AAATTCAATG ACAATACCAA CAACAATTTC CAGGACAGTT TCTACATATC AGAATTGGTT CAATCATTGA CAACTTGTGC TGTTAATTCC AGTTTTCCCA ATTCGCCGAA GGATATATTT CCAAAGTCGC ATCCTCACGG GAGCCTGGAG AAGAAGAAGT TTGTTGCCAA TGTGATTACT GAAATCAATA GACTTCAGAA GTTAGACGAA TGGATACCTT CGTATCATAA TGTAGTTTCG GTGACTTGTT TGACTCAGAA AATTCGTTTG GCATTACATG GGCATTTAGA TCTTTCGTTC GAAGACTTGT TATACTTCAC TGTTGAAAAG TTTCCAATTC AGATAAGAGT GGAAGCTTTC CGTGGTCTTT TTGTGCTTGG TGGTTTGAAG AATAGACATA TCTTGAACTA TTTCTTGAAG GCATGCCTCT TGGATGTTCG ATCAGCTGCT TACAGAGAGC TACTCATTTC GGCTCTAATT GATTCGATAT GTGTCGCTGC TGTCAGTGGC ACTCCTTCTA CGTTAGACGA TCCCGAGTTC AAGCCATTTG AAAAATTAAG CGAATCTAAA ACTGGGGCCA GTACAGCATT GGCCAATATG ATTATTGTTG AAGATGGATC ACACAATGAG ATGGACGAAA AAAGAGATGT GTTTGCGAGA GCCACTGTAA GTGGAGCTAT TGATATATTG AGAAGAGATT ATTCCATTGG AAAAGGCTTA AAGCGTACTA TATGGGAGCT ACTTCATACG TCCTTATTGA GCATCCGTGA GAAGCGTAAC TTGTTTTTGA TTTGTCAGAT CTTGTACAAA GAGATAGATC TGTTCGTAGT TAAGATTCCT ATTCCGAATG 
TTCCTTTGAA TGAATTGACG AAAAAGATCG TTACGAAGAA CTTGGGCAAC GGAAAGGTTG TATTTAAGAG ACAGGGTAGA TTCAAGATTC AGTTGGCATC ACAGAAACTT CTTATTGAAA AGCCTAAGGC CAAGGAACCT AAAGTTTCTC ACAAGAAGCC TGTCGACAGA ACAGTGAACG AGGTAGCCCT TAATCCTGAT GAGTCAAAGG AACCTAAATT GAAGTTGAAC CTTAGTATCA AGCCAAGCGC TCCTCCGGCA GCTCCACCTC CAACGGAATC TCCTACTAAG AAACAAAAGA AGCAAGCTCC AGCACCTAAA AAGCATATTC CGGTTGTATC TATTGGAACG CACAATTCGA GAAAATTCGA ATTGTCGTTC AAGTTCCAAG ATCACAGTTT GCCAGAGTTG AAGCCAGTGT CTGGAATTTC AGTGAAGGTA GGTAGAAGTC AAGTGTCTGT AGACGGAGGC AAAGTAAAGT TCAACTTTAA GGGTGCTTTC AAATCGAGGT TCAGAGATTT GATGGCCAAG CCAGTGGAAC CAGCACCACC ACAAAAGCCC AAACCCACAT TACCAGCGGT GACTACCGTA GAGCCTAATG CACGGTATGT TAAGATTTTA ACCAAAGAGA AGAAAGTTTT GATATCGTCT ACGCCTTTTG AAGTTTCAAA GGAG
|
Protein sequence | MRNISSQPFK VSHQRVNIDV DMSRNRIDGF TELTLVPFTN TLKVVRLDCR EMKITRVTIN NMKPCNYIHN DILYINDGKY FDEEIVSAYD VNLFDLYSDE VSIHQHHMIK HKLGYIFGES NYDPRDPHAD VFSTIVNTEE LSVMLPDNLR LELTDINSIH TPGSQPGTLT PLHLKSKATN SDIYTPIQLR IEYELVNPKN GVNFVSDSIE KRNWHSYTTN NTYNLSTSSW VPCIDNLWDR STWSLEVIIP RTVRDIGNPR IIGSEEAMRG SRNQKKKRRL NRNDDSDIED DEDNEDDDNE NHDLVVCSGD FNNVKETPHS IDLSKKVVSW SIFNPVCAHH VGWALGCFEN FVISESTESR EVDDEIKENF EDIDKDGTSS PITIYCLPGQ IEQARNTCIF TMRAMDFFSK EFGSFPFSSY GIVFVQDSVV DTNNFAGLSI FSDSILYPSD IIEPMFTSTE VILEAISSQW SGISITPQTV NDIWCTIGIA KFMVFQYLKD LMGTNEYRFK IKKMMNRIVE EDRGKKPLGY HYFRFPVSDS DLDFIKLKAP IVLFILDKRM TKTDKSFGLS RVLPKLFLQA MSGDLQNGTL STQHFQYVCE KVNRNKLENF FKQWVYGVGA PIFNITQRFN KKRGVIEMSI RQIQHQVTRK SGTNAESFIN DSIAYLEDEP TFPVQSIFTG PMTIRVHEAD GTPYEHIVDL KEGNTKLDVQ YNSKFRRMKK NRDETSENAV TFSRLGDVLE SEKEMEEWNL ADWAKVDDDP MNIEAFEWIR VDVDFEWIAR FDVKQPDYMF GSQLQHDRDV EAQFDAVRYL GNIEKPSTIH CTALTRTVVD ERYYYGVRIA AAEALANFSN SVTNFIGVPY LVKIYRELYC FPGSSIPLSN DFNDFGRFFL QKEIPKQLCK IRDSDDEVPV VIRNLILNLI KFNDNTNNNF QDSFYISELV QSLTTCAVNS SFPNSPKDIF PKSHPHGSSE KKKFVANVIT EINRLQKLDE WIPSYHNVVS VTCLTQKIRL ALHGHLDLSF EDLLYFTVEK FPIQIRVEAF RGLFVLGGLK NRHILNYFLK ACLLDVRSAA YRELLISALI DSICVAAVSG TPSTLDDPEF KPFEKLSESK TGASTALANM IIVEDGSHNE MDEKRDVFAR ATVSGAIDIL RRDYSIGKGL KRTIWELLHT SLLSIREKRN LFLICQILYK EIDSFVVKIP IPNVPLNELT KKIVTKNLGN GKVVFKRQGR FKIQLASQKL LIEKPKAKEP KVSHKKPVDR TVNEVALNPD ESKEPKLKLN LSIKPSAPPA APPPTESPTK KQKKQAPAPK KHIPVVSIGT HNSRKFELSF KFQDHSLPEL KPVSGISVKV GRSQVSVDGG KVKFNFKGAF KSRFRDLMAK PPNARYVKIL TKEKKVLISS TPFEVSKE
|
| |
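The fields above can be cross-checked with a few lines of code. The following is a minimal sketch, not the database's own method: the function names (`gene_length`, `gc_percent`, `translate`) are hypothetical, and the codon string is the compact amino-acid string for NCBI translation table 12 (alternative yeast nuclear code), in which CTG is read as serine rather than leucine. It assumes inclusive 1-based coordinates, which is consistent with the Start bp, End bp, and Gene Length values in this record.

```python
# Compact codon table for NCBI translation table 12 (alternative yeast
# nuclear code). Codons are ordered with bases T, C, A, G at each position;
# the only difference from the standard code is CTG -> S instead of L.
BASES = "TCAG"
TABLE_12 = (
    "FFLLSSSSYY**CC*W"
    "LLLSPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given inclusive 1-based coordinates (assumed)."""
    return end_bp - start_bp + 1


def clean(seq: str) -> str:
    """Remove the whitespace that separates the 10-base blocks and upper-case."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    return 100.0 * sum(1 for base in s if base in "GC") / len(s)


def translate(seq: str, table: str = TABLE_12) -> str:
    """Translate complete codons of an unspliced coding sequence."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - len(s) % 3, 3):
        codon = s[i:i + 3]
        index = (16 * BASES.index(codon[0])
                 + 4 * BASES.index(codon[1])
                 + BASES.index(codon[2]))
        protein.append(table[index])
    return "".join(protein)


if __name__ == "__main__":
    print(gene_length(627998, 632281))                     # coordinates from this record
    print(translate("ATGAGAAATA TTCTGAGCCA ACCGTTCAAA"))   # first 30 bases of the gene
```

With the coordinates of this record, `gene_length` gives 4284, matching the Gene Length field, and the first 30 bases translate to MRNISSQPFK, matching the start of the protein sequence (the fifth codon, CTG, yields S under table 12). Because the gene is marked as spliced, translating the full genomic sequence end to end is not expected to reproduce the listed protein exactly; the intron would need to be removed first.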