Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_85755 |
Symbol | ARP5 |
ID | 4840820 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009048 |
Strand | + |
Start bp | 565408 |
End bp | 567751 |
Gene Length | 2344 bp |
Protein Length | 775 aa |
Translation table | 12 |
GC content | 44% |
IMG OID | 640392135 |
Product | vacuolar targeting, actin-related protein |
Protein accession | XP_001386503 |
Protein GI | 150866790 |
COG category | [Z] Cytoskeleton |
COG ID | [COG5277] Actin and related proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.116456 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.301272 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ACACTTTACA CATCATATGG CTCCGACAAA AGTGAAGACG GAGGAGCCCG AGCTTCCACC GCAGAAAGTG CATCTTCTCC GAGACGTGGT GCCTCCCACA GACCTCGATC CTATCTATTC CAACTACCAG ACTGGGGTAC CAATAGCACT TGATATTGGC TGTTCCAGCT TCAAAATTGG TCTCACGAAT TCGCCGGAAC CACACAACAT CTTTCCATCA GTAGCTGCAC GCTACCGGGA CAGAAAGGCA TTGAAAACTC TTACTCTTGC CGGAAAAGAT GTCTACAGAG ACTCACTAGT CAGATCCTCC ATTAAGACAC CCTTCGATGG ACCCCTCGTG ACCAACTGGG ACTACATGGA GGTTTTACTC GACTATTCCT TGGCCCATTT GGGTGTTGTT GGCGATAATG GAAGGCTTAA TAACCCTCTC ATCTTGACGG AACCAGTAGG AGTACCGTTC TCACAGCGTA AGAACATGTA TGAAATTCTA TTTGAGGCGT ACCAAGCACC CAAAGTGACA TTTGGAATCG ACTCATTATT TTCATTCTAC GCTAACTCAA CTTCATCTAC AGCTAGTGGT CTTGTAATTG GCACTGGGCA CGAACTGACT CACGTCATCC CAGTTCTCCA TGGTAAAGGT ATTCTTTCAC AAACGAAGAG AATCGACTTT GGGGGCCATC AGGCTGAGCA GTTCCTCGGA AAGTTGTTGC TGCTAAAGTA TCCCTACTTC CCCTCTAAAT TAAATGCTCA TCATACATCC AACTTATTCC GTGATTTCTG CTACGTTCTG AAAGACTACC AGGAAGAAAT AGATCATATT TTGGATATGG ACAAGTTGGA AGAGGCAGAT ATTATAGTCC AGGCACCTGT AGAGATCAAT GTAGGAACTG AAAAGAAGAA ACTGGAGGAA GAATTGGCTC GTCAGGCTGC TAAACGTAGA GAACAGGGAA AGAGATTGCA AGAGCAAGCC CAACAGAAAC GGTTGGAGAA ATTGATCCAA AAACAAGAGG AATGGGACTA TTACTCGAAG TTCAGAGAGG AATCTGAAAA GCTCAATAAG CTGGAACTAC AGGCCCGTTT AGAAACTGAT GGCTTTGACG ATCTCGCTGA CTTCAACAAA TATATGTCTG GATTGGAGAA GTCGTTAAAA AAGGCACATG ATCAAGACAT TGGAGAAGGA GATAACCATG AGGTAGATCC AGCCAGCGCT TGGCCACTTT TAGATACACC TGATGACCAA TTGACTGAAG AGCAGATCAA GGAAAAGAGA AAGCAGAGGC TCCACAAAGC CAATTACGAT GCCAGGGAGC GTTCAAAGGA GTTAAAGAGA CAGCAGGAAG AAGAAAAGGC GCAATACGAA CGTGAACAGC AAGAATGGAG AGAAAAGGAT TTGGAAGACT GGTGTAACGT CAAGCGAATC CACTTGGCTG GCTTAATAAG TAAATACAAA GAGAGTATAA AACTTTTGGA ATCTTTCAAA GATAGAAAAT CTGCTGCTGC ACAACAAAGA ATGAAAAACA TCGCCGATTT GGCGAACGAT GAGAGCGGAT CGACCTCCGC TGCTTCAAGA AAGAGAAGAA GAAATGCCAA CTCTACTATC GACAACGACC CCAACGACAC GTTTGGTGCG AATGATGACG ATTGGGCAGT TTACAGAGAT ATCAGCAATC AGAAAATTGA AGAAGAACTA GGTGAAACCA ACCAGGAAAT CTTGAGTTTG GAAGAGGAGC TCTTAAAATT CGATCCTAAC TTCCATCACG AAGATACATT CGCTGCTTCT CAAACATTTG ACTGGAGAAA TCTGGTTTTG CACAAATTCA TCCATGGGCC ACGTCAAAAT ATCACGATAG CCATGCAGGC AGAGGGCATT AACCCAGATG AAATCGACAA TCACCCCGAG ATCATTCGTA AGAACCATCA GATCCATGTA AATGTAGAGA GAATACGTGT ACCAGAGATT TTGTTCCAGC CTCATATCGC TGGGCTTGAC CAGGCTGGTA TCTCAGAGAT TCTGAGCGAT TTGTTGAATA GGAGCTTTGG TTCCAGTTTT TATGAAGGTG GTGACTCTCT CAACTTAATC CGAGATGTAT TTGTAACAGG TGGTTTAGCC CATTTACCTA ACTTTACCAC CAGAGTCACC AACGATTTTA CAAGTTTCTT GCCTGTTGGT GCTCCTATTC GTGTACGTAC TGCCAGAGAC CCTATTGGAG ATTCCTGGAG AGGAATGCAG AAATGGGCAT CCAGTGAAGA ATGCGAAAAC AGCTACATTT CTAAGGCAGA TTACGAAGAG TATGGCCCAG AGTATATCAA GGAACACGGA CTTGGTAATG TTAGCTTACG GTAA
|
Protein sequence | MAPTKVKTEE PELPPQKVHL LRDVVPPTDL DPIYSNYQTG VPIALDIGCS SFKIGLTNSP EPHNIFPSVA ARYRDRKALK TLTLAGKDVY RDSLVRSSIK TPFDGPLVTN WDYMEVLLDY SLAHLGVVGD NGRLNNPLIL TEPVGVPFSQ RKNMYEILFE AYQAPKVTFG IDSLFSFYAN STSSTASGLV IGTGHESTHV IPVLHGKGIL SQTKRIDFGG HQAEQFLGKL LSLKYPYFPS KLNAHHTSNL FRDFCYVSKD YQEEIDHILD MDKLEEADII VQAPVEINVG TEKKKSEEEL ARQAAKRREQ GKRLQEQAQQ KRLEKLIQKQ EEWDYYSKFR EESEKLNKSE LQARLETDGF DDLADFNKYM SGLEKSLKKA HDQDIGEGDN HEVDPASAWP LLDTPDDQLT EEQIKEKRKQ RLHKANYDAR ERSKELKRQQ EEEKAQYERE QQEWREKDLE DWCNVKRIHL AGLISKYKES IKLLESFKDR KSAAAQQRMK NIADLANDES GSTSAASRKR RRNANSTIDN DPNDTFGAND DDWAVYRDIS NQKIEEELGE TNQEILSLEE ELLKFDPNFH HEDTFAASQT FDWRNSVLHK FIHGPRQNIT IAMQAEGINP DEIDNHPEII RKNHQIHVNV ERIRVPEILF QPHIAGLDQA GISEISSDLL NRSFGSSFYE GGDSLNLIRD VFVTGGLAHL PNFTTRVTND FTSFLPVGAP IRVRTARDPI GDSWRGMQKW ASSEECENSY ISKADYEEYG PEYIKEHGLG NVSLR
|
| |