Gene PICST_85755 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_85755 
SymbolARP5 
ID4840820 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009048 
Strand
Start bp565408 
End bp567751 
Gene Length2344 bp 
Protein Length775 aa 
Translation table12 
GC content44% 
IMG OID640392135 
Productvacuolar targeting, actin-related protein 
Protein accessionXP_001386503 
Protein GI150866790 
COG category[Z] Cytoskeleton 
COG ID[COG5277] Actin and related proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.116456 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.301272 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ACACTTTACA CATCATATGG CTCCGACAAA AGTGAAGACG GAGGAGCCCG AGCTTCCACC 
GCAGAAAGTG CATCTTCTCC GAGACGTGGT GCCTCCCACA GACCTCGATC CTATCTATTC
CAACTACCAG ACTGGGGTAC CAATAGCACT TGATATTGGC TGTTCCAGCT TCAAAATTGG
TCTCACGAAT TCGCCGGAAC CACACAACAT CTTTCCATCA GTAGCTGCAC GCTACCGGGA
CAGAAAGGCA TTGAAAACTC TTACTCTTGC CGGAAAAGAT GTCTACAGAG ACTCACTAGT
CAGATCCTCC ATTAAGACAC CCTTCGATGG ACCCCTCGTG ACCAACTGGG ACTACATGGA
GGTTTTACTC GACTATTCCT TGGCCCATTT GGGTGTTGTT GGCGATAATG GAAGGCTTAA
TAACCCTCTC ATCTTGACGG AACCAGTAGG AGTACCGTTC TCACAGCGTA AGAACATGTA
TGAAATTCTA TTTGAGGCGT ACCAAGCACC CAAAGTGACA TTTGGAATCG ACTCATTATT
TTCATTCTAC GCTAACTCAA CTTCATCTAC AGCTAGTGGT CTTGTAATTG GCACTGGGCA
CGAACTGACT CACGTCATCC CAGTTCTCCA TGGTAAAGGT ATTCTTTCAC AAACGAAGAG
AATCGACTTT GGGGGCCATC AGGCTGAGCA GTTCCTCGGA AAGTTGTTGC TGCTAAAGTA
TCCCTACTTC CCCTCTAAAT TAAATGCTCA TCATACATCC AACTTATTCC GTGATTTCTG
CTACGTTCTG AAAGACTACC AGGAAGAAAT AGATCATATT TTGGATATGG ACAAGTTGGA
AGAGGCAGAT ATTATAGTCC AGGCACCTGT AGAGATCAAT GTAGGAACTG AAAAGAAGAA
ACTGGAGGAA GAATTGGCTC GTCAGGCTGC TAAACGTAGA GAACAGGGAA AGAGATTGCA
AGAGCAAGCC CAACAGAAAC GGTTGGAGAA ATTGATCCAA AAACAAGAGG AATGGGACTA
TTACTCGAAG TTCAGAGAGG AATCTGAAAA GCTCAATAAG CTGGAACTAC AGGCCCGTTT
AGAAACTGAT GGCTTTGACG ATCTCGCTGA CTTCAACAAA TATATGTCTG GATTGGAGAA
GTCGTTAAAA AAGGCACATG ATCAAGACAT TGGAGAAGGA GATAACCATG AGGTAGATCC
AGCCAGCGCT TGGCCACTTT TAGATACACC TGATGACCAA TTGACTGAAG AGCAGATCAA
GGAAAAGAGA AAGCAGAGGC TCCACAAAGC CAATTACGAT GCCAGGGAGC GTTCAAAGGA
GTTAAAGAGA CAGCAGGAAG AAGAAAAGGC GCAATACGAA CGTGAACAGC AAGAATGGAG
AGAAAAGGAT TTGGAAGACT GGTGTAACGT CAAGCGAATC CACTTGGCTG GCTTAATAAG
TAAATACAAA GAGAGTATAA AACTTTTGGA ATCTTTCAAA GATAGAAAAT CTGCTGCTGC
ACAACAAAGA ATGAAAAACA TCGCCGATTT GGCGAACGAT GAGAGCGGAT CGACCTCCGC
TGCTTCAAGA AAGAGAAGAA GAAATGCCAA CTCTACTATC GACAACGACC CCAACGACAC
GTTTGGTGCG AATGATGACG ATTGGGCAGT TTACAGAGAT ATCAGCAATC AGAAAATTGA
AGAAGAACTA GGTGAAACCA ACCAGGAAAT CTTGAGTTTG GAAGAGGAGC TCTTAAAATT
CGATCCTAAC TTCCATCACG AAGATACATT CGCTGCTTCT CAAACATTTG ACTGGAGAAA
TCTGGTTTTG CACAAATTCA TCCATGGGCC ACGTCAAAAT ATCACGATAG CCATGCAGGC
AGAGGGCATT AACCCAGATG AAATCGACAA TCACCCCGAG ATCATTCGTA AGAACCATCA
GATCCATGTA AATGTAGAGA GAATACGTGT ACCAGAGATT TTGTTCCAGC CTCATATCGC
TGGGCTTGAC CAGGCTGGTA TCTCAGAGAT TCTGAGCGAT TTGTTGAATA GGAGCTTTGG
TTCCAGTTTT TATGAAGGTG GTGACTCTCT CAACTTAATC CGAGATGTAT TTGTAACAGG
TGGTTTAGCC CATTTACCTA ACTTTACCAC CAGAGTCACC AACGATTTTA CAAGTTTCTT
GCCTGTTGGT GCTCCTATTC GTGTACGTAC TGCCAGAGAC CCTATTGGAG ATTCCTGGAG
AGGAATGCAG AAATGGGCAT CCAGTGAAGA ATGCGAAAAC AGCTACATTT CTAAGGCAGA
TTACGAAGAG TATGGCCCAG AGTATATCAA GGAACACGGA CTTGGTAATG TTAGCTTACG
GTAA
 
Protein sequence
MAPTKVKTEE PELPPQKVHL LRDVVPPTDL DPIYSNYQTG VPIALDIGCS SFKIGLTNSP 
EPHNIFPSVA ARYRDRKALK TLTLAGKDVY RDSLVRSSIK TPFDGPLVTN WDYMEVLLDY
SLAHLGVVGD NGRLNNPLIL TEPVGVPFSQ RKNMYEILFE AYQAPKVTFG IDSLFSFYAN
STSSTASGLV IGTGHESTHV IPVLHGKGIL SQTKRIDFGG HQAEQFLGKL LSLKYPYFPS
KLNAHHTSNL FRDFCYVSKD YQEEIDHILD MDKLEEADII VQAPVEINVG TEKKKSEEEL
ARQAAKRREQ GKRLQEQAQQ KRLEKLIQKQ EEWDYYSKFR EESEKLNKSE LQARLETDGF
DDLADFNKYM SGLEKSLKKA HDQDIGEGDN HEVDPASAWP LLDTPDDQLT EEQIKEKRKQ
RLHKANYDAR ERSKELKRQQ EEEKAQYERE QQEWREKDLE DWCNVKRIHL AGLISKYKES
IKLLESFKDR KSAAAQQRMK NIADLANDES GSTSAASRKR RRNANSTIDN DPNDTFGAND
DDWAVYRDIS NQKIEEELGE TNQEILSLEE ELLKFDPNFH HEDTFAASQT FDWRNSVLHK
FIHGPRQNIT IAMQAEGINP DEIDNHPEII RKNHQIHVNV ERIRVPEILF QPHIAGLDQA
GISEISSDLL NRSFGSSFYE GGDSLNLIRD VFVTGGLAHL PNFTTRVTND FTSFLPVGAP
IRVRTARDPI GDSWRGMQKW ASSEECENSY ISKADYEEYG PEYIKEHGLG NVSLR