Gene PICST_65353 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_65353 
SymbolARP4 
ID4838049 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009043 
Strand
Start bp34821 
End bp36360 
Gene Length1540 bp 
Protein Length494 aa 
Translation table12 
GC content46% 
IMG OID640389364 
Productactin-related protein 
Protein accessionXP_001383280 
Protein GI150864457 
COG category[Z] Cytoskeleton 
COG ID[COG5277] Actin and related proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
CAAACTGTTG CTATGTCCTC AACAGCTAAC AGCGCTTCAG TGTATGGTGG GGATGAGATC 
AACGCCATAG TGCTAGATCC AGGCTCGTAC CATACGCGGA TTGGGTACGC TGGAGATGAT
TTCCCTAAGG TTATAACTTC CTCCTACTAC GCTTCTGCGT CGAATGAGCC AATGGAAGCC
GAGAAGGAAG ACTCCAAAAT TGGTAGCAAG TCAACCAAGA GGATTTTTGG AGATGCCATC
GATGTTCCTC GGTCGAATTA CAACGTTCAT CCCATACTCA AAGATTCAGT AATTGTTGAC
TGGGATGCTG CTTTGGACCA ATACCACCAT TTCTTCAAGA ATGTAATGAA CGTCACGTAT
GAAGAGCAGC CAGTGTTGAT CACAGAGCCC GTATGGGCAG AGCCAAAGTA TCGTCAGACT
TTGGTGGAGA ATTTCTTTGA ATACTACGAT TTCCCAGCAT TATATTTGGC CAAGGCTCCA
TCTTGTGTCT CTTTCCAACA GGGTAGACCG AACTGTTTGG TGGTGGATAT TGGCCATGAC
TCTGTGAGTG TGACCCCTGT CATTGATGGT ATATGCATGA TGAAGAATAC CATGCGAACG
CATTATGCTG GTCAGTTTTT GGTGGATCAA GTCCAAGACC ATCTAGCCAA GTACAAAGAT
TTATCTGTAG AGGGTACTTA CAAAATCAAG TCAAAGACAC CTACAGTATA CCCTGAAAAT
GCAGAGTTCA CCACAAGAAC GCTTCCTGAA GATATCACGG CGTCGTATGA TGAGTACCAG
AAACTGAAAA TCTGGCACGA GTTCAGAGAA ACTATGCTAG AGGTCCCAGA GCGCAAACTA
GCCAATAACA ACATGCAGCA GCTGGCCACC ATGAAGGAGT TCTACACTCT GGATGCCAAT
ACCAGATTGT TTGAATTCCC TACTGGACAG TCGTTACTGT TGAACTATGA TAGGTTTGTG
TTTGCGGATT CGATCTTTGA TCCTTCTATC TATAAATTTG CCAACCAAGA GTTGACCAGC
AAGTATCCCC CCAACAACGG AGTTCTTTCG ATTAAGAGTA AGTATGACGA CTACAGACCA
CTAAAGAGAG TGCGCAAGGC AGAGTCTAAC CAGTCCACGC CTCCGCCGGG TGACAGTCCT
ACCAAGCCCA GCAAGAACGG CAAGCACGAA GTCCGAGGCT TGTCGCAGTT GATCACTCAT
ACGTTGTCAA CCATTGACAT TGATCTACGC ACCTCAGTCG CACACAACAT TATTGTGACT
GGTGGGGTTT CGTTGGTGCC TCAATTGACG GAAAGATTGT ACAACGAGTT GACCAACACC
AATCCAGGGC TCAAAATCAG GTTACACGCT GTGGGAAATT CAACAGAAAG GTTGAACCAG
GCATGGATCG GAGGCAGCGT TCTAGCATCG TTGGGAACGT TCCACCAGAT GTGGGTCAGC
AAACATGAGT ACGAAGAGGC AGGGGCTGAA AGAATCTTGA ACCAGAGATT TAGATGAACT
GTATGTATAG TAGATGTAAA ATCAATTTAA AAGTACCTTT
 
Protein sequence
MSSTANSASV YGGDEINAIV LDPGSYHTRI GYAGDDFPKV ITSSYYASAS NEPMEAEKED 
SKIGSKSTKR IFGDAIDVPR SNYNVHPILK DSVIVDWDAA LDQYHHFFKN VMNVTYEEQP
VLITEPVWAE PKYRQTLVEN FFEYYDFPAL YLAKAPSCVS FQQGRPNCLV VDIGHDSVSV
TPVIDGICMM KNTMRTHYAG QFLVDQVQDH LAKYKDLSVE GTYKIKSKTP TVYPENAEFT
TRTLPEDITA SYDEYQKSKI WHEFRETMLE VPERKLANNN MQQSATMKEF YTSDANTRLF
EFPTGQSLSL NYDRFVFADS IFDPSIYKFA NQELTSKYPP NNGVLSIKSK YDDYRPLKRV
RKAESNQSTP PPGDSPTKPS KNGKHEVRGL SQLITHTLST IDIDLRTSVA HNIIVTGGVS
LVPQLTERLY NELTNTNPGL KIRLHAVGNS TERLNQAWIG GSVLASLGTF HQMWVSKHEY
EEAGAERILN QRFR