Gene PICST_48682 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_48682 
SymbolARP7 
ID4840048 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009046 
Strand
Start bp760977 
End bp762266 
Gene Length1290 bp 
Protein Length429 aa 
Translation table12 
GC content45% 
IMG OID640391363 
Productgeneral RNA polymerase II transcription factor 
Protein accessionXP_001385505 
Protein GI150866037 
COG category[Z] Cytoskeleton 
COG ID[COG5277] Actin and related proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0296051 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.933986 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTTACA CTTCTCCTGC TGTTGTGATT GACAACGGCT CGTACACTAC GAAAGCCGGG 
TTTGCTCTGG AGGACTTACC GTCACTAGTG TTTAGCACCA ACTATGCGGT AGACAACAAG
ACCGGCAGCG TAATTGTAGG AGACGACGAG ATCTGTGCCC AACCGGAAAA CGAAGTCATG
ACACTTCTTG ACAACGGCCT TATCTACAAC TTTGACAATA TTGTGCACAA CTGGCAGTAT
GTGTATGACA ATATAGACAA CCACAATGCC ATAGATGCCA AAGAGTTTCC TCTTGTCTTG
ACAGAACAGT CATGGAACAC CTCCAAAAAC AGATTGACCG CCACCCAAAT AGCGTTCGAG
ACATTGGAAG TGCCCATCTT CTCACTTGTG AAAACACCCA TTGCTCAGTT GTACAGAGCT
GGCAGATCTA CTGGTCTTGT AATCGATGTA GGAGCTTCTG TCACTAGTGT AACTCCCATT
TTGGACGGTA TAATCCAGCA CAAGTCGTGT TTCCATCTGA AATATGCCGG CAACTTTGTC
AATCTTCATG TATTGGACTA TTTACAGCTG CAGCTGAAGC AAGTCGTCAA TAATTTGTTG
CCCAAGCAGT ACCACGGAGG ATCTGATTCA TTCAAGACCT ACTACATCAG TCACAATGTT
CTTCAGGACT ACAAGAGCTT GGCCTTGAAC TACCAGCTCA GAAATTACCA GTTACCAAAC
AACACTCACA TTCCTGTAGG CGACAGTACT AACTTCTTGG AGAGTTTGTT TCAGCCCACA
TTACGTAAGT TGCCAGATGT AGTTATTCCA GAACCGGTTG TGGACAAGCC CCACACCCAT
GGCTTGACAA ACTTGATCTT CTTGTGCTTG AAGAGCTTGG AGGCGTCATT ATTACCTCCC
ACAAACGACT CATCGTCACA CAACAAGTTG GCCAAGTTCA CAGAGATATT CAAGGAGTTG
CTTTCCAACA TATTGATCAC TGGAGGCACT TCCAACGGGT CTGGTTTGCC AGAGTCCATC
ATCAACGACA TCAGGGCCAT GACCCAACAA TACTACTCCA ACTATCCATT CTCATATTCC
ATCTATCCTA TCAGGCACAG TACAGGGGAC TCCAACGAAA CATGGGACAG ACAGTTTGGT
GCCTGGATGG GAGCTTGTAA TTTGGCCAGT ATGTTGAACG ATAGCAACGA GCAGTCCAAC
AGTGTCAAGA TTGCATTGGA TAATTGGTTT GTCACAAAAG CTGATTATGA GGAGTTGGGT
GAGGATTTGA TTGTTGAAAA ATTCAAGTAG
 
Protein sequence
MAYTSPAVVI DNGSYTTKAG FASEDLPSLV FSTNYAVDNK TGSVIVGDDE ICAQPENEVM 
TLLDNGLIYN FDNIVHNWQY VYDNIDNHNA IDAKEFPLVL TEQSWNTSKN RLTATQIAFE
TLEVPIFSLV KTPIAQLYRA GRSTGLVIDV GASVTSVTPI LDGIIQHKSC FHSKYAGNFV
NLHVLDYLQS QSKQVVNNLL PKQYHGGSDS FKTYYISHNV LQDYKSLALN YQLRNYQLPN
NTHIPVGDST NFLESLFQPT LRKLPDVVIP EPVVDKPHTH GLTNLIFLCL KSLEASLLPP
TNDSSSHNKL AKFTEIFKEL LSNILITGGT SNGSGLPESI INDIRAMTQQ YYSNYPFSYS
IYPIRHSTGD SNETWDRQFG AWMGACNLAS MLNDSNEQSN SVKIALDNWF VTKADYEELG
EDLIVEKFK