Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_48682 |
Symbol | ARP7 |
ID | 4840048 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009046 |
Strand | + |
Start bp | 760977 |
End bp | 762266 |
Gene Length | 1290 bp |
Protein Length | 429 aa |
Translation table | 12 |
GC content | 45% |
IMG OID | 640391363 |
Product | general RNA polymerase II transcription factor |
Protein accession | XP_001385505 |
Protein GI | 150866037 |
COG category | [Z] Cytoskeleton |
COG ID | [COG5277] Actin and related proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.0296051 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.933986 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTTACA CTTCTCCTGC TGTTGTGATT GACAACGGCT CGTACACTAC GAAAGCCGGG TTTGCTCTGG AGGACTTACC GTCACTAGTG TTTAGCACCA ACTATGCGGT AGACAACAAG ACCGGCAGCG TAATTGTAGG AGACGACGAG ATCTGTGCCC AACCGGAAAA CGAAGTCATG ACACTTCTTG ACAACGGCCT TATCTACAAC TTTGACAATA TTGTGCACAA CTGGCAGTAT GTGTATGACA ATATAGACAA CCACAATGCC ATAGATGCCA AAGAGTTTCC TCTTGTCTTG ACAGAACAGT CATGGAACAC CTCCAAAAAC AGATTGACCG CCACCCAAAT AGCGTTCGAG ACATTGGAAG TGCCCATCTT CTCACTTGTG AAAACACCCA TTGCTCAGTT GTACAGAGCT GGCAGATCTA CTGGTCTTGT AATCGATGTA GGAGCTTCTG TCACTAGTGT AACTCCCATT TTGGACGGTA TAATCCAGCA CAAGTCGTGT TTCCATCTGA AATATGCCGG CAACTTTGTC AATCTTCATG TATTGGACTA TTTACAGCTG CAGCTGAAGC AAGTCGTCAA TAATTTGTTG CCCAAGCAGT ACCACGGAGG ATCTGATTCA TTCAAGACCT ACTACATCAG TCACAATGTT CTTCAGGACT ACAAGAGCTT GGCCTTGAAC TACCAGCTCA GAAATTACCA GTTACCAAAC AACACTCACA TTCCTGTAGG CGACAGTACT AACTTCTTGG AGAGTTTGTT TCAGCCCACA TTACGTAAGT TGCCAGATGT AGTTATTCCA GAACCGGTTG TGGACAAGCC CCACACCCAT GGCTTGACAA ACTTGATCTT CTTGTGCTTG AAGAGCTTGG AGGCGTCATT ATTACCTCCC ACAAACGACT CATCGTCACA CAACAAGTTG GCCAAGTTCA CAGAGATATT CAAGGAGTTG CTTTCCAACA TATTGATCAC TGGAGGCACT TCCAACGGGT CTGGTTTGCC AGAGTCCATC ATCAACGACA TCAGGGCCAT GACCCAACAA TACTACTCCA ACTATCCATT CTCATATTCC ATCTATCCTA TCAGGCACAG TACAGGGGAC TCCAACGAAA CATGGGACAG ACAGTTTGGT GCCTGGATGG GAGCTTGTAA TTTGGCCAGT ATGTTGAACG ATAGCAACGA GCAGTCCAAC AGTGTCAAGA TTGCATTGGA TAATTGGTTT GTCACAAAAG CTGATTATGA GGAGTTGGGT GAGGATTTGA TTGTTGAAAA ATTCAAGTAG
|
Protein sequence | MAYTSPAVVI DNGSYTTKAG FASEDLPSLV FSTNYAVDNK TGSVIVGDDE ICAQPENEVM TLLDNGLIYN FDNIVHNWQY VYDNIDNHNA IDAKEFPLVL TEQSWNTSKN RLTATQIAFE TLEVPIFSLV KTPIAQLYRA GRSTGLVIDV GASVTSVTPI LDGIIQHKSC FHSKYAGNFV NLHVLDYLQS QSKQVVNNLL PKQYHGGSDS FKTYYISHNV LQDYKSLALN YQLRNYQLPN NTHIPVGDST NFLESLFQPT LRKLPDVVIP EPVVDKPHTH GLTNLIFLCL KSLEASLLPP TNDSSSHNKL AKFTEIFKEL LSNILITGGT SNGSGLPESI INDIRAMTQQ YYSNYPFSYS IYPIRHSTGD SNETWDRQFG AWMGACNLAS MLNDSNEQSN SVKIALDNWF VTKADYEELG EDLIVEKFK
|
| |