Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_28889 |
Symbol | |
ID | 4851629 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009068 |
Strand | - |
Start bp | 2382655 |
End bp | 2384220 |
Gene Length | 1566 bp |
Protein Length | 521 aa |
Translation table | |
GC content | 43% |
IMG OID | 640393337 |
Product | predicted protein |
Protein accession | XP_001387029 |
Protein GI | 126275093 |
COG category | [R] General function prediction only |
COG ID | [COG1161] Predicted GTPases |
TIGRFAM ID | [TIGR00231] small GTP-binding protein domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.691875 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.346364 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGAGTGA AGAAGCCAAC ATCTAAAAGA GTCTCGACTC GTATGAGAGA AGGGATCAAG AAGAAGGCTG CTGCTCAACA GAGAAAAAAC AGAAAGCATG CTAAGAAGGA CGTCACCTGG AAATCTAGAA ACAAGAAGGA TCCAGGAATT CCAGCCTCTT TCCCTTACAA GGACAAGATT ATCAGCGAAT TGGAAGACAA CAGAAGAATA GAAAAGGAAA AGAGAGAAGC CTTGAAGTTG CAGAGACAGA TGGAAAGAGA GGCTGCCCTT GCCAGAGGAG AAATTGTAGA TGATGACGAG ATGGACCAAG ACGAAGACGA AGGGGAAGAA GGAGGGTTGG CTGCACTTTT GGAGTCTGCT CAAAGCGCTG CTAAGGCTTA TGATGGAGAA GATGATGTTG AAGACGATAA TGGTATGGAT GAAGACGATG ATGAAGAATA CGAAATTGAA ATCGAACAAG ACGAAGATGA CGAAGACAAC TCTGAGGTGG AAAAATCTCG TAAGGCCTTT GACAAGATCT TCAAAACTGT CGTAGAAGCC TCAGATGTCA TCTTATATGT CTTGGATGCC AGAGATCCAG AAGCTACTCG TTCCAAGAAG GTTGAACAAG CCGTCTTACA GAACCCAGGT AAGAGATTGA TTCTTGTATT GAACAAGGTT GATTTAATTC CAACGGAAGC TTTGAACCAG TGGTTGAACT TCCTTAAGTC TTCTTTTCCA ACAGTTCCAG TCAAAGCATC TCCAGGAGCT TCTAACTCTA CAACCTTCAA TAAGAACTTG ACTTCTACTA TGACAGCCAA CTCCCTTTTG CAAGCATTAA AGAGCTATGC AGCCAAGTCT AACTTGAAGA GATCCATAGT TGTAGGTGTT ATTGGATATC CTAACGTTGG TAAGTCTTCC ATTATTAATG CATTAACTAA CAGGCATGGC AACAACTCCA AGGCATGTCC AGTTGGTAAC CAGGCAGGTG TTACTACCTC TATGAGACAA GTCAAGATCG ACAACAAGTT GAAGATCTTG GACTCCCCCG GTATAGTTTT CCCTGACGAA GTAGCTAACT CCAAGAAGTT GTCCAAGAGT CAGCAAGAAG CTAAATTAGC ATTGCTATCT GCCATTCCAC CAAAGCAAAT AATAGACCCA GTTCCAGCCA TTCAGATGCT TCTTAAGAAG CTTTCCAAAG ACAATGAAAT GGCTGAAGGC TTAAAAAATT ATTACCAGTT GCCCGCTTTA CCCTCTGCGG ATTTGAATGA GTTCACGAAG CAATTTTTGA TTCACGTGGC TAGATCCAGA GGAAGATTAG GCAAGGGTGG TATTCCTAAC TTGGAGTCCG CTGGTATGGC TGTTCTTAAC GATTGGAGAG ATGGTAGAAT CATTGGTTGG ACCTTGCCAA AGGCTTCGAA GAGTGCCAGT GAAGCTGCTG ATGCTGCTAA TATTGACGGC CCAAAGAGTT CATTGAGAGG AGAAAAAGAA CCTCCCAAGG TTGAACAGAC CACTGTTGTT ACCGCATGGG CCAAGGAATT CGACCTTGAT GGTTTGTTGG GTGACAATTT CGGTTTACAA AACTAA
|
Protein sequence | MRVKKPTSKR VSTRMREGIK KKAAAQQRKN RKHAKKDVTW KSRNKKDPGI PASFPYKDKI ISELEDNRRI EKEKREALKL QRQMEREAAL ARGEIVDDDE MDQDEDEGEE GGLAALLESA QSAAKAYDGE DDVEDDNGMD EDDDEEYEIE IEQDEDDEDN SEVEKSRKAF DKIFKTVVEA SDVILYVLDA RDPEATRSKK VEQAVLQNPG KRLILVLNKV DLIPTEALNQ WLNFLKSSFP TVPVKASPGA SNSTTFNKNL TSTMTANSLL QALKSYAAKS NLKRSIVVGV IGYPNVGKSS IINALTNRHG NNSKACPVGN QAGVTTSMRQ VKIDNKLKIL DSPGIVFPDE VANSKKLSKS QQEAKLALLS AIPPKQIIDP VPAIQMLLKK LSKDNEMAEG LKNYYQLPAL PSADLNEFTK QFLIHVARSR GRLGKGGIPN LESAGMAVLN DWRDGRIIGW TLPKASKSAS EAADAANIDG PKSSLRGEKE PPKVEQTTVV TAWAKEFDLD GLLGDNFGLQ N
|
| |