Gene Information |
Locus tag | PICST_72172 |
Symbol | |
ID | 4838370 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009044 |
Strand | - |
Start bp | 1007798 |
End bp | 1008978 |
Gene Length | 1181 bp |
Protein Length | 275 aa |
Translation table | 12 |
GC content | 40% |
IMG OID | 640389685 |
Product | predicted protein |
Protein accession | XP_001384501 |
Protein GI | 126135954 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
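The coordinate fields above can be cross-checked with simple arithmetic. This is a minimal sketch, assuming that Start bp and End bp are 1-based and inclusive; that assumption is not stated in the record, but it agrees with the reported Gene Length of 1181 bp. The function name `gene_length` is hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    # Both endpoints are counted, hence the +1.
    return end_bp - start_bp + 1


# Values taken from the Start bp and End bp fields of this record.
print(gene_length(1007798, 1008978))  # 1181
```

The strand field does not affect this calculation, since the coordinates are given on the replicon regardless of orientation.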
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.143035 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GGAGAATGGT AGACCGTTAA ATAATTAACT AATCATGCTT GCGTGTAACT TTAGCACTTG GACAAGGAGG GCTACTTGAT TTTTGGAATT ATCGCCACCA TATCTTTCAA ATCCTATATA AGGAGATCAG TTTCCCGCTT GCAAGTAATT GTTTCTAACA TTTTCTCAAC ATCAGAACAT TGATTTTTCA ACCGGCTTAA GCACTCATAT CTCCACCCCA CATAAAATTT CTATACATAC TAGCAAAATG GTCTACCCTT CAAATTTCAA AGTTGTCAGC AGAAAATTGA ACCCTTCCAC TGTGGTTGCA TCTGCACCTT TCACCAGGGC CAACAAGTTT AACTTCGGTG CTCGTATGGC TGTGTTCAAC TACGATAACC AAGTCATCGT CTGGTCCGCT TTACCGTATG GTACGGAGGT CAAGAAGACG TTGGAATTGT TGACCGGAAG TGATGCTAAG CCAAATATTA CTCACTTGAT AATTCCAGAT CGTGAACACA CAATTGCGGC AAAGTCTTTC AAGGAAGAGT ATCCAGAGTT GAAGATTATT GCCATGGATA CCGTATCTAT TCCTGGTGTT GAAATCGACT ACAAATTTTC TGCCAAAGAG GGTAACAAAT TGATTGACAG AAAGGTTCTT GAGGAGGAAA TTGGCATCAA GGAATCAGTT ATCTTAGACA ATTTCGAGTT CGTCTATTTA CCATTGCATG CTAATCAAGA ACTAGTAACC TTTGACAAGA AGGCCAAGGT TGTGTATGAA GCCGATTTGC TTTTCAACTT GGGTGTTCCA GGTACTACTA GCGGTAAGGT CACGTTAGAG CAGTATTCTC CAGAAACTGG ATACAAGCAA GGATTCAACC CTCACTCTGG TTGGTCTTTC TTAACTAGAT ACATGCAACC ACACAGTAAG GTTGGTACAT TTCTTTTCAA TAAACTTGTC CAGACTGCCA AGAGTAAGGA TGGTCTTAAA GCTATCTACA ACTGGGACTT CGACACCATT GTGCCTTGTC ATGGTAACTT GATTGAAAAA GATGCCAAGG CCTCCTTCAA GAGTGCATTC CACGGAGCAT TTTAGAGGAG AAAATAACTA CAGAGCGGCG GTGGGCAATT AATTGTCCTA ATTTTGTATT TCTTACTTTA ATATGTTATG CTTAATTCAA ATTTAATGTT TTATTACTTT G
|
Protein sequence | MVYPSNFKVV SRKLNPSTVV ASAPFTRANK FNFGARMAVF NYDNQVIVWS ALPYGTEVKK TLELLTGSDA KPNITHLIIP DREHTIAAKS FKEEYPELKI IAMDTVSIPG VEIDYKFSAK EGNKLIDRKV LEEEIGIKES VILDNFEFVY LPLHANQELV TFDKKAKVVY EADLLFNLGV PGTTSGKVTL EQYSPETGYK QGFNPHSGWS FLTRYMQPHS KVGTFLFNKL VQTAKSKDGL KAIYNWDFDT IVPCHGNLIE KDAKASFKSA FHGAF
|
| |
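The sequences above are printed in space-separated groups of ten characters, so they need to be cleaned before their lengths or base composition can be compared with the Gene Length, Protein Length, and GC content fields. The following is a minimal sketch, assuming the grouping whitespace is the only formatting to remove; the helper names `clean_sequence` and `gc_percent` are hypothetical, and this is not necessarily how the database derived its GC content value.

```python
def clean_sequence(grouped: str) -> str:
    """Remove the grouping whitespace and normalise to upper case."""
    return "".join(grouped.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA string (whitespace is ignored)."""
    seq = clean_sequence(dna)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First two groups of the gene sequence shown above, as a small example.
first_groups = "GGAGAATGGT AGACCGTTAA"
print(len(clean_sequence(first_groups)))  # 20
```

Passing the full Gene sequence or Protein sequence value to `clean_sequence` and taking `len` of the result gives a figure that can be compared with the corresponding length field in the Gene Information table.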