Gene Information |
Locus tag | PICST_34117 |
Symbol | |
ID | 4850971 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009068 |
Strand | + |
Start bp | 592605 |
End bp | 594725 |
Gene Length | 2121 bp |
Protein Length | 516 aa |
Translation table | |
GC content | 42% |
IMG OID | 640392679 |
Product | predicted protein |
Protein accession | XP_001387344 |
Protein GI | 126273932 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
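The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below assumes 1-based, inclusive coordinates and that the reported span runs from the start codon through the stop codon. The function names are illustrative and do not belong to any database API.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def coding_length(protein_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode a protein, optionally counting the stop codon."""
    return 3 * (protein_aa + (1 if include_stop else 0))


# Values taken from the record: Start bp, End bp, Protein Length.
gene_len = span_length(592605, 594725)
cds_len = coding_length(516)
non_coding = gene_len - cds_len  # bases in the span not accounted for by codons

print(gene_len, cds_len, non_coding)  # 2121 1551 570
```

Under these assumptions the span matches the reported 2121 bp, and the 570 bp not used by codons is consistent with the record marking the gene as spliced.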
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.00012281 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.103056 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAGGTAA ATAATATTGA ACTCATCAGG AATTGCGCTT CCTGCAGGAG GCTACATCTG CTTTGGTCTT TCCTTCTTAA CTATGCTATT TCAGGTTGAT GATGAAATTA CAAACAAAGG CTAAACACGG TTTGTGGAAT TCAGTCGTTA AAGCGAAATA TCCGAGTCTA ACAGAAGCAC TGGCAGAATT TTTGACTTTG TATTTTCCCT TTTTTACCAA TTGCTACTTC CTTCTATCTG CGTCTGTCTC TGTAGCGCTA ATAACCATTC TTCACTGTTT GTGGTCTTCA ATCCAATAGC CACAATATTC AAGAGTCAGA AGATAGTCAA GAAGAATCGA GTTGTTTTTT CTTGAATTTC ATAAGATTTC ACAAGAAGGC CATTTCAAAG AATTTTAATA CTACTTTTCC TCAATATAAA GCTATCATCA CTAGCCTAGA GTTGAAGTTA GACAGAGAAG AGTATAGAAT ACTCAACACT ACAATTGTCA ACTTATAATT CAGTATGAGC TACAACAATA AGAAGTTCGA TCCTCCTTCC TACCCTCCTC CGTCAGCAAA CAACTACGAC CCTCAGAACC AACAGGGTTA TGGAGGGTAC CAACAGCCTC AATACTCGCA GGAGCCTGAA GAGGAATTGT ATGATCAAAC CGACTTGGAA AATGGCCAAT TCCAGAATTA CAGCGAGAAG CCAGTGAGTT CCGAAAACTT TGAGGAAAGT TTTAAGATAG AAAAGCCAAA ATGGAATGAC TGGCCATTCA CACTTTTCTT CCTCGCTGTC GTTGTTGGAT TCGTAGCTGT AGCAGTAATT ACTATCAATG CCTTGAGAGC TAGGTTCGGT TTTGAAGGAA CAGGCATCTA TGGCTCGTCC AATACCTTCA GTTTGAATAC GAATACGATC GTCTTGTTCG CTTTTGTCAT TGTCGTGGGG TTGGTATTGT CTACTTTGAT TATGGTATAC GCTCGTATGG CTCCCCGGAT TTTTATCACT ACTGGTTTGA TCTTGAATGT GGTATTGGGC ATCGGTACAG CCATTTATTA TTTCGTAGTT CACTATTACT CTGCTGCCAT TGTGTTCTTG GTGTTCTCGC TCTTCTCGGC CTACTGCTAC TGGAGTTGCC GTAGCAGGAT CCCGTTCAGT GCGACCGTTC TTGAGATTAC CATTGATGTC ATGAAAAGAT ACCCCTCAAC CTTGGTAGTT TCCCTCATTG GTATAATTGC TTCGGCTGCT TTCAGTGCTT TGTTTAGTAT TGTAATTGTC GCTACTTATG TTAAGTTTGA TCCTAACCCA AACAACGAAG GTTGTTCTGT TGGTGGAGGC AACTGTTCGC AAGCCAAGTT GGTAGGTGTT CTTGTGTTTG TGTTCTTTGC TGGATTCTAC ATATCTGAAG TTTTCAGAAA TGTCATCCAT GTTGTCATAG CTGGTATTTA TGGTACCTGG TATTACTTGG CTGGATCCGA TCAGGGTGCC CCAAGAGTTC CAGCTTTAGG TGCTTTGAAG AGAGCCTTGA CTTACTGTTT TGGTTCTATT TGTTTCGGTT CTTTGATCGT AGCTTTTATC CAACTTCTCA GGGCATTTAT CCAAGCTCTT AGACAGAATG CCTTAGCTGG TGGTGACAAC TGCGCCTTCT GTGCTCTCTG TATTCTTGAT TTGATCGTTG GTTTCATCGA CTGGATGGTC CGTTACTTCA ACCATTATGC TTACTGCTAC GTTGCTTTAT ACGGAAAGAG TTATCTCAGA TCAGCAAAGG ACACCTTCGA CTTGCTCCGT TATAAAGGTA TGGATGCTTT GATTAATGAC 
TGTTTCATTA ATACTGCATT GAATTTCTAT GCCTTGTTTG TTGCCTTCGT CACTGCTCTC TTGTCTTTCC TCTACTTGAG ATTCACTGAA CCAGATTACA ATGCTGACGG TAACTTCTAT GCGCCAGTTA TGGCGTTTGC CTTCTTAATC TCTGGACAGA TCACCCGTGT TGCTACTTCA GTCATCGAGT CTGGTATTTC TACATTCTTC GTCGCCTTGG CTAAGGACCC AGAAGTGTTC CAGATGACTA ACAGGAACAG ATTCGACGAG ATCTTCAGAA ACTACCCCCA GGTATTACAG AAGATCACCA GTGACCATTA G
|
Protein sequence | MQKFDPPSYP PPPQYSQEPE EELYDQTDLE NGQFQNYSEK PVSSENFEES FKIEKPKWND WPFTLFFLAV VVGFVAVAVI TINALRARFG FEGTGIYGSS NTFSLNTNTI VLFAFVIVVG LVLSTLIMVY ARMAPRIFIT TGLILNVVLG IGTAIYYFVV HYYSAAIVFL VFSLFSAYCY WSCRSRIPFS ATVLEITIDV MKRYPSTLVV SLIGIIASAA FSALFSIVIV ATYVKFDPNP NNEGCSVGGG NCSQAKLVGV LVFVFFAGFY ISEVFRNVIH VVIAGIYGTW YYLAGSDQGA PRVPALGALK RALTYCFGSI CFGSLIVAFI QLLRAFIQAL RQNALAGGDN CAFCALCILD LIVGFIDWMV RYFNHYAYCY VALYGKSYLR SAKDTFDLLR YKGMDALIND CFINTALNFY ALFVAFVTAL LSFLYLRFTE PDYNADGNFY APVMAFAFLI SGQITRVATS VIESGISTFF VALAKDPEVF QMTNRNRFDE IFRNYPQVLQ KITSDH
|
| |
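The sequences above are printed in space-separated blocks of ten, so they need cleaning before any downstream use. The following is a minimal sketch of stripping that layout and recomputing GC content for comparison with the reported 42%. The helper names are hypothetical, and the database may use a different method (for example, different rounding or handling of ambiguous bases).

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and newlines from a block-formatted sequence and upper-case it."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C among A/C/G/T bases; other symbols are ignored."""
    counted = [base for base in seq if base in "ACGT"]
    if not counted:
        raise ValueError("sequence contains no A/C/G/T bases")
    gc = sum(1 for base in counted if base in "GC")
    return 100.0 * gc / len(counted)


# Demo on the first three blocks of the gene sequence from the record.
demo = clean_sequence("ATGCAGGTAA ATAATATTGA ACTCATCAGG")
print(len(demo), round(gc_percent(demo), 1))
```

Passing the full gene sequence through `clean_sequence` also gives a quick way to confirm that its length agrees with the Gene Length field.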