Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_65566 |
Symbol | |
ID | 4838967 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009044 |
Strand | + |
Start bp | 577096 |
End bp | 580307 |
Gene Length | 3212 bp |
Protein Length | 734 aa |
Translation table | 12 |
GC content | 45% |
IMG OID | 640390282 |
Product | predicted protein |
Protein accession | XP_001384073 |
Protein GI | 150865023 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.445286 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TGTTTTCATC ATCCAACTCC ACCATTTTTT GGTTCACCAG ACAAGTTCCT CCTACAAACA CCTTCACGAC ACAGCATCCA CCCGCAAATA CTCACACCAT CCTACACTAT AAATCCGCTT TCTGACTCGC CGCCATGAGT CACCACAGTA CTCCTCATTT GTACACCAAG CGAACTCCAG GATCGTTGGA AAACCCTGGT TCCAACCCCT TGGCACAGGC TCTTGCCACT AATTCCTCCT CAAAACAGCA ACAGCAACAG CAACAACAGC CGTCACAACA GAATATCTCT TCATCTGTTT CTGCTGCCAA TGCTGCCAAT GCCGTTGCTT CTGGTAAACC TACCATGGAT TCTTCTCGTT CAGTGTCAGA CTCTACGGCT ACCAACGCCA AACAACTCTT AAACGCTTAT GTTTACGATT TCCTCGTCAA GTCACGTTTG CCGAACACGG CAAGGATATT TGTCAATGAG GCAGAAGTTC CCTCCGTCCA AAGCAGTGCT ATTGTTGCTG GCTCCAAGCT AGGCTCCCAC CAGCTGTCAC AAAAGAACCT GCCCCAGATT AACCTGGGCG CCAACACCAA CACAAACACC AACACAAACA CACCACAGAC GCCAAACCTG TCGTACCAGC AATTCCAGAA GGAAAACAAC TTGCCCAACT TGCTGGTCGC TGTCGATGCA CCCCAGGGAT TTCTCTTTGA GTGGTGGCAG GTATTCTGGG ACGTCTTCCA GGCAAAAAAC TCGTTTTCTC CAACTTCTGG ATTCAAGCCA AACAACATCA ACATCAATGC TGCCAATGCC GCATCGCAAA ATATGGCTTT CCAGTACTAC CAGCTCCAGC TCATGAAACA AAGACAACAA CAAGAAATAG GTCTTTCTAC CAATGGACAA CCGATGATGT TTGCACCTAA TGGTGGCAAT GCTAGTATGG CTGGAAATGT CAATATGGCT GGCAATACGA ATGGCAATCC TATGCTTCAG CAACAACTCA TGTTTCCGCA ACAACAACTA CAAAACCTAC AACATCCACA GAATCAGCAA CCTCCTCGTT CACTACAACA GCAACAGATT CCTCAACAGG CCCAACCGGG TACTGGTACG GGAGCTCAAT CACAGGGTCC AGTACAAGCT GGACAGCCAC AGTTGCTTCC GCCAGGATCA CAAGGAGTAC CACAGACAAT TGAACAGCAA CAGCAACAGC GACTTGTAAT GCAAATGATG ATGAAACAGC AGCAACAGGC TCAGCAAGCT CAACAACAGC AGCATCCTTT ACAGCTTCCT ATGGGTGTTA ACCCCATGGA TCCTCTGCAA CAGCAACAAC AGTTATTGGC AGGTATGGGA CCCAACAACA ATAACAACAG CACCAATAAC AATAACAACA ATAATAATAT GAACTTACAA CAGCAATTAT TCTTATCACA GCAACAACAA CAGAATCAGA GTAGAATTCA GCAACATGCT CAGAATCAGA TGAATAGTTT CAGACAACAG GCAGCAGCAG CTCAACAGGC ACAACAACAA CATGATGGTT CTAACAATAA CTCTCCAGCT AATGGACATA GATTGAGCCA GATGACACCG CAATCGGATC CTGCCCAGGC TCCACCAATG AATGGAATGA ATTTTTCACA ACAGCCAAAC ATGATAAACG GGCAAGTACC ACCTCCCTTT GTCCAGCAGC AACAACAGCA ACAGCAGCAG ATGCGTATGA ATAGTGTTAG CAACGGCAAG AGTGTAGGCA CCAAGCAGGG AAGTCCCATC ATGCTTGCCG GAGCTGTGGG ACCACAACTG GGCAACAGGA ATGGAAATGT TAGTGTTCCA GGTAATGTAA ACAACAGCAA TCCCAATCGT AACATGAATG CTTTACAAGA CTATCAGATG CAGTTGATGT TGTTGGAAAA GCAGAACAAG AAACGTCTTG ATATTGCCAG AAACAGCGGC GACGTCAACT TGCTTAGTTC CGGACTTATA GGACAAGCAC AAGCACAACA ACAAGCTGGA CCTGGACCAC AGCAACAAGG TCAACAAGCC CAGCAGCAGC AGCAGCAGCA GCAGCAGCAG CAGCAGCAGC AGCCGCAGAA TGCTCAAACT AATCGTCTGT TTACCCAGAA GCCATCACCA GCAACTGCTA GCAGTTCACC TGTGCTACAC AATAAACCTT CTCCACAGTC TACAACTGCA AAAAGAAAGA AGGAAACTAC TGGTAAGCGA GGTAGAAAGG CAAGTGCAGC GGGACTCAGT GGAAGCACTC CAGCGATGAA CCCAGCTAAC ACGCCTAGTT TACTCAAGAA AGAGTACACC ACACCTCTTA CGCCAGCTTC AGAACCAGCT AGTGACCCCA AGAGAAAACG AAAGAGCACA ACTGGCAACA CTGATTCGCC CAAGAAGCAA GCAACTGCCA AAACCACGGC AACAGCAGCT TCTACGACCT CAGCGGTAGC AGAAGCCAAA AAGGAAAAGC CGATTAAGGA AGAAGAAGCC CCTGTTCCGG AGACCAAGAA GAAAGAAGAG TCGGAAATGC CTCCTCCTAC ATCTTTTTCG GATCCTCTTG GACAGTCCGA CCAAATATTC TCTGTAGAGT TGCTTGGAAA CGGTAGCACC GATTCACAAA ATTTCTTTGG CGCAAACGCC CAGGGCAACC AGGGTGGTAT TGACGACATT GACTTTGATT TCGGTCAGTT CTTGGAGAGC AACGGCGATG GCATTAATGA TGGCATCGGT GGCTTCAACT GGGGCAACGT GGATGCAATT GAAAACGGCG AGTAGCACTG AAACACACCA CTTCTGAAGC TTTTCTTGCA ACATGAATAT CAGGAACATT TTCAGAAAAA GAAATCATGA ATATCATGGA TTCTTCATCT AAGAAAATTA TATTATAGTT GAAAGCTTAA GAATTTACAT TAAAGCTGGA GTTTAAGAAG CCACAAATAT CCTGGAAACA CTTCTTTAGA GATTGAAACA TTTCTCAGAT TATCAACATG ACATTCTTCA CTATTAGGGC TCATCCAGAC TGTGTCAAAT GCCTCTAATA TTTTATTCTC ATTGGTTTGT TTTATTTATT CCTTTATTCT TATTTCTCGA CATATACCTC TTGAGAGTGA CGTTTTCAGG TTCTTTATTT GTATTGTACT ATTATTCGTT GTTATTGTCG GCTCATTAGT CTATATATAC TATGCGATCT TCATATTAAT AACATTCCTC GGACTTCGTT AA
|
Protein sequence | MSHHSTPHLY TKRTPGSLEN PGSNPLAQAL ATNSSSKQQQ QQQQQPSQQN ISSSVSAANA ANAVASGKPT MDSSRSVSDS TATNAKQLLN AYVYDFLVKS RLPNTARIFV NEAEVPSVQS SAIVAGSKLG SHQSSQKNSP QINSGANTNT NTNTNTPQTP NSSYQQFQKE NNLPNLSVAV DAPQGFLFEW WQYYQLQLMK QRQQQEIGLS TNGQPMMFAP NGGNARSQGV PQTIEQQQQQ RLVMQMMMKQ QQQAQQAQQQ QHPLQLPMGV NPMDPSQQQQ QLLAGMGPNN NNNSTNNNNN NNNMNLQQQL FLSQQQQQNQ SRIQQHAQNQ MNSFRQQAAA AQQAQQQHDG SNNNSPANGH RLSQMTPQSD PAQAPPMNGM NFSQQPNMIN GQVPPPFVQQ QQQQQQQMRM NSVSNGKSVG TKQGTVGPQS GNRNGNVSVP GNVNNSNPNR NMNALQDYQM QLMLLEKQNK KRLDIARNSG DVNLLSSGLI GQAQAQQQAG PGPQQQGQQA QQQQQQQQQQ QQQQPQNAQT NPTASSSPVL HNKPSPQSTT AKRKKETTGK RGRKASAAGL SGSTPAMNPA NTPSLLKKEY TTPLTPASEP ASDPKRKRKS TTGNTDSPKK QATAKTTATA ASTTSAEEEA PVPETKKKEE SEMPPPTSFS DPLGQSDQIF SVELLGNGST DSQNFFGANA QGNQGGIDDI DFDFGQFLES NGDGINDGIG GFNWGNVDAI ENGE
|
| |