Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_66654 |
Symbol | |
ID | 4851803 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009068 |
Strand | + |
Start bp | 2870728 |
End bp | 2874232 |
Gene Length | 3505 bp |
Protein Length | 860 aa |
Translation table | |
GC content | 43% |
IMG OID | 640393511 |
Product | predicted protein |
Protein accession | XP_001386884 |
Protein GI | 126275648 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG1893] Ketopantoate reductase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | CACAAGGTGT ATAAACGGCT TCTGCAACTA GCTCAGGTTC TCCCGAAGCC ACGTCTCAAC CACCGAAAAT CTGAAACAAG TCTAGACCTT CGTTCTATAC GACCGAGACC TGATTTGATT TCTGATAGAA ACTGATTCGC TTGATTTACT AGAATTTTAC TGGGTTTTCC ACATCTTCAA ACTGAAACCA ATATAGTAAT ACTAGAAGCT GCCGATCCAC GTTCCCAGAA CAGACACTCG CTAATATCGG ATCAGACTTA AATCTACAAG ACCGTCGCAC AAAGTTCACT GTAGTTGACT AAGAAGACTG AATCTTCAGA AAGGACCTTA AAGGATCAAA TAAGTTGAAT AAAACCATCT GGATTTGAAC CAGTTCATCA ACTCTCAAGG ATCCTCGACG GCTTGATATC AAAAACTCAT AAAACTTTCT ATTGTTAGTT GGACTGTGCG ATAAGATACT GAAATCATAC AAAACTCCAC TTTTTCAAAT TGCTGAAATT TCTCATATCA CTGAATACTT AATACCATCC AGTGCAATTC AAGTTTTCGG ATCTTTGAAT CTTTGAATCT TTGAATCTTC ACATTATCGA ATCTTCGAAA TACTGAATTC CTCAGTAGCC TCATTTTAAA ATTTTAACAA CCATAATGTC TGCCGTTCAG GTTCTTGCTG TTGGAGTTAA TCCCAACGTT GCATTTTATG CATGGCGACT CCATGAAACG AAATCGGTTG AGGTCACTGT GGTCAACTCA ACGTTGTCTA ACTCCACTAT CTACTGGACA TCTTCTTCGT TTGGAGACAA CATTGGCTAC AAACCGCAAC AGTGCTATCG CTCTGTGGCC GAAATACCTC GTAAATCGTC CGTGTCGTAC GACATTATTG TATTGAGTAG TCCATCTCTT CAAGATTTCC AGAAGCTCTG TTCTTCTTTT GCTCCCTACA TGAGAGACGA TTCTATCATC TTGATCGAAT CCACAGGTTT TGTCAACTTG GAGCCGTTTG TTCAGCTTTG CTTTCCCAAG AAGAGCAATT TGTGTATCAT GTCGATCATG AACGAAAGTG ACGTCAGACA AGTTGGAAAC AACAGGTACT TACACACTAT AAGAAACTCA GACACCAGAA TCTACTTTGG AACGTCTTTG GGTTCCAACA GGATCAACGC CAACCCAAAT TTTCAAAGAA TGTACAAACT CTTGCAAACA ATCCAGAAAG ATTCAGCCAA TAACATTAGT CTCTTGAAAT CTTTGAATCC GAAGGAGTTC ATGACTTACC AGTGGAAAAT GGCCATTCCT CGTATCATAT TCAACCCTTT GTCGATTATC TTCGAATTTG AATTCCCCCA GATGTTGGAG TCGCAGATTC TTTGTAAGCC ACTAGTGACA GGTATTATCA ACGAGTTGTT CAAGATCATC AAGAAAATGG ACTGTAAACT TGTGAAAGGA TTCGAGAATG AGGAGAACTT GTTGAAAAAC TGGACTTCTC TTTTTCCAAA AACATCCAAC AATAGCGAGT ACATTAACTC CAACAACTTG TTCTACAACT TCTACAAGAA CTATGACTTG GACGTTGACT TACTTCTCTT GCAGCCCATC TTATTAGCGG ACGATCACGG CATCAGAACT CCCTATTTGG AGAACTTGTA TTCGACTATG TGTCAGTACG CCAAGCTTAA TAATGGTAGT TCTCTTTTCT TTGCCAGAAA GGGAGGTGAC CAGGACAAGC TTGAGACAAT CGGCCAATTG GACGAAGAAA TCAAGAAGAG AACTGCCCAG AAGAGCTCAT TAGACTCCAC TATAAGAGAA TCTGAAAACA AGAAGGCTCA GCATGAAAAC GCTATTGTCG CATTACTGGT TCAGATCCAC GATAAGGAAA CTTATTTGGC CAGTTTGCTC TCAAAGCAAG AGCAGAAATT GAAACAGTTG GAAGGTCAGT ATAACCAAAA GTTCAAGGAC TTGGATGATC AGTATCAAAG ACAATACGAA CAGAACCAGC AGCGATTGCA GGAGAAGCCA TTGCCTGAGA GGAATGATTC TTCCAACCCC GTTAATGGAA ATAGCAGTGG TAATGTTCAC GGCAATGGCA ATGGTAACAG TAACAATAAC AACAATAATG GAGGGAGCGG TAATGGTAAC ACTAATGGCA ACAGCAATAG TAATGGCGTC GTTAATGGCC CCGTGAACCG CAACAGTTCT CCATCTGGCA CCGACAGCTT GACGGACTTG ACGGATATTG CTTTGTATGG AGCAGCTATC AACAACGAGT CGTCAAATCA ACAGCAATAC GCTCCATACG CCTCTGGAGA TGCCAATGGA AACGTCCAGT TGGTCAATGG AGATCTTCCC AAACACCTCT TGGACAAGGA AATGGAATTA AGAAGACGCG AGCAGGAGAT TCTCCGTCGC GAGTCTCTTG CTGGTGACCA TAACTACCAA CAGCAACAGC AGCAGCAGCT CTACGTCGAT ACCCAGTTTG GGCCACAACA GCAATATGCT CAACAACAGT ATGTTCAAAA GTATAACCAG TATAACAACG TTCCCAATGG ATATCAGCAG TCGCCCATAG ACCAGCAGCC TCCACATGGA TTACCACCCA ATGGAATGCC GCAGAATGCC CTTCCGCCTA ATTTGAGGTC TAACCTGATG ATGCTGACCA GATATCAGCA GCCCCCACAA CCCAATGGAT ACTACAATGC TCCTCCCAGA AGAGTCAGCT CTATTCCTAA CAACGTGAAT GGATACTTTG AAGGTGGATA CCAACCACAG CAGTATCCAC AACAGCAGCA GTATTATCCA CAACAGCAAT CTAGCTTCAA CAATGCTGCA CCCATTGATC CGATGCTTGA ACTGAGATTC AAGGCTAAGA AGCAAAATCG TCGTTCTCAG ATGCCTTTAG GTACCGGCTC GATGAGTGGA AACTTGGATG GCTTGGATAT GGGAGGTCGT GGTGGAATGC CGATGCCGGG TGCGCAAAAG AACAGACCGA TGGGACCACC AGGCAACTAT AGAAGATCTA CGGGCCCTGT CAACACTCTC AACGGCCAGT TCAACAACCT TGCTGTGAAT AACACCGGCG GGCTGATGCC CATGAACCAG CATCCACTGC AGAATTACTT GCAACCACCT TTGTTGCACG GTTCCAATGC TTCAAGCAAC TCCTCCACCA ATTCCAACGA TACGCCCAAG ACCCACAGCA GCAACGACAA CGTTAACATC AACGAAAGCG TGCTGTTGTC GGTGCCTGCA GCTGATACTT CAGCTAAACC ACTAGGAGGT ATCTCGCAGT CTAACAGAAG AAGTCTCACC GACAGACTGC TGAAGAAAAA GGGCGGAATC TTTGGGAAAA AGGGATAGGG AATGCTGAAA AACGCTTAAT ATAATTTGTA ACTGTATTTA TATAGTGGGA CAAGGCCACT ATGTGGAGTA GTTGATAAGA CGTTGACTAG TGGTGGCTGA TCCTGAGGCT TGTTTGCTTT TTTAGTATTC GTATATATTT ATTGATTTTA TTGAT
|
Protein sequence | MSAVQVLAVG VNPNVAFYAW RLHETKSVEV TVVNSTLSNS TIYWTSSSFG DNIGYKPQQC YRSVAEIPRK SSVSYDIIVL SSPSLQDFQK LCSSFAPYMR DDSIILIEST GFVNLEPFVQ LCFPKKSNLC IMSIMNESDV RQVGNNRYLH TIRNSDTRIY FGTSLGSNRI NANPNFQRMY KLLQTIQKDS ANNISLLKSL NPKEFMTYQW KMAIPRIIFN PLSIIFEFEF PQMLESQILC KPLVTGIINE LFKIIKKMDC KLVKGFENEE NLLKNWTSLF PKTSNNSEYI NSNNLFYNFY KNYDLDVDLL LLQPILLADD HGIRTPYLEN LYSTMCQYAK LNNGSSLFFA RKGGDQDKLE TIGQLDEEIK KRTAQKSSLD STIRESENKK AQHENAIVAL LVQIHDKETY LASLLSKQEQ KLKQLEGQYN QKFKDLDDQY QRQYEQNQQR LQEKPLPERN DSSNPVNGNS SGNVHGNGNG NSNNNNNNGG SGNGNTNGNS NSNGVVNGPV NRNSSPSGTD SLTDLTDIAL YGAAINNESS NQQQYAPYAS GDANGNQQQQ QLYVDTQFGP QQQYAQQQYV QKYNQYNNVP NGYQQSPIDQ QPPHGLPPNG MPQNALPPNL RSNLMMLTRY QQPPQPNGYY NAPPRRVSSI PNNVNGYFEG GYQPQQYPQQ QQYYPQQQSS FNNAAPIDPM LELRFKAKKQ NRRSQMPLGT GSMSGNLDGL DMGGRGGMPM PGAQKNRPMG PPGNYRRSTG PVNTLNGQFN NLAVNNTGGL MPMNQHPLQN YLQPPLLHGS NASSNSSTNS NDTPKTHSSN DNVNINESVL LSVPAADTSA KPLGGISQSN RRSLTDRLLK KKGGIFGKKG
|
| |