Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_66087 |
Symbol | |
ID | 4840704 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009047 |
Strand | - |
Start bp | 745779 |
End bp | 748370 |
Gene Length | 2592 bp |
Protein Length | 803 aa |
Translation table | 12 |
GC content | 43% |
IMG OID | 640392019 |
Product | predicted protein |
Protein accession | XP_001386339 |
Protein GI | 150866671 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0294] Dihydropteroate synthase and related enzymes [COG0801] 7,8-dihydro-6-hydroxymethylpterin-pyrophosphokinase |
TIGRFAM ID | [TIGR00525] dihydroneopterin aldolase [TIGR00526] FolB domain [TIGR01496] dihydropteroate synthase [TIGR01498] 2-amino-4-hydroxy-6-hydroxymethyldihydropteridine pyrophosphokinase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.512929 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTCAACG ACAAAGTCTT TGCCAAAGAT ATCGCTGCTA CAGCCATAAC CGGCAAGGAC GCATGGAACA GACCTTCACC CCAACCGATA ACTATTTCTG TTAGTCTAGA TACCGATTTT CACCAAGCGT CTGTGACTGA CAACTTGAAG TACTCGTTGA ACTATGCTGT GATTTCCAGA AACATCTCTG AATTCATGAA AGCCAACGAA CACAGAAACT TCAAGTCCTT AGGGAACATC GCTGAATCTG TAAGTGAAAT TGTTCTAGAT CAGAAGAAAG GAGGAGGATC CAAAGCTGAA GTTGTAGTCA AAAGTGTCAA GTCTGAGATC AGAGCTGACA GTGTTGAGTA CAGGTTGAGC CGTTCTAAAA CTGATTCTGA ACCAATTCCT GATGAAATCG CCGTCAGAGG TTTACGTTTG CTTACAATCA TTGGTGTTTT TACATTTGAA CGATTCCAGA GACAGATTGT AGATATAGAT TTGCAACTCA AGTTGCACAA GGAATCAGAC GTGTCTATAC ATGAGATCAT AGATGATGTT GTTTCCTATG TAGAGCTGTC GAACTTCAAG ACTGTAGAAG CATTGGTGAT GAAAATTGGT CAGTTGATCT TCCAGAACCA CCAAAAAGGT GTAACCTCTG TATTTGCTAA AGTAACCAAA CCCAATGCCA TCAGCTATAC CGAAGGAGTT GGAGTGTCTT CCTTGATGAC CAAGGCTTCT TTTGAGGGTG TAGAACCAAT TCCTAAGTCA GATGGATTTT CCACTCATTT GCACCCTTCA GAAAAGTTCA ATCTTCCCAC TGCTGCTGAA GATGCCGAGA GTCAAGCTAA CGAAGAGCAC GTAGCATATA TAGCATTTGG ATCCAATGAA GGTTGCCAGG TAGAAAATAT CCAGACAGCT TTGAAATTGT TGGAAAAATA TGGTGTAAAG GTTCAGTCCA CTTCGTCGTT ATATATTTCC AAGCCTATGT ACTACTTGGA CCAGCCAGAC TTCTACAATG GAGCTGTGAA GGTGACTTTT AAGAACAAAA GCCCACACGA CTTGTTGGCC ATACTTAAGA AGATAGAGTA CGAAGACATC AACCGAAAGA AAGAGTTCGA CAATGGTCCT CGTTCTATAG ATTTGGATAT AATACTCTAC GACGATATCA GTGTTAACCA CGAAGACCTA ATTATTCCTC ATAAGTTGAT GTTGGAAAGG ACGTTTGTGT TGCAGCCCAT CTGTGAATTA TTACCTCCAG ACCATATTCA CCCAGTCAGT GCTGAGCCTA TCCATAACCA TCTTCTGCAG CTTCTCTTGA GTAAGCCCAA CGAGTCTGTC CAAGAGTCAT CGGATTTGTT GCAGTTAATT CCGGTTCCGC GACTCGATAA CAAGGACACT ATCTTGAAGT TTGACCAGTT GAAGAATCTG CACTCCACTT TAATAATGGG AATTGTGAAC ATAACACCAG ACTCGTTTAG TGACGGAGGT GTGAACTTTT CCAAATCTGT AGACGAGGTC TTGAAGACAG CAAAGAGTCT CATTGGCGAT GGTGCACATA TCTTGGATAT AGGGGGTGTC TCTACTAGAC CTGGCAGTAA GGAACCTACT GAAGAAGAAG AGTTGCGGAG AGTGGTGCCT CTTGTAAAGG CTATAAGAGC TTCATCAGAT AAACAATTGT CGTCTTGTTT AATTTCTGTA GACACTTACA GAGCCAAAGT GGCAGAAGAG TGTTTGAAAG CTGGTGCCGA CATCATCAAC GACATTTCAA TGGGCTTATA TGAGGAAGCT ATATTTGATG TAGTAGCAAA ATATGGCTGT CCTTACATTA TGAACCACAC CCGGGGTACG GCTGCTACGA TGAGTAAGCT TACGAATTAC GAGTCCAACA CTAACGATGA CATCATCGAA TATCTAGTGG ATCCCATTTT GGGCCATCAG GAGTTAGATT TGACGCCCGA AGTGAATAAC CTTATCAACG GAGTTTCCCG CGAACTCAGC TTGCAGATGC TCAAGGCGTT CGACAGAGGT GTGAGAAAAT GGCAGATCAT TGTCGACCCC GGAATCGGCT TTGCCAAAGA CTTATCGCAG AATCTACAAC TCATCAAGCA CGCTTCTCTT TTCAAGCAGT ACTCGGTACA AGTGAACGTC GATATCAGCG ATAGCAATCA CCGTAATGCT AGCAATGGTA TCAAACATAT GTATGTCAGT TTCAATGGCC TAGCTACTTT GTTGGGTACG TCTCGTAAAA AATTCTTGGG CTCCATTGTC AATCAACCTG AGGCTTCCAA GCGTATGGTC GCTACTGCAG CTTCAGTTAT AGCATGCATA CAGCAGAAAA CTGATATTGT CCGTGTACAT GATGTCAAGG ATATAAAGGA AGCTGTGTTA ACGGGAGACG CTATCTACAG AGATCTGTAC AAGAAATCGT AATAGCCTAC TGTGGCTACC GTGACGAATG GAGCAACGGA GCGACTAGAT AGTCCAGAAT GCCACTATAT AATCGTACCT TGGTGATATC TAGTTGACGG ATGTCTCGAT ATTAGAACTG GTGACTTTAT AGTACCAAAT AATTCCATTG TCTAGCAAAT AACTTATCTA CGAAATAATC TG
|
Protein sequence | MLNDKVFAKD IAATAITGKD AWNRPSPQPI TISVSLDTDF HQASVTDNLK YSLNYAVISR NISEFMKANE HRNFKSLGNI AESVSEIVLD QKKGGGSKAE VVVKSVKSEI RADSVEYRLS RSKTDSEPIP DEIAVRGLRL LTIIGVFTFE RFQRQIVDID LQLKLHKESD VSIHEIIDDV VSYVESSNFK TVEALVMKIG QLIFQNHQKG VTSVFAKVTK PNAISYTEGV GVSSLMTKAS FEGVEPIPKS DGFSTHLHPS EKFNLPTAAE DAESQANEEH VAYIAFGSNE GCQVENIQTA LKLLEKYGVK VQSTSSLYIS KPMYYLDQPD FYNGAVKVTF KNKSPHDLLA ILKKIEYEDI NRKKEFDNGP RSIDLDIILY DDISVNHEDL IIPHKLMLER TFVLQPICEL LPPDHIHPVS AEPIHNHLSQ LLLSKPNESV QESSDLLQLI PVPRLDNKDT ILKFDQLKNS HSTLIMGIVN ITPDSFSDGG VNFSKSVDEV LKTAKSLIGD GAHILDIGGV STRPGSKEPT EEEELRRVVP LVKAIRASSD KQLSSCLISV DTYRAKVAEE CLKAGADIIN DISMGLYEEA IFDVVAKYGC PYIMNHTRGT AATMSKLTNY ESNTNDDIIE YLVDPILGHQ ELDLTPEVNN LINGVSRELS LQMLKAFDRG VRKWQIIVDP GIGFAKDLSQ NLQLIKHASL FKQYSVQVNV DISDSNHRNA SNGIKHMYVS FNGLATLLGT SRKKFLGSIV NQPEASKRMV ATAASVIACI QQKTDIVRVH DVKDIKEAVL TGDAIYRDSY KKS
|
| |