Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_41374 |
Symbol | |
ID | 4836786 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009042 |
Strand | + |
Start bp | 333435 |
End bp | 336386 |
Gene Length | 2952 bp |
Protein Length | 983 aa |
Translation table | 12 |
GC content | 41% |
IMG OID | 640388101 |
Product | predicted protein |
Protein accession | XP_001382295 |
Protein GI | 126131540 |
COG category | [E] Amino acid transport and metabolism [Q] Secondary metabolites biosynthesis, transport and catabolism [S] Function unknown |
COG ID | [COG0145] N-methylhydantoinase A/acetone carboxylase, beta subunit [COG3535] Uncharacterized conserved protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACAAGAG AAAGAAAGTT ATTAATTGGT ATTGACGTTG GTGGAACCAA CACTGATTCA GTTTTGTTGG ATCCTTCCCT TGTCTCGGAT ACCACTACCA GAGGTATCAT TGCATGGAAT AAGGCCAACA CCACTTCCGA TGTATCTGAC GGTATCGAAG CTGCTTTGAC GGAATTGTTC ACTTTGGCAC CCAAAGTCTA CAAGGAAGAT GTTGGTGCTG TAACTATTGG AACCACCCAC TTCCTTAACG CTGTTATTGA ACAGGATAGA GGCAAGCTCG ATAAGGTTGC TGTTCTAAGA TTCGTTGGTC CGTATTCGCA GAAGACTGAG CCCTTTTGTG AATTTCCTGC TGGGTTGAAA GATATTTTGA AGGGCTATGT TGCATACCTT GATGGTGGAC ACTACGTTCA CGGCGAAGAA GTTAGTGAAC TTAACAAAAA GGAAATTCAC GACCACTGTA TGAAGATAAA AGAATTGAAC ATTCACGCTG TCGTTCTTGT AGCTCAATTC TCTCCTTTGA AAAATGAACA CGAGAACATT GCTGAGGGCA TAATCAAAGA AGTTCTTCCT GATATACAGA TTGTCAAGTC TTACGAAATT GCTGGAATTG GTTTCTTAGA AAGAGAAAAT GCCGCGATTC TAAATGCTGG TATCTTGAGA TTTGCAAACA AGGTTATCGC TTCATTTAAT GCTGCCGTTC GGAGGGTGGG TTTGAATTGT CCAGTTATGT TAACTCAGAA TGATGGAACT GTGTTGCCAT CTAGTGCTGC AAGAAAGTGT CCGATAAATA CGTTTTCTTC TGGTGCCACT AATTCCATGA GAGGTGCTTC TATTCTTTGT AGTGGAGATG AGTCCATCAA GGGACAATCG GTTTTGGTTG TTGATGTTGG TGGGACCACT ACTGATATAG GTGTCTTGTT GCCAACTGGT TTCCCAAGAC AATCTGCCTC TTTCTCATAT GTAGGAGGTG TTAGAATGAA CTTCTCAATG CCTCAAGTTC ACAGCTTTGG TCTTGGCGGT GGTTCGAAGG TAAGATTCAA TAAAAAGATC ACCATTGGTC CTGACAGCGT TGGAAATGAA ATTCGTAAGC AAGCAATTAT CTTTGGAGGT GATACATTAA CAGCTTCTGA TATGGCAGTT GGAATTGGCA AGCAAGCTGG TCTTGAACCC GAATTATTCA ACATTGGAGA CCCGCAAAAA TTAGATGGCA AGTTGAACGA GAAACATATG AAGGAATTTC AGGATGAAGT CAAATACTTG TTGGAAAAGC ACATTGATAG AATGAGAACT TCAGCAGAAC CTGTTCCTGT TTTGGTTGTT GGCGGTGGAT CTTTCATTGT GCCTTCAGAT ATCGAAGGCT CTTCGAAAGT TCTCAGACCT CCATACCACG GTGTTGCTAA TGCTATTGGT GCTGCCATGT CCAAGATTTC TGGGCGAAAG CATCTCATTA AAGTTGTTCC AAACGAAAAA GAAGCTAAAG AAAATGCTCT ACAAGAATGC ATCGAAGAAG CAAAGGAAAA TGCAGTAGTA AAAGGCGCTA TTAGAGATTC TTTGACTATA GTTGAGTTGT CGCACGATCC TATTCCATAT ATTCCAAACA CTTACGAATT TATCGTTAAG GTGGTTGGAG ATGCAGACCA TTCAAGGGTA CCAGAAGTGA ATTTGGAGAA ATCATTGGAC AACCTCAGTG GTGGTTCTGT ATTAAAGGAA GTAACCACCC CAGTTGAGGA ATTTACTATT GAAAATGTAG ACATTGAAAA ATACAAGCCA AAAATTGAAA ACAGGGAATG GATCATCAGC GAAATTGACT TGGAATTTTT GAAAATTGGG ACCTACATCT TGGGATGTGG TGGTGGAGGA ACTCCTCATC CTACTTTCAT CGACATAAGA AATATGCTTC GAAATGGTGC TACTATCAGA GTTATTGACA TTGACGATGT TTCTAAGTAC ACCGATGGAA AGAGGTCTAT TATCTGTGTC GGATTTGCTG GTTCGCCAAC TGTTGCATCT GAGATGTTGA AGGCCGACGA ATTGCTTGAA GCAGCCAAGT CATTGATTCA ATTTACTGGC CAGGAAGCTA AGGTTGTGTG TCCATTAGAA ATTGGAGGAG GAAATGGTTT CACTGGGTTT GAAGTTGGTG CCTCTAATAA GTTGAATATT CCAGTTGTTG ATTCTGATTT TATGGGACGT GCCTACCCAA CACTCTGGCA AAGTTCTGCA AATGCTATTT ATGAGAAGTT TCCATATTGG CCAGCTGCCG TAAGCAATGG TAACAGTAGT TCAATGTTAA TTTCAGAAGC TAGTAATTGC GAGTCGTTGG AGAGATTGAT ACGTTCTACT TGTGTTGAAG TTGGAACTCA TGTTGGAGTT GTTATGGCAC CAATGACTTC TGAAGAATTA ACTGGTGGTA CTGTTCCTGG TTCGATCTCC TTGGCTTGGC GTATTGGACG AGCTGTACTT TTGGCCCGGC AAAAATTGGA GCATGATTTG ATACCTCAAA GAATCATTGA ATCTGTTGGT GGTAAATCCT CTGGTAGTCA TCTTTTCACA GGGAAGATTG TCGACGTTAG CAGAAAAGTA CACAAGGGGC ATGTATACGG TGAAGTTATA ATCGAAGAAC CTGAAACCAA GAAGCAGATG GTCATTCCAT TCAAGAACGA GAACATTCTT TGCCGTGTGA GGGAAACCGC AGAAGAGGAA GGCAAAGTTG TTTGTGCGGT ACCTGATTTG ATTGCTGTGC TTGAATCGGA AACTGGTGAA GCACTTGGTA CCCCAGATTA CAAGTACGGT CTCATTGTCA ATGTTATTTC AATTTCTCCT AGCAATATTT GGACCGATAC AGAAAAGGCT ATGGCGATTG GTGGCCCAGC TAGTTTTGGA TTTGATGAAG TTGAATATGT TCCAGTTGGA ACTTACACTA GACCAGTTTC CGTCATTGAG GAATATTGTT AA
|
Protein sequence | MTRERKLLIG IDVGGTNTDS VLLDPSLVSD TTTRGIIAWN KANTTSDVSD GIEAALTELF TLAPKVYKED VGAVTIGTTH FLNAVIEQDR GKLDKVAVLR FVGPYSQKTE PFCEFPAGLK DILKGYVAYL DGGHYVHGEE VSELNKKEIH DHCMKIKELN IHAVVLVAQF SPLKNEHENI AEGIIKEVLP DIQIVKSYEI AGIGFLEREN AAILNAGILR FANKVIASFN AAVRRVGLNC PVMLTQNDGT VLPSSAARKC PINTFSSGAT NSMRGASILC SGDESIKGQS VLVVDVGGTT TDIGVLLPTG FPRQSASFSY VGGVRMNFSM PQVHSFGLGG GSKVRFNKKI TIGPDSVGNE IRKQAIIFGG DTLTASDMAV GIGKQAGLEP ELFNIGDPQK LDGKLNEKHM KEFQDEVKYL LEKHIDRMRT SAEPVPVLVV GGGSFIVPSD IEGSSKVLRP PYHGVANAIG AAMSKISGRK HLIKVVPNEK EAKENALQEC IEEAKENAVV KGAIRDSLTI VELSHDPIPY IPNTYEFIVK VVGDADHSRV PEVNLEKSLD NLSGGSVLKE VTTPVEEFTI ENVDIEKYKP KIENREWIIS EIDLEFLKIG TYILGCGGGG TPHPTFIDIR NMLRNGATIR VIDIDDVSKY TDGKRSIICV GFAGSPTVAS EMLKADELLE AAKSLIQFTG QEAKVVCPLE IGGGNGFTGF EVGASNKLNI PVVDSDFMGR AYPTLWQSSA NAIYEKFPYW PAAVSNGNSS SMLISEASNC ESLERLIRST CVEVGTHVGV VMAPMTSEEL TGGTVPGSIS LAWRIGRAVL LARQKLEHDL IPQRIIESVG GKSSGSHLFT GKIVDVSRKV HKGHVYGEVI IEEPETKKQM VIPFKNENIL CRVRETAEEE GKVVCAVPDL IAVLESETGE ALGTPDYKYG LIVNVISISP SNIWTDTEKA MAIGGPASFG FDEVEYVPVG TYTRPVSVIE EYC
|
| |