Gene PICST_41374 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_41374 
Symbol 
ID4836786 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009042 
Strand
Start bp333435 
End bp336386 
Gene Length2952 bp 
Protein Length983 aa 
Translation table12 
GC content41% 
IMG OID640388101 
Productpredicted protein 
Protein accessionXP_001382295 
Protein GI126131540 
COG category[E] Amino acid transport and metabolism
[Q] Secondary metabolites biosynthesis, transport and catabolism
[S] Function unknown 
COG ID[COG0145] N-methylhydantoinase A/acetone carboxylase, beta subunit
[COG3535] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACAAGAG AAAGAAAGTT ATTAATTGGT ATTGACGTTG GTGGAACCAA CACTGATTCA 
GTTTTGTTGG ATCCTTCCCT TGTCTCGGAT ACCACTACCA GAGGTATCAT TGCATGGAAT
AAGGCCAACA CCACTTCCGA TGTATCTGAC GGTATCGAAG CTGCTTTGAC GGAATTGTTC
ACTTTGGCAC CCAAAGTCTA CAAGGAAGAT GTTGGTGCTG TAACTATTGG AACCACCCAC
TTCCTTAACG CTGTTATTGA ACAGGATAGA GGCAAGCTCG ATAAGGTTGC TGTTCTAAGA
TTCGTTGGTC CGTATTCGCA GAAGACTGAG CCCTTTTGTG AATTTCCTGC TGGGTTGAAA
GATATTTTGA AGGGCTATGT TGCATACCTT GATGGTGGAC ACTACGTTCA CGGCGAAGAA
GTTAGTGAAC TTAACAAAAA GGAAATTCAC GACCACTGTA TGAAGATAAA AGAATTGAAC
ATTCACGCTG TCGTTCTTGT AGCTCAATTC TCTCCTTTGA AAAATGAACA CGAGAACATT
GCTGAGGGCA TAATCAAAGA AGTTCTTCCT GATATACAGA TTGTCAAGTC TTACGAAATT
GCTGGAATTG GTTTCTTAGA AAGAGAAAAT GCCGCGATTC TAAATGCTGG TATCTTGAGA
TTTGCAAACA AGGTTATCGC TTCATTTAAT GCTGCCGTTC GGAGGGTGGG TTTGAATTGT
CCAGTTATGT TAACTCAGAA TGATGGAACT GTGTTGCCAT CTAGTGCTGC AAGAAAGTGT
CCGATAAATA CGTTTTCTTC TGGTGCCACT AATTCCATGA GAGGTGCTTC TATTCTTTGT
AGTGGAGATG AGTCCATCAA GGGACAATCG GTTTTGGTTG TTGATGTTGG TGGGACCACT
ACTGATATAG GTGTCTTGTT GCCAACTGGT TTCCCAAGAC AATCTGCCTC TTTCTCATAT
GTAGGAGGTG TTAGAATGAA CTTCTCAATG CCTCAAGTTC ACAGCTTTGG TCTTGGCGGT
GGTTCGAAGG TAAGATTCAA TAAAAAGATC ACCATTGGTC CTGACAGCGT TGGAAATGAA
ATTCGTAAGC AAGCAATTAT CTTTGGAGGT GATACATTAA CAGCTTCTGA TATGGCAGTT
GGAATTGGCA AGCAAGCTGG TCTTGAACCC GAATTATTCA ACATTGGAGA CCCGCAAAAA
TTAGATGGCA AGTTGAACGA GAAACATATG AAGGAATTTC AGGATGAAGT CAAATACTTG
TTGGAAAAGC ACATTGATAG AATGAGAACT TCAGCAGAAC CTGTTCCTGT TTTGGTTGTT
GGCGGTGGAT CTTTCATTGT GCCTTCAGAT ATCGAAGGCT CTTCGAAAGT TCTCAGACCT
CCATACCACG GTGTTGCTAA TGCTATTGGT GCTGCCATGT CCAAGATTTC TGGGCGAAAG
CATCTCATTA AAGTTGTTCC AAACGAAAAA GAAGCTAAAG AAAATGCTCT ACAAGAATGC
ATCGAAGAAG CAAAGGAAAA TGCAGTAGTA AAAGGCGCTA TTAGAGATTC TTTGACTATA
GTTGAGTTGT CGCACGATCC TATTCCATAT ATTCCAAACA CTTACGAATT TATCGTTAAG
GTGGTTGGAG ATGCAGACCA TTCAAGGGTA CCAGAAGTGA ATTTGGAGAA ATCATTGGAC
AACCTCAGTG GTGGTTCTGT ATTAAAGGAA GTAACCACCC CAGTTGAGGA ATTTACTATT
GAAAATGTAG ACATTGAAAA ATACAAGCCA AAAATTGAAA ACAGGGAATG GATCATCAGC
GAAATTGACT TGGAATTTTT GAAAATTGGG ACCTACATCT TGGGATGTGG TGGTGGAGGA
ACTCCTCATC CTACTTTCAT CGACATAAGA AATATGCTTC GAAATGGTGC TACTATCAGA
GTTATTGACA TTGACGATGT TTCTAAGTAC ACCGATGGAA AGAGGTCTAT TATCTGTGTC
GGATTTGCTG GTTCGCCAAC TGTTGCATCT GAGATGTTGA AGGCCGACGA ATTGCTTGAA
GCAGCCAAGT CATTGATTCA ATTTACTGGC CAGGAAGCTA AGGTTGTGTG TCCATTAGAA
ATTGGAGGAG GAAATGGTTT CACTGGGTTT GAAGTTGGTG CCTCTAATAA GTTGAATATT
CCAGTTGTTG ATTCTGATTT TATGGGACGT GCCTACCCAA CACTCTGGCA AAGTTCTGCA
AATGCTATTT ATGAGAAGTT TCCATATTGG CCAGCTGCCG TAAGCAATGG TAACAGTAGT
TCAATGTTAA TTTCAGAAGC TAGTAATTGC GAGTCGTTGG AGAGATTGAT ACGTTCTACT
TGTGTTGAAG TTGGAACTCA TGTTGGAGTT GTTATGGCAC CAATGACTTC TGAAGAATTA
ACTGGTGGTA CTGTTCCTGG TTCGATCTCC TTGGCTTGGC GTATTGGACG AGCTGTACTT
TTGGCCCGGC AAAAATTGGA GCATGATTTG ATACCTCAAA GAATCATTGA ATCTGTTGGT
GGTAAATCCT CTGGTAGTCA TCTTTTCACA GGGAAGATTG TCGACGTTAG CAGAAAAGTA
CACAAGGGGC ATGTATACGG TGAAGTTATA ATCGAAGAAC CTGAAACCAA GAAGCAGATG
GTCATTCCAT TCAAGAACGA GAACATTCTT TGCCGTGTGA GGGAAACCGC AGAAGAGGAA
GGCAAAGTTG TTTGTGCGGT ACCTGATTTG ATTGCTGTGC TTGAATCGGA AACTGGTGAA
GCACTTGGTA CCCCAGATTA CAAGTACGGT CTCATTGTCA ATGTTATTTC AATTTCTCCT
AGCAATATTT GGACCGATAC AGAAAAGGCT ATGGCGATTG GTGGCCCAGC TAGTTTTGGA
TTTGATGAAG TTGAATATGT TCCAGTTGGA ACTTACACTA GACCAGTTTC CGTCATTGAG
GAATATTGTT AA
 
Protein sequence
MTRERKLLIG IDVGGTNTDS VLLDPSLVSD TTTRGIIAWN KANTTSDVSD GIEAALTELF 
TLAPKVYKED VGAVTIGTTH FLNAVIEQDR GKLDKVAVLR FVGPYSQKTE PFCEFPAGLK
DILKGYVAYL DGGHYVHGEE VSELNKKEIH DHCMKIKELN IHAVVLVAQF SPLKNEHENI
AEGIIKEVLP DIQIVKSYEI AGIGFLEREN AAILNAGILR FANKVIASFN AAVRRVGLNC
PVMLTQNDGT VLPSSAARKC PINTFSSGAT NSMRGASILC SGDESIKGQS VLVVDVGGTT
TDIGVLLPTG FPRQSASFSY VGGVRMNFSM PQVHSFGLGG GSKVRFNKKI TIGPDSVGNE
IRKQAIIFGG DTLTASDMAV GIGKQAGLEP ELFNIGDPQK LDGKLNEKHM KEFQDEVKYL
LEKHIDRMRT SAEPVPVLVV GGGSFIVPSD IEGSSKVLRP PYHGVANAIG AAMSKISGRK
HLIKVVPNEK EAKENALQEC IEEAKENAVV KGAIRDSLTI VELSHDPIPY IPNTYEFIVK
VVGDADHSRV PEVNLEKSLD NLSGGSVLKE VTTPVEEFTI ENVDIEKYKP KIENREWIIS
EIDLEFLKIG TYILGCGGGG TPHPTFIDIR NMLRNGATIR VIDIDDVSKY TDGKRSIICV
GFAGSPTVAS EMLKADELLE AAKSLIQFTG QEAKVVCPLE IGGGNGFTGF EVGASNKLNI
PVVDSDFMGR AYPTLWQSSA NAIYEKFPYW PAAVSNGNSS SMLISEASNC ESLERLIRST
CVEVGTHVGV VMAPMTSEEL TGGTVPGSIS LAWRIGRAVL LARQKLEHDL IPQRIIESVG
GKSSGSHLFT GKIVDVSRKV HKGHVYGEVI IEEPETKKQM VIPFKNENIL CRVRETAEEE
GKVVCAVPDL IAVLESETGE ALGTPDYKYG LIVNVISISP SNIWTDTEKA MAIGGPASFG
FDEVEYVPVG TYTRPVSVIE EYC