Gene PICST_40725 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_40725 
Symbol 
ID4836987 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009042 
Strand
Start bp1334987 
End bp1337863 
Gene Length2877 bp 
Protein Length958 aa 
Translation table12 
GC content41% 
IMG OID640388302 
Productpredicted protein 
Protein accessionXP_001382479 
Protein GI150863858 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1506] Dipeptidyl aminopeptidases/acylaminoacyl-peptidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCTATTG AAGAATACGA ACTACTTGAT CAAGAGCCAA GAGACTTGGA GCTGCAACAG 
TCGTCACCGC CAGAATCTGC CTCGACACCT AGCCCCTACA ACGAAGTCGA AGAAGAGCCA
GAGTTACGAG ATTCGCTACA ATCAGATTCG TCTCAACTAT TTGACGATAT AGACGTTTAC
ATGTCACGTA ATAGTGCTGA AATCGATGAT TTCAGCAATA GCCCTTTGTT TCAGTCCGTT
TTTCTGAAGT ACCAGGGAGC TAACGTGTGG ATGAAAAGAG TTTGTATGGG ATTATGCATT
TTTTCTATAT TATTATGGCT AGCCGGTCTT CTTGTGTATT CACAGATGTC CCTATCTTCA
GCCGTCAAGA GTATAACCTG GCAGACAGAC GTAGAAGTTA GTGGTAAGAA TATCACTTTG
AACAAATACA GCCCTAAATA CGCCAATTTG ACCATTGATC AGATGAGAAA GTCGAAGTAC
GCAGCATACA AAACGACCAT AAAATGGCTA GAACCACAAC AATACCCCAA AGATACGGCT
CCAGTACCTC GTGGATCGGG ATTCTACTTA GAAAGAGATT CTGACCGATA CAATATAAGG
CAGATGAACA CCGTTTTTAC TGCCCCTTTT ATAGAAAGAA CCCAATTTGC ATACAGAAAC
AATTTCTTCT ACATAACGGA TGTCATATTG AACCCACACA AGCCAATTGA CGATCCAGAC
AACTATCATA TAGTTGTTAC AGACAAACTC GAACAATGGA GGAGTCTGAG TTTCGCATTG
TACTGGATAT ACAATCCATT AACAGCACAG TATATACCGA TACAACCACC CCAGAATTTA
AAGAAACTCC AGAACGATAA CCAGTTGCAG GAAGAGATTT TGGACAAGTT GCACTTCGCT
GAATTTTCTC CAAAGGGTGA TTTCGTCGTT TTTGGCTTCA ATCACGATAT CTTCTTGCAA
GATGTAGTCT CGAACGAAAT TCAGAGAATC ACTAATACAG GTTCTACCAG CATTTTCAAC
GGGAAACCGG ATTGGGTGTA TGAAGAAGAA GTGGCTGCTG ATTACAAGTT AATTTGGTGG
TCACCAGACC AAGAGAACTT GGTGTTTGCT TCTTTAAATG ATACGCTTGT CCAAGAATTT
GAATTGGACT ACTATATCAA AGACAGCACT GAAGTAGGTA CTCAGTACAA AGAACTGCTG
GAAAATAAAT TCGAAGATGT CAACCAGTAT CCTATAAAGA CGTCAATTAA GTACCCCAAA
CCTGGAACTT CGAACCCTAT ATTGTCATTA TTTAATTACA GACTTTCTGA CAAGTCTATC
AAAGAAATCA CCAAGTTACA GGATGGCTTG GGAGAAGATT TCATCTTGTA TAAAGCTGCA
TGGGTAGACA GCAAGAACTT TCTCATGAAG CTCACAGACA GAACAAGTGC CATTCTCAAG
AAAAAGGTAT TCCAGCCAGC TATATCATCT GAAGTTATTG AAGTCAATTC TATGAATGTG
ACTCAGGAGT ATGGAGGATG GGTTGATAAA CTTTCGCAAA TTGCCATTGT AGAAACAGAT
GACGACAAGG AAAATCTGTA CATTGACAAA GTAGTGGTCA ATGGTTTCAC TCATATAGCA
CTCTTCGAAT CAGCCACATC CAAAGACTAT GCCAGATTAT TGACATCCTC CAATACGTGG
GAAGTTCCAC TTAGCTCTCC ATTAGTTCAC GACAAGCAGT TTAACGTTGT CTACTTCTTG
ACCACTATAA GAAGTTCCAT GGACGCCCAT CTATATGCTG TTGATCTTTC TACTGATGAC
AACAAGTTGA TACCTATCAC AAGCTCTGAA GTAGACGGGT TGTACCAAGT TGAGTTCGAC
CAGGCGGGCC AACATTTGAA CTTGTTCTAC AAAGGCCCAA AACAGCCATG GCAGAGACTA
GTGAATATGG CCGAAGTTCA TGAATTCATT TCTTCTGGAG ATTTCAAGGG AAATGGTGTA
GACGAACTCA TCTTGAAGAG TGAAGTCATC AATCATTTTG ACGTTACTGA AGGTAACTTA
AAGGATACCA ACATTCCTAC AAAAGTGTAC AAGACTATTC AAGTAGGCAA ATATGACGAC
GGAAGCCCAC TTCGGTTGAA TGTGATTGAA ATCTTTCCCC CTAACTTCAA CCCTCACAGG
GCAAAAAAGT ACCCTTTACT AGTGTATGCT TATGGAGGTC CTGGCTCTCA AACTGTAGAC
AAGTCGTTTG ACATAGATTT CCAGCACATT GCTAGTGCTT CGTTGGATGC CTTGGTGTTG
GTTATAGATC CTAGAGGTAC AGGTGGGCAA GGCTGGAAAT TCAGTAGCAC CGCTAAGAAT
AGACTAGGCT ATTGGGAACC CAGAGATATC ACCACTATAA CTTCAGAGTA TATAACCGTA
AACAAGAAGT TTATCGACCA GTCCAGAACT GCAATCTGGG GCTGGTCCTA CGGCGGCTTC
ACTTCCTTGA AGACATTAGA GTTTGACCGT GGAAAGACCT TTAAATATGG TATGGCAGTG
GCACCAGTTA CAAATTGGCT ATTCTATGAT TCGGTGTATA CTGAAAGGTA CATGAATCCA
CCAAAAGTAA ATGGAAACTA CGAGAAGTAC GGCCGAATCA GCGATTACAA GAATTTCAAG
TCGTTGAAGA GGTTCTTGCT AATGCATGGA ACATCTGATG ACAACGTCCA CCTTCAGAAT
CTGCTCTGGC TACTTGACAA GTTCAATCTT GGAGAAGTTG AGAACTACGA TGTCCATTTC
TTCCCTGACA GTGACCATGG AATCTATTAC CACAATGCCA ACTCGATAGT TTTTGACAAG
TTGCTTCACT GGTTGAGAGA TGCATTTATG GGCAAGTTCG ACGGGTTGTA TAGATAG
 
Protein sequence
MPIEEYELLD QEPRDLESQQ SSPPESASTP SPYNEVEEEP ELRDSLQSDS SQLFDDIDVY 
MSRNSAEIDD FSNSPLFQSV FSKYQGANVW MKRVCMGLCI FSILLWLAGL LVYSQMSLSS
AVKSITWQTD VEVSGKNITL NKYSPKYANL TIDQMRKSKY AAYKTTIKWL EPQQYPKDTA
PVPRGSGFYL ERDSDRYNIR QMNTVFTAPF IERTQFAYRN NFFYITDVIL NPHKPIDDPD
NYHIVVTDKL EQWRSSSFAL YWIYNPLTAQ YIPIQPPQNL KKLQNDNQLQ EEILDKLHFA
EFSPKGDFVV FGFNHDIFLQ DVVSNEIQRI TNTGSTSIFN GKPDWVYEEE VAADYKLIWW
SPDQENLVFA SLNDTLVQEF ELDYYIKDST EVGTQYKESS ENKFEDVNQY PIKTSIKYPK
PGTSNPILSL FNYRLSDKSI KEITKLQDGL GEDFILYKAA WVDSKNFLMK LTDRTSAILK
KKVFQPAISS EVIEVNSMNV TQEYGGWVDK LSQIAIVETD DDKENSYIDK VVVNGFTHIA
LFESATSKDY ARLLTSSNTW EVPLSSPLVH DKQFNVVYFL TTIRSSMDAH LYAVDLSTDD
NKLIPITSSE VDGLYQVEFD QAGQHLNLFY KGPKQPWQRL VNMAEVHEFI SSGDFKGNGV
DELILKSEVI NHFDVTEGNL KDTNIPTKVY KTIQVGKYDD GSPLRLNVIE IFPPNFNPHR
AKKYPLLVYA YGGPGSQTVD KSFDIDFQHI ASASLDALVL VIDPRGTGGQ GWKFSSTAKN
RLGYWEPRDI TTITSEYITV NKKFIDQSRT AIWGWSYGGF TSLKTLEFDR GKTFKYGMAV
APVTNWLFYD SVYTERYMNP PKVNGNYEKY GRISDYKNFK SLKRFLLMHG TSDDNVHLQN
SLWLLDKFNL GEVENYDVHF FPDSDHGIYY HNANSIVFDK LLHWLRDAFM GKFDGLYR