Gene PICST_56033 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_56033 
Symbol 
ID4837401 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009042 
Strand
Start bp2294614 
End bp2297004 
Gene Length2391 bp 
Protein Length767 aa 
Translation table12 
GC content43% 
IMG OID640388716 
Productpredicted protein 
Protein accessionXP_001383195 
Protein GI150864404 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.517516 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACACGA ACTTGCTATA TGTGCCATTA CGGCAGTCTC GTCCCATAGA TATGGGCTCT 
GAATTACGCG AGGTTATCCG AAAAGACTAC TTCCAGACTC CATCTTCATT TGAACCTGAT
CTCATGAGAA TTTCCAATGC TCGAAACAAG ATCACTCTAC TAACGAACGA AACGATTAGC
CAAAAGAGCG AAATTTTACT CAAAGAGTAC TACGTCTACC TTCTTGCAGT TATGAAGAAA
TTTCTGGATG GCTGTGTCGA GTTTGGCTGG TATGGCACTT TGACTTATGG CCCTAGTGGT
CCTACCAAGT CTCGTTCACT TAAGGTAGAA TTGTGGAATA TCGTATTTCA ATTGGGAAGT
TTCTACTCGC AGATGGCTCT ACAAGAATCC AGATTTACTG ATGATGGCTT GAAGAATGCG
TGTGCGCTTT TTCAGCAGGC TGCTGGCTGT TTTGAGTATA TTTGTCAGTT AGTGAAAAGG
GAAACAGACC AGAGTTCCAA TTCTTTGGCA ATACCGCGAG ATTTTTATGG CGACACTGTG
CTCTGTTTGA AGTTCTTGAT GTTGGCACAG GCTCAGGAAA CAATCTGGCA GAAAGCTCTT
GGTAACACTA CTTTGAAAGA TACGGTTATC GCTCGTTTGT CTATGCTGAC ATCTGATCTC
TACGGCCAGG CTTTGGAATA TGGAAACCGT TCTGATTACA TCAAGCTTGA GTGGATTAAC
CATATAGGTG TTAAGAAGTT CCACTTCAAA GCAGCAGCAT ATTACAGAAT GTCAATTGTA
AGTCAGGATA GCTTTGAGTA TGGGGAACAG GTGGCACTTT TGCGAGTGGC GTCTTCATCG
TGTGACTCGG CTCTCAAGTA CAAAAAGTAC GTAACTCAGC TTGTAGTAGA AGACTTGCGG
GGTTTGAACC AGACGATCAA GGATGTTTTG CGTGGAGCAG AGAAGGACAA CGACTTGGTG
TACATCAAGC CTGTTCCCGT CGAAAAGGAT CTCAAGCCCA TAGCAGCCGT TTCTATGGTT
AAAGCTACCG TTCCTTCGGA TCTTGAGACT CCAGTAGAGA CTAGGAAACT GCTCTTCAAT
GATCTTTTGC CCTATATTGT TATTCAGGTC GCACAAGCCT TTCGGGAAAG ACAGGATAAG
TATATATATG AACGTTTTGT TGAGCCTATT CAGGCACTAA ACAACATGTT AGTCAAATTT
ATTACCGAAA GAGGTCTTCC TGCTTCGATC GATGCACTTC AGCAGTCGGA AAATTTGCCC
GATTCTATCA TCCAGCATTC CCAGGAGATC TTGGCCTTTG GTGGAACTGA CATTATTGAA
GATTCCATCA CGGAAATCAA CAAGCTTTCT ATGGAATGTC AACAATTAAT AGACCATTGC
AATGGAAGAC TCACCCTTGA TGCTAAAGAG GAAGATATGA TGCGGCAGAG GCATGGCCGT
GAACATTGGA ACCATCAGAC GACAGAAGTT GCCGCTCGTG CACTTATAGA GAGAATAGAA
AAGATGATTC AATATCTAGA TCAAGCTAGA GACGGAGATA GTTGTGTTCT CACTAAGTAC
TACGAAATCA AGCCATATTT GGAGATCTAT TGTGGAGGAT ATAAGCCATT AAGCGAGTTC
ATTCCCAACT CGGACTACTC CAAGGTCGAC AAGAACATGA GCAATATCAT TACGGATTTG
AGAAACGCCG TAAATCAGGT ATCTGTACTA GAAGAACAAC GCAAAAGGTT TCTTCTGCAA
GTAGAGTTGA AGGCTCGAGA ACACAATATC TTGCCTAGCG TGATTGAAGA GTTCAAGCTG
AAACAAAATG AGATGTACGA TGAAAATGGA AATGTAAATG AGAGATCGTT TGAAGTAGTC
TACGACAAAC ATATCAAGCT CTTCAGCAAA GAGATGAAAT TCATGGAGAG CACTAAAAGT
ACCCAGATCT CGTTGGAAAA CGATATAGAT ACCTTGAACA GCCGCTTTAT TTCTGACTAC
AACACCAGAA GTAGTGACTC GCAGGTTAAA CGAAAAGAAG CACTACAGTT GCTTGAGGCT
GTTTACTCCA AGTATCTAGA GGTAATCTCT AACTTGAGCG AGGGATCGAA ATTCTACAAT
GACTTTTTGG TCAAAGGCAA CGGAGTACTA AGTGAGTGTG AAGATTATCT TAATCAACGA
CGCTTAGAGA GCAGAGAACT AGAACTTACC ATCAGCAAAC TGTTCAGGTC TGGTCCATCT
CAACATTCAC ATGGATATGA CGAAGAACTC AGTCCAACTT CTGTTCATGA AAGTCGAGAA
ATGGAAAGAT TGCGCAAAGA GGTAGAAGAA GAGACTCGTG CAACAAGTGT TGGGGCTCCC
ATAACTAAGC CTGGTATATG GAGTCCAGAC CAGGGCATCA AGTTTGACTG A
 
Protein sequence
MNTNLLYVPL RQSRPIDMGS ELREVIRKDY FQTPSSFEPD LMRISNARNK ITLLTNETIS 
QKSEILLKEY YVYLLAVMKK FSDGCVEFGW YGTLTYGPSG PTKSRSLKVE LWNIVFQLGS
FYSQMALQES RFTDDGLKNA CALFQQAAGC FEYICQLVKR ETDQSSNSLA IPRDFYGDTV
LCLKFLMLAQ AQETIWQKAL GNTTLKDTVI ARLSMSTSDL YGQALEYGNR SDYIKLEWIN
HIGVKKFHFK AAAYYRMSIV SQDSFEYGEQ VALLRVASSS CDSALKYKKY VTQLVVEDLR
GLNQTIKDVL RGAEKDNDLV YIKPVPVEKD LKPIAAVSMV KATVPSDLET PVETRKSLFN
DLLPYIVIQV AQAFRERQDK YIYERFVEPI QALNNMLVKF ITERGLPASI DALQQSENLP
DSIIQHSQEI LAFGGTDIIE DSITEINKLS MECQQLIDHC NGRLTLDAKE EDMMRQRHGR
EHWNHQTTEV AARALIERIE KMIQYLDQAR DGDSCVLTKY YEIKPYLEIY CGGYKPLSEF
IPNSDYSKVD KNMSNIITDL RNAVNQVSVL EEQRKRFLSQ VELKAREHNI LPSVIEEFKS
KQNEMYDENG NVNERSFEVV YDKHIKLFSK EMKFMESTKS TQISLENDID TLNSRFISDY
NTRSSDSQVK RKEALQLLEA VYSKYLEVIS NLSEGSKFYN DFLVKGNGVL SECEDYLNQR
RLESRELELT ISKLLRKEVE EETRATSVGA PITKPGIWSP DQGIKFD