Gene PICST_29713 details


Gene Information

Locus tag: PICST_29713
Symbol:
ID: 4836878
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009042
Strand:
Start bp: 941537
End bp: 943930
Gene Length: 2394 bp
Protein Length: 770 aa
Translation table: 12
GC content: 43%
IMG OID: 640388193
Product: predicted protein
Protein accession: XP_001382951
Protein GI: 150864217
COG category:
COG ID:
TIGRFAM ID:
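
The coordinate and length fields above can be cross-checked with a short calculation. The sketch below is illustrative only: it assumes the coordinates are 1-based and inclusive, and that the gene span includes one stop codon, neither of which the record states explicitly. The variable names are hypothetical.

```python
# Values copied from the record above.
start_bp = 941537
end_bp = 943930
gene_length_bp = 2394
protein_length_aa = 770

# Assumption: coordinates are 1-based and inclusive.
span_bp = end_bp - start_bp + 1

# Assumption: the coding sequence is the protein plus one stop codon.
cds_length_bp = (protein_length_aa + 1) * 3

# The gene is marked as spliced, so any remainder would be non-coding.
non_coding_bp = gene_length_bp - cds_length_bp

print(span_bp, cds_length_bp, non_coding_bp)
```

Under these assumptions the computed span equals the listed gene length, and the remainder after subtracting the coding sequence is consistent with the gene being marked as spliced.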


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.585457
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCTGAAG CTCCCAACGA TGTCATCAAG TTAACCCGAC TGTAAGTACA AACTCTTTTT 
CTACTCTAGC CTGCTATATT CTATTCTTAT CTTCTGTGCA ACCGTACTAA CAATAATTCT
AGCCACACCA ACCCAAAGAA GGGTCACAAG CTCGGCTCAG CTTTGGCAAC ATGGAAGAAG
ATCAAATCGG CTGCCGCTGC TACTGGTACT GCTGCTGCTG AAACTGAGTC AACTTCTACT
GATGAGCTTC TAAGTGATCT AAGTACGGAA AACTCACTCT TTGAGTACGG TAAGAATGGC
GAGATCATCA TCGTCTACCG TGACAGCCGA ATCTTGCAAG TCTGGGACTG GAAAGCGCCT
TCCGAGCAAA CGAGAACTGA ATCGGAAACA TTGTACTTGA ACTTCCAGAC CAAGATCGAA
GTAATGCCTT CAAAGAACCA GTTATCGATC ATAACGTTCA AGCTACTCGA CCTACCTGTA
AATGATCCCA ACCTGATCGC GGTCGTTTTA CTTGTCAAAA CACAAACTGA AACAACAGTT
AAGTATTCAT TGATCACTAA GAAAATCAAC TTTTCTTCAT CTTTCCACAA CTCCCAAGCT
GTAGAATTGT CCCCGGTTTT TCAAAACCTG ACAGATTTCT CGTTGAAAGC CAGCAAGAAG
TTTGTCGTGG TCGCCAACAA CGAAGGATTT ATCTATATAT ATCGCTATAA TGTTGCTGAT
TTCAAATTAA CTCCAGCCAA CGCCGACTTC AGCTTGCCGT CTATACGAAA GAGCTCAACT
CAACCCGTGA TCAGCAGACA CCATCTTCAG TCTGTAGGTG ATGGAGATAT CCTTTTGCAG
ACCAGTTGTA ACCAAGATAA CTGTCCAATA TTTGATATCG AAGACAACTG GCTCGTCTAC
TCGCCTACAA AGTTCGAGTA CAAACACTTG AAAGCCATCA GTAGTTCCGC GCCTAGTGTT
TCAGCGAACC CCATGGCTCA GGATCCAGTG ATAACTCTTC CGCTGAATAA CGAAACCGTA
TCCACTCATA GCAATCTCTA TACTCCGGTG AAATTGCCGG CTTCAGGGCC ATTGTTGAAC
AAACTATTGT CAACAATATC AAACACCGCA TTAGATGGAC TTTTCCGGTT ATCTGAAATC
AGTTCTTCCA AGGTAAAGTC GTATATGAAC TCAAAGAGTA AAGAAACCTA TAAGGCACCA
ACGATTAATT CTATCAGCAA ATCGCTAGGT AAACTCTTGT ATTCTACAGC TTCTACTACA
GCTACAACAT TAGAGAATAG CACGAGAAGC TTAAAACCCA ACAACAACCA GATTATCAAA
GTCATAGATC TTTCCAACGA CAAGGTTTTA GGCGTCTTCA AGCCTTTGGG AGGTGTTTCC
AACGTTTCGC TCTCGCCTTA CGACTTACAT CTTGTACATT CGAACTATAG AGGAGACACT
TTGTTCATGT GGGATTTGTA TAGATTGCCC AGTGAAGTGT CCTTGATAGG CAAGTTCACC
AGAGGAAAGA CATCAGCTAT CATTGAAGAA ATCTTCTGGT TCAACAACAA CTACGGAGAT
CAAATTAATA GTAGTAGTTC TGGAAACTCA AATAACGAGC CTAGCATCAA GGGTATGAAC
TCGGGCTTTG GGTGTATTAC CAAGTCTACT GGTTCTGTTC ATTGGTTCAA CATCAACTAC
TTGTCTGGTA ATATGAACAA CAATTTCCCC AACAGTCTAA ATAAGGAGAA GGTGCGAAGA
AATCTGCAAT CGAGCCAGTT CTTGGACTCG TGGATTTTGT CGTCGTTGAA AGCTCGCAGA
TTTGTTGCTT TGCCTGAACT TTGTAACTCC ATTGCACCCA CTGCATCCTT CGGGGACTCT
GATCCTGGAT GTGTAGCAAA TCGTCTTGCT ATTAACCAGT TGGCAATTAT CGATAGCGAT
AACCAGTTGA AGCTCATCTC GACTTTGAAT GGAAGACATC TCTACAAGTA CGAGTTGCCT
ATTGCGCCAG TAGCTGAATC GTTCATACCT TTTAATTCTC GACGGGCAGA AGCTAAAGTG
GAAGATAGCA AGGATCGCGT CAATCCGTTA TCACAGGCTG AGATCGAAAC GAGTGTCCCT
TTCTTGAACT TGATCAACAA CAAGAACATC GAGTTTGCTG TGTTTTCTTT TGAAGGAGAG
GAAGGAGATA GAAACAACTT TTTCCACTGC TTCAAGGAGT TTGGCAACGA TGTTCCAGAA
AAGGTGATCA AGTTTGAGAA TGGAAACCAT CGGTCAAATA AGATAATCTT TGACTTGAAG
AAGGACGAAG ACGTAAAGCC CGAGGACAGA TTGGCATTGC TCGATGGGTT GTATATTGAC
CAAGGTGAAG GAAGCATCGA AGCCCAGAGC AGTCCTGTTG TAGATGACCA GTAG
 
Protein sequence
MPEAPNDVIK LTRLHTNPKK GHKLGSALAT WKKIKSAAAA TGTAAAETES TSTDELLSDL 
STENSLFEYG KNGEIIIVYR DSRILQVWDW KAPSEQTRTE SETLYLNFQT KIEVMPSKNQ
LSIITFKLLD LPVNDPNSIA VVLLVKTQTE TTVKYSLITK KINFSSSFHN SQAVELSPVF
QNSTDFSLKA SKKFVVVANN EGFIYIYRYN VADFKLTPAN ADFSLPSIRK SSTQPVISRH
HLQSVGDGDI LLQTSCNQDN CPIFDIEDNW LVYSPTKFEY KHLKAISSSA PSVSANPMAQ
DPVITLPSNN ETVSTHSNLY TPVKLPASGP LLNKLLSTIS NTALDGLFRL SEISSSKVKS
YMNSKSKETY KAPTINSISK SLGKLLYSTA STTATTLENS TRSLKPNNNQ IIKVIDLSND
KVLGVFKPLG GVSNVSLSPY DLHLVHSNYR GDTLFMWDLY RLPSEVSLIG KFTRGKTSAI
IEEIFWFNNN YGDQINSSSS GNSNNEPSIK GMNSGFGCIT KSTGSVHWFN INYLSGNMNN
NFPNSLNKEK VRRNSQSSQF LDSWILSSLK ARRFVALPEL CNSIAPTASF GDSDPGCVAN
RLAINQLAII DSDNQLKLIS TLNGRHLYKY ELPIAPVAES FIPFNSRRAE AKVEDSKDRV
NPLSQAEIET SVPFLNLINN KNIEFAVFSF EGEEGDRNNF FHCFKEFGND VPEKVIKFEN
GNHRSNKIIF DLKKDEDVKP EDRLALLDGL YIDQGEGSIE AQSSPVVDDQ
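
Both sequences above are printed in space-separated blocks of ten, so they need to be joined before any downstream use. The following is a minimal sketch of that cleanup together with a simple GC-fraction calculation; the function names `clean_sequence` and `gc_fraction` are hypothetical helpers, not part of any database tooling, and the example uses only the first line of the gene sequence.

```python
def clean_sequence(block_text: str) -> str:
    """Join a block-formatted sequence into one uppercase string."""
    return "".join(block_text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C among all bases; 0.0 for an empty string."""
    if not dna:
        return 0.0
    return sum(base in "GC" for base in dna) / len(dna)


# First line of the gene sequence above, used as a small example.
first_line = "ATGCCTGAAG CTCCCAACGA TGTCATCAAG TTAACCCGAC TGTAAGTACA AACTCTTTTT"
dna = clean_sequence(first_line)
print(len(dna), round(gc_fraction(dna), 2))
```

Applying the same two helpers to the full gene sequence would give a value comparable to the GC content field in the record, assuming that field was computed over the same span.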