Gene PICST_31039 details


Gene Information

Locus tag: PICST_31039
Symbol:
ID: 4838197
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009043
Strand:
Start bp: 1590341
End bp: 1592263
Gene Length: 1923 bp
Protein Length: 640 aa
Translation table: 12
GC content: 39%
IMG OID: 640389512
Product: predicted protein
Protein accession: XP_001383589
Protein GI: 126134129
COG category:
COG ID:
TIGRFAM ID:
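
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of the database.

```python
# Values copied from the Gene Information record.
START_BP = 1590341
END_BP = 1592263
GENE_LENGTH_BP = 1923
PROTEIN_LENGTH_AA = 640


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length: int) -> int:
    """Residue count of an unspliced CDS, assuming one trailing stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


# Both checks should agree with the values listed in the record.
print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under those assumptions, 1592263 - 1590341 + 1 gives 1923 bp, and 1923 / 3 - 1 gives 640 aa, matching the listed gene and protein lengths.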


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0000183375
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.324549
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGAAAAC TCAAGAAAGG AAGAAGAAAC CAGAAAGCCA GACTCAATCC CATTGGAACC 
AACCCCAAGG CATCCAGCAA GGATGAGAAA AGAGACGAGA ATACGAGACA GTCCAAGATC
TTGCCCTTGA TCAACAAGTT GAAGTCTACA GTAGCAAATG ATAGATCTAT GGCTTTGGGT
GCTATTACTG TTTTGGCTGA AGACGATAGG ATGAGATCGT TGTTATTGAA AGAAAAGCTA
GTTAGTATTG TCATGGAACA GTGTTTGAAC GATGCCAACG ACGAGATCAT AGTAGAATCA
TTCGGCTTGT TGAGAAATTT GGGAATTGAG GAGGGCTATG ACGTGTTGAA ATACTACTGG
AGATCCAATA TTTGGACTGC TATTGAAGCT GCCATAGCTA AAATTCAAAC TTCATACAAG
TATTTGGTAG AAAATGGTGC CAATCCAGCA GGTAATAAAA ACGACAAGTC CAAAGTTCAA
TTGTTGTACG ATTTCACGGA GAATATTTTG TCGCTTATCA TCGTATTGGC CAGTGGTTCA
GATGATTTGT ACGAATCCAT TTTCGACAAG ATCGATCCAA TCTTGAAATT TGTCTCTGAC
TTGCTAACTT CAGAAATTAG CACGACTTCT GGATTCAAGA TTTCCACTAA ACTTTTCAAC
TCCTTATTAG AATTCATCTA TGAATTCTCC ACTGAGTCAG AAGATTTTAT CAGGAAGTTC
AGTTTGGACA TCAACTTCGA TTTTGCCAAA TTGTTTGCAT ACGTAGAAGC CAAAGAGTCC
CATTACAACA ACCTCACTAG AGTATATATC GAAGGTATCA AGTTCAACAA TTTCGAAGTA
TTGAGCCAAG GAGAAAACAA GTACGAGGTC GGCACCCAAT TCTTGTCCAA CATCTTCGAA
ATCATCGTAA CAACTGATTT AGAACTTCTC AAGAAAAACA TACATGCCAT TAAAAACCCA
GACAATTCTA GCAAGCCTAT TCAGAAGGAT GCCAACACTT TGGAAACGGA CTTGTCGAAG
GGTTACGACT TAGCATTACA AACAAAATTA GAATTGACCA CATTAGAAGT GGCCCTAGAC
TTGATTACTT CTTTGTTGGA GTACTTGTCT ACCAACGATA ACGATGAACC TCTCAATTTG
CCTACACCAA CAGTTGAGGT CTTATTAAAC AAGATCTATC CAAGTTTAGT CGAATTGAAC
AACTTTGAAA ACGCCAATAA AGGTATGTTA TCATTGCGTG AAAAGATCTT GGTAGCATTG
AACAATTTGA CGTGGCTCAT GTTGTCTAAC GAATCGCTTC CAGTTGCTTG GTTCGAAAAG
TCACTTGCAT TATGGGATTT GATCATAAGT TCAGCAAATA CTTCTGACGT TGTTTTACAG
AAAAACTGCC TCAACGTATT ATGGGGTATC ACTAAGTCGC TCGGGCCAGA AGTTAGAAGC
AAATTGCAGC CTTCTATCGT TGACGAATTG ATTGCAAAAT GCCAAACTAT TATGGAAAAG
CAAAATGAAG CCGACGAAAC TGATATAGAG TTGTACCTTT CTATTGTTGG TTTCCTTGGT
AACTTGGCTC CAGTCGTAGG CAACACTCAA ATCACGTCTA AAATTTCCCA GTTCTTGATT
ACTACCATCG AATTATTATG CAACTCTACA GTTACTACTG TCACCAGCCC GGCCAAGATA
GAAGTTGTAT TGGAGGCCAT AAACGTCATT TACGAGATTT TCGGTGATAT CGAATTTGAC
TACGACTACG AGATCTTTGT GCAACAAGGC TACTTGCAAA AATTGTCCCT TTTGGAACCA
AAGGTCAAGG AATTGTACAA GAAGATCGAT AAAAATAAGT ATCCACATCT CAAAGCTAAA
GGTGAAGAGA CGTGGATTAA TTTGGGCAGA TTTATCCAGT ATAAACAGAG TGAAAGAGCG
TAG
 
Protein sequence
MGKLKKGRRN QKARLNPIGT NPKASSKDEK RDENTRQSKI LPLINKLKST VANDRSMALG 
AITVLAEDDR MRSLLLKEKL VSIVMEQCLN DANDEIIVES FGLLRNLGIE EGYDVLKYYW
RSNIWTAIEA AIAKIQTSYK YLVENGANPA GNKNDKSKVQ LLYDFTENIL SLIIVLASGS
DDLYESIFDK IDPILKFVSD LLTSEISTTS GFKISTKLFN SLLEFIYEFS TESEDFIRKF
SLDINFDFAK LFAYVEAKES HYNNLTRVYI EGIKFNNFEV LSQGENKYEV GTQFLSNIFE
IIVTTDLELL KKNIHAIKNP DNSSKPIQKD ANTLETDLSK GYDLALQTKL ELTTLEVALD
LITSLLEYLS TNDNDEPLNL PTPTVEVLLN KIYPSLVELN NFENANKGML SLREKILVAL
NNLTWLMLSN ESLPVAWFEK SLALWDLIIS SANTSDVVLQ KNCLNVLWGI TKSLGPEVRS
KLQPSIVDEL IAKCQTIMEK QNEADETDIE LYLSIVGFLG NLAPVVGNTQ ITSKISQFLI
TTIELLCNST VTTVTSPAKI EVVLEAINVI YEIFGDIEFD YDYEIFVQQG YLQKLSLLEP
KVKELYKKID KNKYPHLKAK GEETWINLGR FIQYKQSERA
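
Both sequences are printed in blocks of 10 characters, 6 blocks per line, so the whitespace has to be removed before any analysis. The following is a minimal sketch of that cleanup together with a GC-fraction calculation and a basic open-reading-frame check. The helper names are hypothetical, and the record does not state how its GC content figure was computed, so a recomputed value may differ from the listed 39% by rounding. The stop codons used are TAA, TAG and TGA, as in translation table 12.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(blocked.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C (0.0 to 1.0)."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(1 for base in dna if base in "GC") / len(dna)


def has_start_and_stop(dna: str) -> bool:
    """True if the length is a multiple of 3, it starts with ATG, and it ends in a stop codon."""
    return len(dna) % 3 == 0 and dna.startswith("ATG") and dna[-3:] in STOP_CODONS


# First line of the gene sequence, copied from the record for demonstration.
first_line = "ATGGGAAAAC TCAAGAAAGG AAGAAGAAAC CAGAAAGCCA GACTCAATCC CATTGGAACC "
fragment = clean_sequence(first_line)
print(len(fragment))  # 60 bases: 6 blocks of 10
```

To check the whole record, paste the full gene sequence into `clean_sequence` and pass the result to `gc_fraction` and `has_start_and_stop`.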