Gene PICST_81383 details


Gene Information

Locus tag: PICST_81383
Symbol:
ID: 4837236
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009042
Strand:
Start bp: 2246870
End bp: 2249500
Gene Length: 2631 bp
Protein Length: 630 aa
Translation table: 12
GC content: 41%
IMG OID: 640388551
Product: predicted protein
Protein accession: XP_001383189
Protein GI: 150864399
COG category:
COG ID:
TIGRFAM ID:
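
The Start bp, End bp, Gene Length and Protein Length fields above can be cross-checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive (an assumption, though it reproduces the listed 2631 bp) and that the coding region is one stop codon longer than the protein. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Span length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def cds_length(protein_aa: int) -> int:
    """Coding length in bp: three bases per residue plus one stop codon."""
    return 3 * (protein_aa + 1)


if __name__ == "__main__":
    span = gene_length(2246870, 2249500)
    coding = cds_length(630)
    print(span, coding, span - coding)
```

Under these assumptions the coding region is shorter than the gene span, which is consistent with the listed gene sequence carrying untranslated sequence on both sides of the open reading frame.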


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.184243
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.358909
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
CAACAGTCAG TGCTTGCTCA TTTCATCTGC CACAGCCATA CACACTCAAC CCCGTGACTT 
CTGGTGAAGC GTAGTATCCA TAGAATAGCA ATATTGTTCG TTTCATCTGC TTCATAGAAC
CAATCTACAC AGATCACCCA ACACCACTGC CACTACCAAT AAACATCCAC CTACATTGCT
ACTGATTGCA CACGATCTCG CCATCTTTTT GTTATTGATA CTGTTCTCAT TGTAATTGTT
GTGATTGAAA TTGGAGTCTT ATCGTAATTG TTATTAATTG TTTTGTTGTA GTTTTCATTT
TTCAATCTCA GTTCAAATCA TTCAAATCAT CAATAATAAA TTCAATTTAA GTTTTCACAG
AACTGAATAT CATAGTCTTC ATTAGCTGAC CATCATCATA ATTATCATAA TTCACATTAC
ATAATGGACA ACTACAACGA CGACTTGTAT CCTTTGGCAT TGCTCATGGA CGAGTTGAAA
CATGACGATG TGTCCAACCG TGTGGAAGCC ATGCAGAAAT TGGACAATAT CGCCATCGCT
TTGGGCCCTG AACGGACGCT CAAGGAATTG TTGCCCTTTT TAAACGACGT AGCCCAGGAC
GATGAGGAAG AAGTCTTTGC TGTATTGGCT TCAAAGCTTG GAGACTTTGT CCCGCTTGTA
GGGGGCCACG AGAACTGTGA ACCATTAATT CAGATCTTAA CCATTCTCGC ATCGATGGAA
GAACCCATTG TCAGAGATAA AGCCATCGAC TCATTGTACA AGATCAGTTT AGAGTTGACT
CTCGACGAGT TGACCGGTAT ATTCTTGACA TTGATTCGTA GCTTAAGTCA AGGTAATTGG
TTTTCTAAGA AGGTAGCCAG TTGTGGTTTG TACAAAGCTG TAATTCTCAA GGTAAACTCT
TCTGCAAGAA GGGATTTGTT GAATTTGTAC TTGAAATTGG TTACTGACGA CTACCCCATG
GTCAGGAGAG CGGCAGCCAA CAACTTACCT CATCTCATCA ACCTTCTCAC GGAATTCACC
GAAGAAAAAC CCAACGACGT CAACAAGATC AACAACGAAG ACTGGGAAAT AATCTCGAAG
ATGTTCCAGC ACCTCATCAA TGACGACCAG GACTCGGTCA AATTCTTAAG TATTGATGTT
TTGATTGCCA TTCTTGAGTT CTTCCAAAAG ATCAACGAAT ACAGCTTCAA CTCCGACTTT
TTGACCAGCG CTTTGAAGTT GATCAAGGAT GAAAGTTGGA GAGTGCGTTA CACTGCTGCT
GACCGTTTCA CCAAGATCGC CAAAAACTTC ACCAATGAAG AAAGTGACTT GTTCCAGTTG
ATCGATCCTT TTATCTCGTT GATGAAAGAC AATGAGGGTG AAGTAAGAAA AGCTATCGCT
AAGCAATTGC CTAGTTTCTG TGAGCTTTTG ACCAAATACC AATCCACTAG AGCCACTATT
CTCTCTAAGA TCATCCCTGT AGTGAACGAG TTGAGCCAGG ACTCCCAAGA TAACGTCAGA
GCCTCGTTGG CATCCACCAT CACAGGCTTG TCGCCCATCT TAGAGAAGCA ATCCACCATA
GATAAGCTTT TGCCCATTTT CTTAGTAATG TTGAAGGACG AGTTCCCAGA CGTGAGATTG
AACATCATCT CCAACTTGTC TGTTGTGGAT GAAACCATTG GTATCAACCT CTTGTCGACA
AACTTGTTGC CTGCCATTAC TGAGTTGGCT CAAGACTACA AGTGGAGAGT CAGATTGGCC
ATCATCGAAT ACATTCCCAA GTTGGCTAAA CAGCTTGGTG AGTCTTTCTT CAACGATGAG
TTGTTGTCGT TGTGCATGTC GTGGTTGTGG GATCCCGTAT TTGCCATTCG TGATGCTGCC
GTCAACAACT TGAAGGATTT GACCATCATC TTTGGTTCAG ATTGGGCCAA CAACGAAATC
ATCACTCGCT TGTTGAATAA CGGCGACAAG ATTGACGAAG ACGACAAGAT CGACTACTCT
AACTTCATCA TCAGAATAAC ATGCCTATTT GCCATCACCA AGTTGATTCC CGTCGTCGAC
TACCAAATAA TAGTGAAGAA GGTATTGCCC TTCATCAACA GTTTAATCAC AGACGCTGTG
CCCAACATAA GATTCAACGT AGCCAAGTCG TACCTCATAT TGGTGGAGAC ATTTGTACGC
AACAAGAGCA AGTTGCCCAT CAAGGACGAA GAGTTGAAAA AGTTGATCAA CTTGGAAATT
CTTGCTAACT TAGAAAAGTT GCTGAACGAC ACTGATGTTG ACGTCAGATT CTACGCTAGC
AAAAGTATTC AGGGTATCCA AGACTTGTTG AACTAAAGTA GTACGACGTA AAAGAAAAGA
ACGAGAAACG AATCACTCCT TACATGATAT TATTAAACTA TCTTCCTCAT AAAGTTCATG
ACTTTTTGTA TTCATTTCAT TTGTTTCTTC ACCTGTTTTT TGTACCCTGT TTGAATAAGT
ATTATAGTCT TGCTGCTTTC TTATCTGATC ATCCTTTTCT CGGTTAACAG CAACTCATTA
TATGATAGGT GCATCAATAT TTAATTCAAC GTTGTTAGCT TTCATTTTTA TCATTTCTTT
GATACTAAGT ATATCGTTTT TGCGTTTAAA TATACATCGA AATTAATCGT G
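
The gene sequence above is printed in blocks of ten bases, so it has to be joined before its length or GC content can be compared with the Gene Length and GC content fields. A minimal sketch, assuming only that whitespace is the sole formatting in the listing; `clean_sequence` and `gc_fraction` are hypothetical helper names, not part of any database tool.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


if __name__ == "__main__":
    # First two blocks of the gene sequence listed above.
    fragment = clean_sequence("CAACAGTCAG TGCTTGCTCA")
    print(len(fragment), gc_fraction(fragment))
```

Running the same two functions on the full listing gives the values to compare against the record's fields.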
 
Protein sequence
MDNYNDDLYP LALLMDELKH DDVSNRVEAM QKLDNIAIAL GPERTLKELL PFLNDVAQDD 
EEEVFAVLAS KLGDFVPLVG GHENCEPLIQ ILTILASMEE PIVRDKAIDS LYKISLELTL
DELTGIFLTL IRSLSQGNWF SKKVASCGLY KAVILKVNSS ARRDLLNLYL KLVTDDYPMV
RRAAANNLPH LINLLTEFTE EKPNDVNKIN NEDWEIISKM FQHLINDDQD SVKFLSIDVL
IAILEFFQKI NEYSFNSDFL TSALKLIKDE SWRVRYTAAD RFTKIAKNFT NEESDLFQLI
DPFISLMKDN EGEVRKAIAK QLPSFCELLT KYQSTRATIL SKIIPVVNEL SQDSQDNVRA
SLASTITGLS PILEKQSTID KLLPIFLVML KDEFPDVRLN IISNLSVVDE TIGINLLSTN
LLPAITELAQ DYKWRVRLAI IEYIPKLAKQ LGESFFNDEL LSLCMSWLWD PVFAIRDAAV
NNLKDLTIIF GSDWANNEII TRLLNNGDKI DEDDKIDYSN FIIRITCLFA ITKLIPVVDY
QIIVKKVLPF INSLITDAVP NIRFNVAKSY LILVETFVRN KSKLPIKDEE LKKLINLEIL
ANLEKLSNDT DVDVRFYASK SIQGIQDLLN
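
The record lists translation table 12, the alternative yeast nuclear code, in which the codon CTG is read as serine rather than leucine. The sketch below builds the standard codon table in NCBI base order (T, C, A, G) and applies that single reassignment. It is an illustration, not the pipeline that produced this record, and `build_codon_table` and `translate` are hypothetical names.

```python
from itertools import product

BASES = "TCAG"
# Amino acids of the standard code, in NCBI codon order (TTT, TTC, TTA, ...).
STANDARD_AAS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def build_codon_table(ctg_as_serine: bool = True) -> dict:
    """Standard codon table, optionally with the table 12 change CTG -> Ser."""
    codons = ("".join(triplet) for triplet in product(BASES, repeat=3))
    table = dict(zip(codons, STANDARD_AAS))
    if ctg_as_serine:
        table["CTG"] = "S"
    return table


def translate(dna: str, table: dict) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        amino_acid = table[dna[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


if __name__ == "__main__":
    # In-frame fragment from near the 3' end of the coding region above.
    fragment = "TTAGAAAAG TTGCTGAACGAC ACT"
    print(translate(fragment, build_codon_table(True)))   # table 12
    print(translate(fragment, build_codon_table(False)))  # standard code
```

With the CTG reassignment the fragment yields LEKLSNDT, matching the listed protein sequence; the standard code would give LEKLLNDT instead. The first ten codons after the ATG start translate to MDNYNDDLYP under either table.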