Gene PICST_81668 details


Gene Information

Locus tag: PICST_81668
Symbol:
ID: 4837402
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009042
Strand:
Start bp: 1812055
End bp: 1815229
Gene Length: 3175 bp
Protein Length: 995 aa
Translation table: 12
GC content: 44%
IMG OID: 640388717
Product: predicted protein
Protein accession: XP_001383119
Protein GI: 150864344
COG category:
COG ID:
TIGRFAM ID:
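
The length and coordinate fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end coordinates are 1-based and inclusive (an assumption, though it reproduces the listed gene length). The function and variable names are illustrative only. The record does not say what the nucleotides beyond the protein-coding requirement are; the gene is marked as spliced, which would be consistent with non-coding sequence inside the span.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive start and end coordinates."""
    return end_bp - start_bp + 1


start_bp, end_bp = 1812055, 1815229  # "Start bp" / "End bp" from the record
protein_aa = 995                     # "Protein Length" from the record

length = gene_length(start_bp, end_bp)
coding_nt = 3 * protein_aa           # nucleotides needed to encode 995 residues

print(length)              # 3175, matches "Gene Length"
print(coding_nt)           # 2985
print(length - coding_nt)  # 190 nt not accounted for by the codons alone
```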


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
TGTGAATCTT TTCCGTACTC TTGATTACTT TTATTGACTT TTTCTTTCTT CCATTCTTCC 
ATTCTTCTTC CATTCTTCTA CATTAGCCCC AATTCCCTAG TCAACGAAGA TGGCTACGAA
CTTCCATCTC TACCACCCCT CGCTCAGACG CGATCGAGAA GAGTCGGCAA CCGGTGGAGA
CCTTAAGACT ACTCCGACGA ATACCTCTTC CAGCAACAAC AGCAACAGCA ACAGCACAAG
TCTATTTGGA AGCAGCCCAA AGCACACTCT GCATTCTCAT GCACATCATA GAGGATTCTC
GTTCGACAAG ATCAAGAGCA AGGTGAGTCG TCGCTCGCTG GAGCCAGAAG TGATACCAGG
TTCTCCAGAC AGAAGAAACT CTGCTTTTCC CATGATTGTG ATCTCAGAAT CAAATGAGAC
AAGTCGTAAC CAAAGTCAGA ACGAAAACCA AAACCAAATG AATGTCTCTG CAGCAAATAC
AAGACGTCTC AGAGCTCAGT CGCTTTCGCT CATCCACAGC CAGTCCCGGT CCCGTTCTGG
GTCCCGTTCC AGAAGTCTTG TTCCCAACGG AAACTCAACT TCTCGGAAAC AAACGAAGGA
ACTCGTGAAG TTAGAAACGG CTCACATAAT CCTGAAGAAA TTGGAATCGA TTTTGCTGGA
TCTCGGACTT CAGTCTCCTA TACCGTTGAA AGCTACCAAC AACACCTCCA GTGGCTCCAT
AGCTAAGTCG GTCAAGGTAT ACATCGCCAA CACAAACGAT TGCATCTTCT TAGCACCAGC
ATCTTCCGCC AGTTTCACCT ATGAAGACGT CGAGAACGGG GGTGCAATTC CACATGATGA
TGAAGACGAC GACGAAGGAA TGGATAGTCT CGTTGTAGAC TCTGGCAGAA GCAATAGCAA
CTTCACTTCC ATACGACGTG GATCCGTAAT CTCAGATGAT GAGTCTGCTG TAACGTCAGA
CGAGGAAGCG GTTCTGCCAG ACTTGGCTAC ACCCAGAAGA CTTAAGAAAA AAATGAGGTT
CTTCAACTCG CCCAACTATC TCTGCACCAA GATCGATTCA GACATGCCGA TCCCACACAC
TTTTGCTGTC GTAATCGAAC TCGAAAAGGA TTCCACTTCT GTGAGAGACG TAAAGTTCGA
TTTTCTGTCA GTCACGAATA TCTTGTGGCC ATCTGGAGAT CCTTATAGCC GAACCCATTC
AAAGGAGCGG TTCAAGATTG GCAGTATGGA GTGGTCTACA AGTCTCGGAG ATTCCGATTT
CTACATTAAC ACCAACAACT CGAACGATGT ACGCATCAAA AATATCACTC CTGATGATCT
CGCCAGAAGG ACAAGAGAAT ATAAACTCGT GAATATCCGC AATCTTGCTG ATGGAACAGA
CAACGCCAAC ACCAGTCGTA AAAACTCTAT CTCACTAGAC TTTAACGATC TGCCCTTAAA
TACTCATGGC AATAGCAATG GTCATAGCAG TAATAGCCAT GGAGGGAACA GTAATAGTAG
TGAAGTGTAC AAGGCTGGTC TTTATGTGTT TCTCTTGCCG ATTATCTTAC CTCAGCATAT
CCCTCCTACG ATCATCTCAA TCAACGGCAC GCTTTTGCAC ACATTGAGTA TCAACTTCAA
CAAGACAAGT GACATGCTCA ACAGAAAGGT CAAAGTCTGC TCCACGTACA ATTTGCCTAT
GGTGAGGACG CCTCCCTCAT TTGCTAATTC GATTGCTGAT AAGCCCATCT ACGTTAACCG
TGTATGGAAC GATTCGTTGC ATTACATCAT AACTTTTCCC AAAAAGTATG TGTCGTTGGG
ACTGGAACAT GTAGTCAATG TAAAACTTGT TCCGCTAGTC AAAGACGTTA TTATCAAGCG
CATCAAATTC AATGTTCTAG AGAGAATAAC ATATGTCTCT AAGAACTTGT CTAAGGAGTA
CGACTTTGAC AGTGATGATC CTTATTGTGT GAAAGCACAT TCATCGGACA ATAGAACTAG
AGAAAGAGTC GTTTCTCTTT GTGAGTTAAA GACAAAATCG AAACAGAACA GTATGATAGG
GGTTCCTGGA GATCCTTATA AGGAGGAAGT TGTCAAGTGT CCCGACAATA ACTTGTTGTT
TTCATGCTAT GAACCTGATG AGTATGAAAA CTTTAGACTA GAGGACTCGA ATACAAACAA
GCGTAAGGGA AAAGACAAGG AAGAGACACC TACGATGATT GCTTCACCTC TCGATATCAA
CATAGCTTTG CCTTTTTTAA CCACCAGAAT GGACAAGACC ATGATGACTA GTACAGAAGA
AGATCCAGCG CATCTTCATA GAAGTTCTGT TTCCAGAAAG GCTTCTATCA CTACTGAAAG
TTTGAACAGT ACTTCTGGCG GAGGCTCGCC TTCTTTCCAG CCTACTTCTC CCATAATAGG
GGCTTTGGAA ACAAACCTTT CCCATAGACA TAGCATGGAT TCGTACGATC CCGTTTCGTC
TGACTACATA AAACCAAACT CTTCAATGTA CTTGTCAGAT GACAACAGCG CGAAACTGAC
GCCTCCTGAA AACATTCAGA AAGGGTTTAC GTTGGTTTCA AAGGCTTTGT ATCCTGATTC
TAACTTCAGA CACATCCAGA TCAGCCACAG ATTGCAGGTA TGTTTCCGGA TTTCGAAGCC
AGATCCGAAG GACGGCTTCA AGATGCACCA TTACGAGGTT GTGGTCGATA CGCCTTTGAT
TCTCTTGAGT TCCAAGTGTA ATGAGGGATC AATCCAGTTG CCCAAGTATG ACGACCTAGA
AGGTGTTTTT TCTACTGTAG ATACCGAAAT CTCGTTCAGA ACACCCGACT TTGAAAGAAA
CGGAATTTCC ATCAAAAGGC TCGATGAAAA TAGCTCTGTT GAACCATTGC CTTCTTTTGA
AGAAGCCACT TCTTCACCTT CGTCGCCGAT TACCAGATCA ATATCCATAG GTGAAGATCC
ATTGAGCAGA ATTCCATCAA ATAACTTAAT CCCTCTGTCA AATCCGTACC CAGACGAGCC
GGCTCCAGCG TATGAACGTT CTCTGACAAC TTCTAACCAT GGAAGCCGTA ACAACAGCTT
TGTAGCTTCG TCAAATATTG ACGAAGTTGT CAACAGCGAT TCCAACAATA GTTCTTCTTC
TAGCTTGAGA AGGTCAACGT TGAGAAATTC GTTGCTGCAT TCCTTTGCTC CGTCT
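
A GC content figure like the one listed in the record can be recomputed from a sequence block in this layout. The following is a minimal sketch, not the database's own method: `gc_percent` is a hypothetical helper that strips whitespace, counts only A, C, G and T, and ignores any other character (that last choice is an assumption). A short made-up sequence is used for the demonstration.

```python
def gc_percent(sequence_block: str) -> float:
    """Percentage of G and C among the A/C/G/T characters of a sequence block.

    Spaces, line breaks and any non-ACGT characters are ignored.
    """
    bases = [c for c in sequence_block.upper() if c in "ACGT"]
    if not bases:
        raise ValueError("no A/C/G/T characters found")
    gc_count = sum(1 for c in bases if c in "GC")
    return 100.0 * gc_count / len(bases)


# Toy input in the same grouped layout as the record (not real data).
example = "ATGC GGCC\nAATT"
print(gc_percent(example))  # 50.0
```

Pasting the full gene sequence block into `gc_percent` gives the value to compare with the rounded percentage in the record.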
 
Protein sequence
MATNFHLYHP SLRRDREESA TGGDLKTTPT NTSSSNNSNS NSTSLFGSSP KHTSHSHAHH 
RGFSFDKIKS KVSRRSSEPE VIPGSPDRRN SAFPMIVISE SNETSRNQSQ NENQNQMNVS
AANTRRLRAH LVPNGNSTSR KQTKELVKLE TAHIISKKLE SILSDLGLQS PIPLKATNNT
SSGSIAKSVK VYIANTNDCI FLAPASSASF TYEDVENGGA IPHDDEDDDE GMDSLVVDSG
RSNSNFTSIR RGSVISDDES AVTSDEEAVS PDLATPRRLK KKMRFFNSPN YLCTKIDSDM
PIPHTFAVVI ELEKDSTSVR DVKFDFSSVT NILWPSGDPY SRTHSKERFK IGSMEWSTSL
GDSDFYINTN NSNDVRIKNI TPDDLARRTR EYKLVNIRNL ADGTDNANTS RKNSISLDFN
DSPLNTHGNS NGNSNSSEVY KAGLYVFLLP IILPQHIPPT IISINGTLLH TLSINFNKTS
DMLNRKVKVC STYNLPMVRT PPSFANSIAD KPIYVNRVWN DSLHYIITFP KKYVSLGSEH
VVNVKLVPLV KDVIIKRIKF NVLERITYVS KNLSKEYDFD SDDPYCVKAH SSDNRTRERV
VSLCELKTKS KQNSMIGVPG DPYKEEVVKC PDNNLLFSCY EPDEYENFRL EDSNTNKRKG
KDKEETPTMI ASPLDINIAL PFLTTRMDKT MMTSTEEDPA HLHRSSVSRK ASITTESLNS
TSGGGSPSFQ PTSPIIGALE TNLSHRHSMD SYDPVSSDYI KPNSSMYLSD DNSAKSTPPE
NIQKGFTLVS KALYPDSNFR HIQISHRLQV CFRISKPDPK DGFKMHHYEV VVDTPLILLS
SKCNEGSIQL PKYDDLEGVF STVDTEISFR TPDFERNGIS IKRLDENSSV EPLPSFEEAT
SSPSSPITRS ISIGEDPLSR IPSNNLIPSS NPYPDEPAPA YERSSTTSNH GSRNNSFVAS
SNIDEVVNSD SNNSSSSSLR RSTLRNSLSH SFAPS
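
The record lists translation table 12, the NCBI alternative yeast nuclear code, in which the codon CTG is read as serine rather than leucine. The sketch below illustrates the effect on two short in-frame fragments copied from the gene sequence above. The table is built by hand from the standard code with the single CTG change; the helper names are hypothetical, and start-codon differences between the tables are not modelled.

```python
BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ...
STANDARD_AA = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)


def build_table(amino_acids: str) -> dict:
    """Map each of the 64 codons to its one-letter amino acid."""
    codons = [a + b + c for a in BASES for b in BASES for c in BASES]
    return dict(zip(codons, amino_acids))


STANDARD = build_table(STANDARD_AA)
TABLE_12 = dict(STANDARD, CTG="S")  # table 12: CTG encodes Ser instead of Leu


def translate(dna: str, table: dict) -> str:
    """Translate an in-frame DNA string, ignoring whitespace and any trailing partial codon."""
    dna = "".join(dna.split()).upper()
    usable = len(dna) - len(dna) % 3
    return "".join(table[dna[i:i + 3]] for i in range(0, usable, 3))


start_fragment = "ATGGCTACGA ACTTCCATCT CTACCACCCC"  # begins at the ATG
ctg_fragment = "AAGCACACTC TGCATTCTCA TGCACATCAT"    # contains one CTG codon

print(translate(start_fragment, TABLE_12))  # MATNFHLYHP
print(translate(ctg_fragment, TABLE_12))    # KHTSHSHAHH
print(translate(ctg_fragment, STANDARD))    # KHTLHSHAHH
```

Both table 12 outputs match the corresponding stretches of the protein sequence above, whereas the standard code would place a leucine at the CTG position.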