Gene PICST_30809 details

Gene Information

Locus tag: PICST_30809
Symbol:
ID: 4838228
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009043
Strand:
Start bp: 996945
End bp: 998663
Gene Length: 1719 bp
Protein Length: 572 aa
Translation table: 12
GC content: 44%
IMG OID: 640389543
Product: predicted protein
Protein accession: XP_001383476
Protein GI: 126133903
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG1457] Purine-cytosine permease and related proteins
TIGRFAM ID: [TIGR00800] NCS1 nucleoside transporter family
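
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is only an illustration: it assumes the start and end positions are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
# Values copied from the Gene Information fields above.
start_bp = 996945
end_bp = 998663
gene_length_bp = 1719
protein_length_aa = 572


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(length_bp: int) -> int:
    """Number of amino-acid codons, assuming one terminal stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


computed_length = span_length(start_bp, end_bp)
computed_protein = coding_codons(computed_length)
print(computed_length, computed_protein)  # prints: 1719 572
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields of the record.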


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0100169
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCATACG GCGTCAAGGT GAATAATGTC TTCGCAAAGG CAGAAGAAAG CAGCGAATCC 
GTACAAGTCC AAGAGCGCGC TGGAAACTTT TCCTCATCCT CTCTACCTGA AGAGAACAAG
CAGAACGATG TCGTAGTACA ACAACAAGAA GAAGAAGAAA AAGACTCCAA CAGCATATGG
GCCTACTTGG CAAATGCCTC GAAGAAGTTG GACTCGTTAG GAGTAGAAAC TAGAGGTATC
GAGAGAATAC AGCCATACGA AAGGTCTACC AATAAGACTA AACAACTTAT CTCTGTCATA
GGACTTTGGC TTTCTGCCTG TGGTGGGTTG AGTTCGATGT CTTCTTTCTA TTTAGGCCCG
CTTTTGTTTG AATTGGGTTT GAAAAAGACT TTGGTGTCTG GTTTAATTGG TCAAGGACTT
GGTTGTGCCA TTGCTGCCTA CTGTTCATTG ATGGGACCCA GATCTGGATG TCGTCAAATG
GTGGGAGCCA GATTTCTTTT CGGTTGGTGG TTCGTCAAGC TTGTTTCACT TGTCAGTATC
ATTGGAGTCA TGGGTTGGTC TGTTGTGAAC TGTGTTGTAG GTGGTCAAAT CTTGGCCAGT
ATCAGTGACG GAAAGATTCC TCTTGTGGTA GGTATTGTCA TAATTGCTGT CATCTCTCTT
GCAGTGGCTA TTGGTGGTAT CAAGCAGTTG TTGAAGGTGG AAACTCTTCT TGCACTTCCA
GTAAACTTTG CCTTCTTGCT TTTGTACGTT GTAGCCTCCA AGAAGTTTTC TTATCTTACC
ATGAACGATC CTATAGACGA CCATGCTACT CTTAAGGGTA ATTGGTTGTC TTTCTTTTCT
TTATGTTATT CCATTACTTC TACTTGGGGT TCCATTGCTT CTGATTACTA CATTTTGTTC
CCTGAAAACA CACCCGATTT GCATGTTTTC AGTATCACCT TCTTTGGCAT TCTTATTCCC
ACAACTTTTG TAGGTGTTGC TGGTCTTCTC ATCGGTAATG TTGCTTTGAC TTATGAGCCA
TGGGGCGATG CTTATGCTGA ATTCGGTATG GGTGGTTTGT TGAACGAAGC CTTCAAGCCA
TGGGGAGGAG GAGGAAAATT CTTGTTGATT CTTATCTTCC TCTCGTTGAT TTCCAACAAT
ATCTTGAACA CTTACTCAGC TGCCTTTGGA ATTCAATTGG CTGGACGTGT TCTTTCCCGT
ATTCCTCGTT GGCTCTGGGC CTTTGTGATC ACGGCGGTGT ATTTGGTCTG TGCCCTTGTA
GGGAGATACA AATTTGCTAC GATTTTGGGT AACTTCTTGC CTATGGTCGG GTACTGGATA
TCCATGTATT TCATAATATT GCTTGAAGAA AATATCATCT TTAGAACAGA CGCTTTCAAG
CACTTATTTA CCAAAGAATT CCCGCCAGAG TCTGAAGAAA CTGAAGGTAC ATCTAGAACC
ATAGTGATGG CTAAGAACAG TGCTAAGAAC CAACACTACA ACTTTGAAAT TTGGAACGAC
TACGACAGAT TGACACATGG CTTTGCTGCT ACAGCCTCTT TCATTGTAGG AGCTGCTGGA
GCTGCTGTGG GGATGTCACA GACATACTGG ATTGGACCTG TGGCTTTGGC TATGGGCGGT
GCGTACGGAG GAGATATAGC CATGTGGTTA TGTATGGGAT TCAGCGGAGT AGCATACCCA
GGATTAAGGT ACCTCGAGTT GAAGAAATAT GGACGTTAG
 
Protein sequence
MAYGVKVNNV FAKAEESSES VQVQERAGNF SSSSLPEENK QNDVVVQQQE EEEKDSNSIW 
AYLANASKKL DSLGVETRGI ERIQPYERST NKTKQLISVI GLWLSACGGL SSMSSFYLGP
LLFELGLKKT LVSGLIGQGL GCAIAAYCSL MGPRSGCRQM VGARFLFGWW FVKLVSLVSI
IGVMGWSVVN CVVGGQILAS ISDGKIPLVV GIVIIAVISL AVAIGGIKQL LKVETLLALP
VNFAFLLLYV VASKKFSYLT MNDPIDDHAT LKGNWLSFFS LCYSITSTWG SIASDYYILF
PENTPDLHVF SITFFGILIP TTFVGVAGLL IGNVALTYEP WGDAYAEFGM GGLLNEAFKP
WGGGGKFLLI LIFLSLISNN ILNTYSAAFG IQLAGRVLSR IPRWLWAFVI TAVYLVCALV
GRYKFATILG NFLPMVGYWI SMYFIILLEE NIIFRTDAFK HLFTKEFPPE SEETEGTSRT
IVMAKNSAKN QHYNFEIWND YDRLTHGFAA TASFIVGAAG AAVGMSQTYW IGPVALAMGG
AYGGDIAMWL CMGFSGVAYP GLRYLELKKY GR
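
The record lists translation table 12, which in NCBI's numbering is the Alternative Yeast Nuclear Code: it differs from the standard code in reading CTG (CUG) as serine rather than leucine. The sketch below shows how the gene sequence relates to the protein sequence under that table. It is a minimal illustration, not the pipeline that produced this record; the helper name `translate_table12` is hypothetical, and only the first line and the last nine bases of the gene sequence are used as input.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order for translation table 12.
# Position 19 (codon CTG) is S here; the standard code has L.
TABLE_12_AAS = "FFLLSSSSYY**CC*WLLLSPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
assert len(TABLE_12_AAS) == 64

CODON_TABLE_12 = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), TABLE_12_AAS)
}


def translate_table12(dna: str) -> str:
    """Translate DNA in frame 0 with table 12, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE_12[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line and final nine bases of the gene sequence above.
first_line = "ATGGCATACG GCGTCAAGGT GAATAATGTC TTCGCAAAGG CAGAAGAAAG CAGCGAATCC"
last_codons = "GGACGTTAG"

print(translate_table12(first_line))   # prints: MAYGVKVNNVFAKAEESSES
print(translate_table12(last_codons))  # prints: GR
```

The two outputs correspond to the first twenty and the last two residues of the protein sequence; the final codon TAG is a stop and contributes no residue.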