Gene PICST_30904 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_30904 
Symbol 
ID4838259 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009043 
Strand
Start bp1234129 
End bp1235730 
Gene Length1602 bp 
Protein Length533 aa 
Translation table12 
GC content40% 
IMG OID640389574 
Productpredicted protein 
Protein accessionXP_001383868 
Protein GI150864871 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG1457] Purine-cytosine permease and related proteins 
TIGRFAM ID[TIGR00800] NCS1 nucleoside transporter family 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.779542 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0693376 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATATCG AAAGCAAAAA GGAAAAATTT GTTATTTCCA GAGATTTTGA GAATGGAAGT 
GAAGACTCTG CTCTGTTTGC ACCTCCCAAG TATATCTCTT GGATCTACAA GTTGGATAAC
TTTGGCATCG AGACTAGAGG GATAGAAAGA GTATCCACTG AAGAAAGAAG ATCTTTGGCT
GCTAACAAAT CTTCTTTAAA TCGTTTTTTG CATGTCTTTG GACTTTGGAT TGCTGCCTGT
GGAGGTTTAA CTACGATGTC ATCCTTCTTT CTCCCCACAT TATTGTTCGG TTTGAGCATG
AAAGACAGTT TAATCTGTGG GTTGATTTCT ATGAACCTCG GATGTTTGGT TCCAGCATAC
TGTTCCACTA TGGGTCCAAA GTCTGGTTGT CGTCAGATGG TGGGAGCTAG ATTTTTGTTT
GGTCAATGGG GAGTCAAGTT CGTTTCATTA ATCTGCATTG TTGGAGGAAT TGGTTGGTCC
GTAGTCAATT GTGTTCTTGG TGGTCAGATA TTGATTGCTA TCAATAACAA TATTCCATTA
TCTGTTGGTA TTGTAATTAT AGGAGTAATC AGTTTGGTGA TTGCTATCTT TGGTATTAGA
GTTCTTCTCA ATTTCCAAAC CATCTTGTCT ATTCCCTTGA TCATAGCTTC GATTTTGTTT
TATGTTGTAG TACTTAAGAA AGTTGACTAC GTACATGAAT CCAACGTCTT GGTTGCTGAA
CAAGGATTTT CCGGATTGAC CGTTCGTGGT AACTGGTTAT CATTCTTTGC TATTGGATAC
TCTGTCACAG CTACATGGGG TTCTGGAGCT TCCGATTACT ACATCTTATT TCCCGAGGAA
ACCCCAAGCT ACCAAATTTT TATTGTGACC TTCCTTGGGA TTGCCGTACC ATCCACTTTT
GTTGCTATCA TCGGGACTAT TTGTGGCTCT ATTGCTTATG CTTATCAACC GTGGAATGAT
GCCTACAACA ATTTCGGTAT TGGAGGGTTG ATCAACGAAT GCTTCAAGCC TTGGGGAAGA
TTCGGTCTGT TTATCGTGGT TCTACTCTAT ATCTCCCTTA TCTGTAATAA CATCATGAAT
ACCTATTCTG TTGCCTTTGA GTTCCAATTG ATCGACAGAA AGCTTGCTTA TATCCCTCGT
TGGTGTTGGG CTATTGTCAT GACTATTATC TATGTTGTTT TGAGTGTCGC CGGTAAAGAA
CACTTTTCCA CAATTCTAAG CAACTTCTTA CCTATGTTGG GCTACTGGAT CTCTATGTAT
ATAACTATTC TTTTGGAAGA AAACATCTTC TTTAGATCCT CCATCAAAAC CAAGACTCTT
CACCATAACG AATTTGACGA AAAGGCCAAC AGTGATTGGA GTTACAACTG GTCAAACTGG
AACAAGCCCA AGGGTATCAC TATGGGTTTA GCAGCTGACC TTTCATTTGC AATTGGAGTT
GTTGGTGCTG TTTTGGGAAT GAACCAAGTA TATTTCCAGG GCCCAATTGC TAAGAAGATT
GGAGACTACA GTGGTGACGT AGGTATGTGG CTCTGTGGTG GGTTCACTGG AGTGGTCTAT
CCATTCTTGA GATACTGGGA ATTGAAAAGA TTTGGTCGTT AA
 
Protein sequence
MDIESKKEKF VISRDFENGS EDSASFAPPK YISWIYKLDN FGIETRGIER VSTEERRSLA 
ANKSSLNRFL HVFGLWIAAC GGLTTMSSFF LPTLLFGLSM KDSLICGLIS MNLGCLVPAY
CSTMGPKSGC RQMVGARFLF GQWGVKFVSL ICIVGGIGWS VVNCVLGGQI LIAINNNIPL
SVGIVIIGVI SLVIAIFGIR VLLNFQTILS IPLIIASILF YVVVLKKVDY VHESNVLVAE
QGFSGLTVRG NWLSFFAIGY SVTATWGSGA SDYYILFPEE TPSYQIFIVT FLGIAVPSTF
VAIIGTICGS IAYAYQPWND AYNNFGIGGL INECFKPWGR FGSFIVVLLY ISLICNNIMN
TYSVAFEFQL IDRKLAYIPR WCWAIVMTII YVVLSVAGKE HFSTILSNFL PMLGYWISMY
ITILLEENIF FRSSIKTKTL HHNEFDEKAN SDWSYNWSNW NKPKGITMGL AADLSFAIGV
VGAVLGMNQV YFQGPIAKKI GDYSGDVGMW LCGGFTGVVY PFLRYWELKR FGR