Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_30809 |
Symbol | |
ID | 4838228 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009043 |
Strand | + |
Start bp | 996945 |
End bp | 998663 |
Gene Length | 1719 bp |
Protein Length | 572 aa |
Translation table | 12 |
GC content | 44% |
IMG OID | 640389543 |
Product | predicted protein |
Protein accession | XP_001383476 |
Protein GI | 126133903 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG1457] Purine-cytosine permease and related proteins |
TIGRFAM ID | [TIGR00800] NCS1 nucleoside transporter family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.0100169 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCATACG GCGTCAAGGT GAATAATGTC TTCGCAAAGG CAGAAGAAAG CAGCGAATCC GTACAAGTCC AAGAGCGCGC TGGAAACTTT TCCTCATCCT CTCTACCTGA AGAGAACAAG CAGAACGATG TCGTAGTACA ACAACAAGAA GAAGAAGAAA AAGACTCCAA CAGCATATGG GCCTACTTGG CAAATGCCTC GAAGAAGTTG GACTCGTTAG GAGTAGAAAC TAGAGGTATC GAGAGAATAC AGCCATACGA AAGGTCTACC AATAAGACTA AACAACTTAT CTCTGTCATA GGACTTTGGC TTTCTGCCTG TGGTGGGTTG AGTTCGATGT CTTCTTTCTA TTTAGGCCCG CTTTTGTTTG AATTGGGTTT GAAAAAGACT TTGGTGTCTG GTTTAATTGG TCAAGGACTT GGTTGTGCCA TTGCTGCCTA CTGTTCATTG ATGGGACCCA GATCTGGATG TCGTCAAATG GTGGGAGCCA GATTTCTTTT CGGTTGGTGG TTCGTCAAGC TTGTTTCACT TGTCAGTATC ATTGGAGTCA TGGGTTGGTC TGTTGTGAAC TGTGTTGTAG GTGGTCAAAT CTTGGCCAGT ATCAGTGACG GAAAGATTCC TCTTGTGGTA GGTATTGTCA TAATTGCTGT CATCTCTCTT GCAGTGGCTA TTGGTGGTAT CAAGCAGTTG TTGAAGGTGG AAACTCTTCT TGCACTTCCA GTAAACTTTG CCTTCTTGCT TTTGTACGTT GTAGCCTCCA AGAAGTTTTC TTATCTTACC ATGAACGATC CTATAGACGA CCATGCTACT CTTAAGGGTA ATTGGTTGTC TTTCTTTTCT TTATGTTATT CCATTACTTC TACTTGGGGT TCCATTGCTT CTGATTACTA CATTTTGTTC CCTGAAAACA CACCCGATTT GCATGTTTTC AGTATCACCT TCTTTGGCAT TCTTATTCCC ACAACTTTTG TAGGTGTTGC TGGTCTTCTC ATCGGTAATG TTGCTTTGAC TTATGAGCCA TGGGGCGATG CTTATGCTGA ATTCGGTATG GGTGGTTTGT TGAACGAAGC CTTCAAGCCA TGGGGAGGAG GAGGAAAATT CTTGTTGATT CTTATCTTCC TCTCGTTGAT TTCCAACAAT ATCTTGAACA CTTACTCAGC TGCCTTTGGA ATTCAATTGG CTGGACGTGT TCTTTCCCGT ATTCCTCGTT GGCTCTGGGC CTTTGTGATC ACGGCGGTGT ATTTGGTCTG TGCCCTTGTA GGGAGATACA AATTTGCTAC GATTTTGGGT AACTTCTTGC CTATGGTCGG GTACTGGATA TCCATGTATT TCATAATATT GCTTGAAGAA AATATCATCT TTAGAACAGA CGCTTTCAAG CACTTATTTA CCAAAGAATT CCCGCCAGAG TCTGAAGAAA CTGAAGGTAC ATCTAGAACC ATAGTGATGG CTAAGAACAG TGCTAAGAAC CAACACTACA ACTTTGAAAT TTGGAACGAC TACGACAGAT TGACACATGG CTTTGCTGCT ACAGCCTCTT TCATTGTAGG AGCTGCTGGA GCTGCTGTGG GGATGTCACA GACATACTGG ATTGGACCTG TGGCTTTGGC TATGGGCGGT GCGTACGGAG GAGATATAGC CATGTGGTTA TGTATGGGAT TCAGCGGAGT AGCATACCCA GGATTAAGGT ACCTCGAGTT GAAGAAATAT GGACGTTAG
|
Protein sequence | MAYGVKVNNV FAKAEESSES VQVQERAGNF SSSSLPEENK QNDVVVQQQE EEEKDSNSIW AYLANASKKL DSLGVETRGI ERIQPYERST NKTKQLISVI GLWLSACGGL SSMSSFYLGP LLFELGLKKT LVSGLIGQGL GCAIAAYCSL MGPRSGCRQM VGARFLFGWW FVKLVSLVSI IGVMGWSVVN CVVGGQILAS ISDGKIPLVV GIVIIAVISL AVAIGGIKQL LKVETLLALP VNFAFLLLYV VASKKFSYLT MNDPIDDHAT LKGNWLSFFS LCYSITSTWG SIASDYYILF PENTPDLHVF SITFFGILIP TTFVGVAGLL IGNVALTYEP WGDAYAEFGM GGLLNEAFKP WGGGGKFLLI LIFLSLISNN ILNTYSAAFG IQLAGRVLSR IPRWLWAFVI TAVYLVCALV GRYKFATILG NFLPMVGYWI SMYFIILLEE NIIFRTDAFK HLFTKEFPPE SEETEGTSRT IVMAKNSAKN QHYNFEIWND YDRLTHGFAA TASFIVGAAG AAVGMSQTYW IGPVALAMGG AYGGDIAMWL CMGFSGVAYP GLRYLELKKY GR
|
| |