Gene Information |
Locus tag | PICST_39453 |
Symbol | |
ID | 4851668 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009068 |
Strand | + |
Start bp | 2497845 |
End bp | 2498966 |
Gene Length | 1122 bp |
Protein Length | 373 aa |
Translation table | |
GC content | 48% |
IMG OID | 640393376 |
Product | predicted protein |
Protein accession | XP_001386814 |
Protein GI | 126275209 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG1957] Inosine-uridine nucleoside N-ribohydrolase |
TIGRFAM ID | |
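The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is a minimal illustration, not part of the record. It assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 2497845
END_BP = 2498966


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residues encoded by a CDS, assuming one trailing stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


cds_bp = gene_length(START_BP, END_BP)
print(cds_bp)                  # 1122, matching "Gene Length"
print(protein_length(cds_bp))  # 373, matching "Protein Length"
```

Under these assumptions both derived values agree with the Gene Length and Protein Length fields, which is consistent with the gene being reported as unspliced.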
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.0480355 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTGCAAA TTTCAAAGGT TCTTGTAGCA GCTTCAGTTG CTGGTTTGAC CGCTGCTAAA AAGGTGTTTA TTGACAACGA TGGATTGGCT CCCTTGCAAG TTTTGTTTCC TTTGTTGGCC GGTTGGGAAG TCCTCGGTAT CTCTACCTCG TTCGGTTCCT CTTCTACAGT GGATTCTGCA GGTGCAGCCT ACGACGTGTT AACAGCCTAC AACTTGACTT CTTGTATTCC ACACTATGTA GGTGCCCAAC AGCCATTGTT GAGAACTCAG GACACCTTTG ACACCTGGCA ATCCCTCTTC GGAGAATTGG TATGGCAAGG TGCCTTCGCT CCTTCCTACG AAGATCTTTA CTCTTGGGAC AATATCACCT ACAACGACTC TGTCCCAGGT GCCGTAGCCT TGATTGAAGC CGTCAAGGCC AACAAGGATA CTGACCCTGT CTATATCTAT GCTGCAGGTA TGATGACAAC CGTTGCTCAG GCCATTTCCC TCTACCCAGA CCTCGTCAAG GATGCTGCCG GTTTGTACAT TATGGGAGGG TATTTTGATC AACAATTCGC AGCCGGCACT GGAACTCCTA TTGTCAATGA CATAAACACC GACATCAACT TGATGCAAGA TCCAGAAGCT GCCCAAATCG TCTTGACTGC CAACTGGACT GAATTGTACA TCGGTGCCAA CGTCACCAAC TACTTGGTTC CATCCCAAGA ATTGTACGAC AGACTCATCA CCAAGGCCGG TGGCTACAGT GTGTTGGAAG AAAACTCCTA CTTAGAACCA GTCTTGAACT TGGTTGCTAC GGGAAACTAC ACTGAAAATA CTTCTGAACA GACCCTTCCA TTCTGGGACG AAGTAGTCTC TGCCTTCATG GTGTGGCCAG ACATGGTTCA AAGCACAACA AACTTCTCTG TAGCTGTGGA CACGCAGTTC TACTCTCCAT TCTACGGAAG TTTGAGAATC TGGGGTTCTG AGTTTGCTCC AAAGGGCCAA ATCACCGGTA ATGCCACCAT CGTCAACAAG ATCGACGACA GCAGATTCTA CGACTTATTG GTTTCTACAT ACTTCATGGA CTGGAGACAG TATTGTGAAG TTGGCGGTCC AGTCACTTTA GAAGGCTACT AA
|
Protein sequence | MVQISKVLVA ASVAGLTAAK KVFIDNDGLA PLQVLFPLLA GWEVLGISTS FGSSSTVDSA GAAYDVLTAY NLTSCIPHYV GAQQPLLRTQ DTFDTWQSLF GELVWQGAFA PSYEDLYSWD NITYNDSVPG AVALIEAVKA NKDTDPVYIY AAGMMTTVAQ AISLYPDLVK DAAGLYIMGG YFDQQFAAGT GTPIVNDINT DINLMQDPEA AQIVLTANWT ELYIGANVTN YLVPSQELYD RLITKAGGYS VLEENSYLEP VLNLVATGNY TENTSEQTLP FWDEVVSAFM VWPDMVQSTT NFSVAVDTQF YSPFYGSLRI WGSEFAPKGQ ITGNATIVNK IDDSRFYDLL VSTYFMDWRQ YCEVGGPVTL EGY
|
| |
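The sequences above are displayed in space-separated blocks of 10 characters. As a rough sketch of how such a field could be processed, the code below joins the blocks, computes GC content, and translates the first 30 bases of the gene sequence. The Translation table field of this record is blank, so the standard genetic code is used here as an assumption. The helper names are hypothetical, and only the first three blocks of each sequence are copied in to keep the example short.

```python
from itertools import product

# Standard genetic code in TCAG order (an assumption: the record's
# "Translation table" field is empty, so the table is not confirmed).
BASES = "TCAG"
STANDARD_AA = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), STANDARD_AA)
}


def join_blocks(text: str) -> str:
    """Remove the whitespace between the 10-character display blocks."""
    return "".join(text.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons from position 0; '*' marks a stop codon."""
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# First three display blocks of the gene sequence in this record.
GENE_PREFIX = "ATGGTGCAAA TTTCAAAGGT TCTTGTAGCA"

dna = join_blocks(GENE_PREFIX)
print(translate(dna))  # MVQISKVLVA, the first block of the protein sequence
```

None of the first ten codons is CTG, so this prefix check does not depend on whether the organism decodes CTG differently from the standard code. Translating the full sequence would require confirming the correct translation table first.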