Gene Information |
Locus tag | PICST_41479 |
Symbol | |
ID | 4837120 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009042 |
Strand | + |
Start bp | 2737590 |
End bp | 2739395 |
Gene Length | 1806 bp |
Protein Length | 601 aa |
Translation table | 12 |
GC content | 42% |
IMG OID | 640388435 |
Product | predicted protein |
Protein accession | XP_001382759 |
Protein GI | 126132468 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG3104] Dipeptide/tripeptide permease |
TIGRFAM ID | |
|
|
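The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive positions and that the unspliced CDS ends in a stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 2737590
end_bp = 2739395

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the gene is unspliced ("Is gene spliced | No") and its
# final codon is a stop codon, which does not contribute an amino acid.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints: 1806 601
```

Under these assumptions the result agrees with the listed Gene Length (1806 bp) and Protein Length (601 aa).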
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 28 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GGTGATTACG ACTACAATGA CGCAAACAAC TACTCCACCC ACTATGTTGA TGAATACAAC CCAAAGGGTT TGAGAGTCCC AACTGACGAA GAATCTCAAT CCCTCAGAAG GATTTTGGGT AGAGCTTCTT ATGCTTCTTA CTTGATCTGT TTGTGTGAGT TGGCTGAAAG AGCCTCTTAC TATTCGGTCC AGGGTATCTT GTCTAACTTT ATTCAAAGAC CTATGCCTGA AAATTCCCCT CACGGATGGG GTGCACCAGC TGACAGAAAC TCGAATGTTT CTGCCGGTGC TTTGGACCAA GGTCTTCAAG CTGCTAACGC CCTTACCCTT TTGCTTACTT TCCTTGCTTA CGTTGTACCA TTATATGGTG GTTTCATTGC CGATACCAAG ATTGGTAAGT TCAAAGCTAT TTGGGTTGGT GTTATCGCTG GTTTTGTTTC TCACGTTTTG TTCGTTATCG CAGCTATCCC ATCTGTCCTT AAGAACGGCG GTGCTGCTTT GGCTCCAACT GTGCTTGGTA TCATTACTTT AGCTTTCGGT ACTGGTTTCA TCAAGCCAAA CTTGTTACCT CTTCTTATGG ACCAATATAG AGAACAGACT GATGTTGTCA AAGTCTTGCC ATCTGGTGAA AATGTTATTA TCGATAGACA AAAGACTTTG GAAAGAATGA CTTTGATTTT CTATTGGGCG ATTAACATTG GTGCTTTTTT CCAATTGGCC ACTTCTTATA TCGAAAGAGA TGTTGGTTTC TGGTTGGCTT TCTTCATTCC CATAATCATA TACTTGGTTT TGCCAATTGT CTTGGTTTTC TTGCAATCTA GATTGGTCAG AGATACTCCA CAGGGTTCCG TCCTTGAAAA CGCTTGGAGA GTTACAAGAG TCACTTTCTC TAAAGGGTGG ATCGGTAGAT GGAGGAATAA CACCTTGTGG GAGTACGCGA GGCCATCCGT CATGCTTGAA AGAGGAAGAG AATTTTACAA TGAAAATACA AAATCTCCAA TCACTTGGGG TGATCAATGG GTGTTGGACA TCAAGCAAAC TGTCAACTCT TGTAAGATTT TCATCTACTT CCCAATCTTT AACTTGGCTG ATAGTGGTCT TGGTTCTGTC GAAACTTCTC AAGCTGGTGC CATGACCACT AACGGTGTTC CAAACGATTT GTTCAACAAC TTTAACCCAT TGACCATTAT TATCTTGATT CCAATTCTTG ACTACCTTGT CTACCCTATG TTGAGAAAGT ATAGAATTGA ATTCCGTCCA GTTTGGAGAA TTTTCCTTGG TTTCATTTTG GCTGGTTCTT CTCAAATTGC CGGTGCAATC ATTCAATGGA AAATTTACAA GACTTCACCA TGTGGTTACC AAGCTACTAC TTGCTCTGAA GTGTCTCCAT TGTCGGCTTG GCAAGATGTT TCTTTGTACA TTCTTTCTGC TGCAGGTGAA TGTTTTGCTA ATACTACTGC TTACGAATTG GCCTACACTC GTTCTCCTCC TCACATGAAG GGCCTTGTTT TGGCTTTGTT CTTGTTCACT TCTGCCATCT CTGCTGCTCT TTCACAAGCA ATCACTCCAG CTTTGAGCGA CCCACACTTG ATCTGGCCAT TCGCTGGTAT TGCCATCGCA ACTTTTGTTG CCGCATTCGT ATTCGTCTAT CAATTCAGAA ACTTGCACAA GGAAATGGAA GAGGAAAGGA TTCTCAGAGA AGCCTTTGAT AAATCTGAGA GAAGTAATCT CATATCGCAC GGTGGAATTG AAGATGACAA CAACTTGCAA GCAGTTACAT CCATCAAGTC TGCCGTTGGT 
AAGTAA
|
Protein sequence | GDYDYNDANN YSTHYVDEYN PKGLRVPTDE ESQSLRRILG RASYASYLIC LCELAERASY YSVQGILSNF IQRPMPENSP HGWGAPADRN SNVSAGALDQ GLQAANALTL LLTFLAYVVP LYGGFIADTK IGKFKAIWVG VIAGFVSHVL FVIAAIPSVL KNGGAALAPT VLGIITLAFG TGFIKPNLLP LLMDQYREQT DVVKVLPSGE NVIIDRQKTL ERMTLIFYWA INIGAFFQLA TSYIERDVGF WLAFFIPIII YLVLPIVLVF LQSRLVRDTP QGSVLENAWR VTRVTFSKGW IGRWRNNTLW EYARPSVMLE RGREFYNENT KSPITWGDQW VLDIKQTVNS CKIFIYFPIF NLADSGLGSV ETSQAGAMTT NGVPNDLFNN FNPLTIIILI PILDYLVYPM LRKYRIEFRP VWRIFLGFIL AGSSQIAGAI IQWKIYKTSP CGYQATTCSE VSPLSAWQDV SLYILSAAGE CFANTTAYEL AYTRSPPHMK GLVLALFLFT SAISAALSQA ITPALSDPHL IWPFAGIAIA TFVAAFVFVY QFRNLHKEME EERILREAFD KSERSNLISH GGIEDDNNLQ AVTSIKSAVG K
|
| |
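The record lists translation table 12, the alternative yeast nuclear code, in which CTG is read as serine rather than leucine. The sketch below shows how the gene sequence could be translated under that table. It is a simplified illustration, not the pipeline used to produce this record: the helper name `translate` and the table construction are hypothetical, and the code handles only unambiguous A/C/G/T bases.

```python
# Standard genetic code, codons ordered by base T, C, A, G at each position.
BASES = "TCAG"
STANDARD_AA = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: STANDARD_AA[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
# Translation table 12 differs from the standard code at CTG (Ser, not Leu).
CODON_TABLE["CTG"] = "S"


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon.

    Whitespace is ignored, so the space-separated 10-base blocks used in
    the Sequence section can be passed in directly.
    """
    dna = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[pos:pos + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence listed above.
print(translate("GGTGATTACG ACTACAATGA CGCAAACAAC"))  # prints: GDYDYNDANN
```

The output matches the first ten residues of the listed protein sequence. A sequence containing ambiguity codes such as N would raise a `KeyError` in this sketch.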