Gene Information |
Locus tag | PICST_30714 |
Symbol | |
ID | 4838083 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009043 |
Strand | + |
Start bp | 756286 |
End bp | 757350 |
Gene Length | 1065 bp |
Protein Length | 354 aa |
Translation table | 12 |
GC content | 44% |
IMG OID | 640389398 |
Product | predicted protein |
Protein accession | XP_001383427 |
Protein GI | 150864562 |
COG category | |
COG ID | |
TIGRFAM ID | |
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.00434268 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.747519 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGTCGACTC ATTTTACGCG TGACTTTTTG CAGGTGATTC GTGGGCCGAT ATGTAATCTT GAAATGTTGA ACCTAAACAT CCTGAAACGA CATCCAAAAG ACCAACTACA GCTGGCTGGG AATATGAGTG CGGATGGGGA CTTCGATATT GACCATCGCG ACATATTTCA GAACTTGGAT GTTAGCCTTC TAGAGCTTTT ACATCGTCGT AGAGTGACGG AGCGTTTTCT AAACGCTTCG CCCGACGAGA GAATCGCTAT ACTCAACGAA GAGATGGACA CTCTTGGCCA GGAAATTGAT GATACTTTAA TAGAGAGGCA TAATCTCTAC ACTGAACAAA GAGGTCACGT CCACAACTCA GTGTTTCGGT GGGTCTGGAT CGGTATTCCT CTTCCAGTCT TCAGATCTCG CAACTTTTAT AGAAATTATA GTATGAGACG CATGTTGACC GTGTTGGCCG CATTTGTTAC TGCTCTATGG CAGCGTTTCG TTCGTATGGT GCTGTTGGTT GCGTTTGGCT TTGCACTTTT CAATTATGTG CCTAATCTAT TCCGAATGGT GCTTGTTATC GGAAACCAAA TCACATTTTC CGAAAACTTC TTAAGAGATG CTTTGACTTA TATATTCCGT GACAACACAG CAATGATGGA ACGTCACCTT CTCATCTTGA AGAGCAATTA CGATTTGTCC GCTGGAATTG CTGGCGACAG CTCCGTGTTT AACACCACCT ACTCATTGGC ATACAATATT ACCTGTAGAT ACCTAATGAG TTTCTTCATT GATTTCGGAG AGGACACCGC TGTGTATATT GACAACAAAT CGTTGGTGTT CAAGTTTGGT GATGTAATTA ACAGCTGCTT CCCGGTGTTG ACGAGCCATC CCTGGTATGG GACCTTTGTC ACCGTCTCGG TGTATATGTT GTACTCTGTA GTGGGAATCC TCATCTGTGT GAACATCAAC TGGTTCTACG CAGCCAACAT TCTCAACCGT ATCCTGAGAT ACAGACGGTT TTACTTTAGC TTGGCCAAGA TCGTGTGGAA GAGCTTGTTT TCGGAAGTCT TATAG
Protein sequence | MSTHFTRDFL QVIRGPICNL EMLNLNISKR HPKDQLQSAG NMSADGDFDI DHRDIFQNLD VSLLELLHRR RVTERFLNAS PDERIAILNE EMDTLGQEID DTLIERHNLY TEQRGHVHNS VFRWVWIGIP LPVFRSRNFY RNYSMRRMLT VLAAFVTALW QRFVRMVSLV AFGFALFNYV PNLFRMVLVI GNQITFSENF LRDALTYIFR DNTAMMERHL LILKSNYDLS AGIAGDSSVF NTTYSLAYNI TCRYLMSFFI DFGEDTAVYI DNKSLVFKFG DVINSCFPVL TSHPWYGTFV TVSVYMLYSV VGILICVNIN WFYAANILNR ISRYRRFYFS LAKIVWKSLF SEVL
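The record lists translation table 12, the NCBI alternative yeast nuclear code, in which CTG is read as serine rather than leucine. The following is a minimal sketch, not part of the original record, of how the gene sequence could be translated under that table and how the listed lengths follow from the coordinates. The names `TABLE_12`, `translate`, and `expected_lengths` are illustrative assumptions. The example input is only the first 90 bases of the gene sequence above.

```python
# Illustrative sketch: translate a CDS with NCBI translation table 12
# and cross-check the lengths listed in the record.

BASES = "TCAG"

# Amino-acid string for NCBI table 12, codons ordered TTT, TTC, TTA, TTG, TCT, ...
# It differs from the standard code only at CTG (Ser instead of Leu).
TABLE_12 = "FFLLSSSSYY**CC*WLLLSPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TO_AA = {
    b1 + b2 + b3: TABLE_12[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(cds.split()).upper()  # drop the 10-base block spacing
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for pos in range(0, len(seq), 3):
        aa = CODON_TO_AA[seq[pos:pos + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def expected_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Gene length from inclusive coordinates; protein length excludes the stop codon."""
    gene_len = end_bp - start_bp + 1
    protein_len = gene_len // 3 - 1
    return gene_len, protein_len


# First 90 bases of the gene sequence in this record.
prefix = (
    "ATGTCGACTC ATTTTACGCG TGACTTTTTG CAGGTGATTC GTGGGCCGAT "
    "ATGTAATCTT GAAATGTTGA ACCTAAACAT CCTGAAACGA"
)

print(translate(prefix))                  # MSTHFTRDFLQVIRGPICNLEMLNLNISKR
print(expected_lengths(756286, 757350))   # (1065, 354)
```

The CTG codon at bases 82 to 84 yields the S in "NISKR"; under the standard code the same codon would give L, so a default translation would not reproduce the protein sequence listed here.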