Gene Information |
Locus tag | PICST_82631 |
Symbol | |
ID | 4837867 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009043 |
Strand | - |
Start bp | 227934 |
End bp | 229797 |
Gene Length | 1864 bp |
Protein Length | 551 aa |
Translation table | 12 |
GC content | 46% |
IMG OID | 640389182 |
Product | predicted protein |
Protein accession | XP_001383673 |
Protein GI | 150864724 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
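The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates, and that the coding portion is the protein length plus one stop codon; neither convention is stated in the record itself. The variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 227_934
end_bp = 229_797
protein_length_aa = 551

# Assumption: coordinates are 1-based and inclusive, so add 1 to the span.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the coding portion is one codon per residue plus one stop codon.
coding_bp = (protein_length_aa + 1) * 3

# Bases in the gene span that the protein's codons do not account for.
non_coding_bp = gene_length_bp - coding_bp

print(gene_length_bp)  # 1864, matching the "Gene Length" field
print(coding_bp)       # 1656
print(non_coding_bp)   # 208
```

Under these assumptions, 208 bp of the gene span are not accounted for by codons, which is at least consistent with the record marking the gene as spliced. The record does not say how those bases are distributed.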
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.293318 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTTTCCT CGGGAAATTC AGCGGGAGGC GGTCTTTTTG GTGCGTCTCA GAACAATCCA GGCTCTTCTT CCATGTTTGG ATCAGCACAA CCAGCGTCTG GAACCTTTGG CCAGAATACA GCTACGGCTA GCGGCCCATT GCAAGTGAAC ACAAACTCTG GTGGTTTATT TGGCGGTGCT TCCAAAGCTC CCGCCACCTC TGGAGGGCTT TTTGGTTCTT CTAGCGGAGT TGGAGGTTTT GGTTCAGCTT CCAATTCGAC CGGTTTTGGC AATGCGGCCG GAAATGCCGC TCCAGCTTCG GGATTTGGTG CTTCTAGTAA TACTTTAGGT GGCGGATTTG GCCAGACGGG TAACAAACCA GCCACAGCTG CTGGTGGTGG TCTCTTTGGA GGTTCTACCA ATTCTAACAC TGGAGGTTTG TTTGGGAAGC CTGCCGGAAT TGCTGCCCCA GCGGCTTCCA CAGGTGGAGG ATTGTTTGGC GGAAATACCC AGCAAAATCT GGCTGCTGGA GGTGGACTTT TTGGAGGCTC TACAGCTACT TCTGGTGGCT TATTTGGCAA TAAACCCAGT GGAGCTGCTG GAAATAGTGG AGGATTGTTT GGAAGTGGGA ACACTGCTTC TGGTGGCAAC ACTGCCGGCA TGTTTGGTGC CAATACTGCC AATTCTGGAG TCAATACCGG TTCTAGTTTA TTTGGAAGCA AGCCGGCAGC ATCTACAAGT GGAGGACTTT TTGGAACTTC TAATACAGCC TCTAATACAG GAGGGGGATT GTTTGGATCT CAGCAGCAAC AGCAACAGCA ACAACAGCAG CAACAACAAC AACCACAACA GTCTTCTTTA TTTGGAGTCA ATTCAACCAA CAATGCCCAG CCAGCTTTTG GCTGGAACAG TAGCCAACAA CAGAAGTCTA GTTTTGGCAC GTCTCAACCA GCATTAAACA ACACATTTGG TGCAACTAAT CCTATCGCAT CGGCAGCTCC AGCTTCCAAC ACTAATAACA AGTATACACC AGCAATAAAC GATCAGTTGA TCAAGATCAA AGAGCAATGG GACCCTAATT CGCCTAAATG TGCATTGAAG ACTCACTTCT ACAACAAATT CAGTGAGCAG GAAATCAACA TTCTCTTGAA CCAGCAGAGA CCTAACAACG AAACCCCAGA AGACTGGGAT AACGCTATGA GCAAAAGACC CACGGCTAGC CATTATCCTA TCAAGATTTC GTCATTTAAC GATGTAGCAC AGAGAATAGA AACTCAGCTT GAACATGTGT CAAAGTCTAG GGTGATATTG AATGAAATCA ACGAGAAACT GAATGCATTG TCTTCCAAGC ACGATTTGGA AAATACTACT AGAATTTTAA AGGCGAAAGC CAAACATACA AAGTTGTCAA GAAGGTTGTT GAGATTGGCC ACAGTGTTGG CCATCTTGAA GTTGAAGGGT TACCCTCTTT TGCCAGAAGA AGAAGAGATT TCCAAGCAGT TCGATGTGTT ATCTTCTAAG CTTAATGATC CAAACAGTCC CGTAGGGAAG TTGAGTGACG TCTTTGCCAG ATTGGCAATC TTGAAAGAGA GAGCCGAGGA CTTGAACTAC CAGTTCGAAG TCTCTATTGG TGGGTTGAAC GGTTTGGCTA ATGATGACAA ACAGGAACAG AGAGTGGCTG AACAGAAAAA CAGCAATATC GAAGAGACCA TCAACAAGTT ATCCAAGGTT CTCTTGAAGC AACAAATGGG ATTGAACTAC TTGAACGAGG TGTTGGAGAA GGATTTGGAG GTGGTGGAAA AGGTTGCTTC TCGCTAAGAG 
AAGATCCTAC TAATGTATAT TAAATAGATC ACAACAAAAG TAATAGTAGA TACTGCTAAG TGCT
|
Protein sequence | MFSSGNSAGG GLFGASQNNP GSSSMFGSAQ PASGTFGQNT ATASGPLQVN TNSGGLFGGA SKAPATSGGL FGSSSGVGGG GFGQTGNKPA TAAGGGLFGG STNSNTGGLF GKPAGIAAPA ASTGGGLFGG NTQQNSAAGG GLFGGSTATS GGLFGNKPSG AAGNSGGLFG SGNTASVNTG SSLFGSKPAA STSGGLFGTS NTASNTGGGL FGSQQQQQQQ QQQQQQQPQQ SSLFGVNSTN NAQPAFGWNS SQQQKSSFGT SQPALNNTFG ATNPIASAAP ASNTNNKYTP AINDQLIKIK EQWDPNSPKC ALKTHFYNKF SEQEINILLN QQRPNNETPE DWDNAMSKRP TASHYPIKIS SFNDVAQRIE TQLEHVSKSR VILNEINEKS NALSSKHDLE NTTRILKAKA KHTKLSRRLL RLATVLAILK LKGYPLLPEE EEISKQFDVL SSKLNDPNSP VGKLSDVFAR LAILKERAED LNYQFEVSIG GLNGLANDDK QEQRVAEQKN SNIEETINKL SKVLLKQQMG LNYLNEVLEK DLEVVEKVAS R
|
| |
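The record lists translation table 12. Assuming this refers to NCBI genetic code 12 (the alternative yeast nuclear code), the codon CTG is read as serine rather than leucine. The sketch below translates a short in-frame fragment copied from the gene sequence above under both readings. The helper names `build_table` and `translate` are hypothetical, and the table is built by hand rather than taken from any library.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
STANDARD_AAS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)

def build_table(ctg_as_serine=False):
    """Return a codon -> amino acid map; optionally read CTG as Ser."""
    codons = ("".join(c) for c in product(BASES, repeat=3))
    table = dict(zip(codons, STANDARD_AAS))
    if ctg_as_serine:
        # Assumption: the only difference relevant here is CTG -> S.
        table["CTG"] = "S"
    return table

def translate(dna, table):
    """Translate an in-frame DNA string, ignoring spaces and any trailing partial codon."""
    dna = dna.replace(" ", "").upper()
    usable = len(dna) - len(dna) % 3
    return "".join(table[dna[i:i + 3]] for i in range(0, usable, 3))

# In-frame fragment taken from the gene sequence in this record.
fragment = "AATGAAATCA ACGAGAAACT GAATGCATTG TCTTCCAAG"

standard = translate(fragment, build_table())
table_12 = translate(fragment, build_table(ctg_as_serine=True))

print(standard)  # NEINEKLNALSSK
print(table_12)  # NEINEKSNALSSK
```

The second result matches the corresponding stretch of the protein sequence in the record (NEINEKS NALSSK), whereas the standard code would place a leucine at the CTG position.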