Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_54632 |
Symbol | |
ID | 4837360 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009042 |
Strand | - |
Start bp | 289516 |
End bp | 291447 |
Gene Length | 1932 bp |
Protein Length | 591 aa |
Translation table | 12 |
GC content | 43% |
IMG OID | 640388675 |
Product | predicted protein |
Protein accession | XP_001382819 |
Protein GI | 150864116 |
COG category | [R] General function prediction only [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG5391] Phox homology (PX) domain protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.397245 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTTTGG AAGACCAATT CTCGTCCATC CAATTGGACA GAGACGAAAT TGACAGAGAT AAAAACAGCA ATCATGATGA CGACAAGCCT TCGACTATTG CCGAGGAAGA TGCTCTAAGC GAACTGCAAA CACGTACAGA ACACGAACAG AGTGCTTCTG CAATAGATCC AACGACTGAA TTCAACTCCG AGGCCCAAAT TCAAGGTATA GATGGTGAAA ATGGAAATGT AGGAAGCCCG AATTTCGATT CAGATAATGC CAATTCGGAG TCTGTAGGCC AATCAAAGGG AAGTAACGAG TCTGCTGGTA CAACTGTAGT TAATAGATCA AAGTCCGGGA GTTCTAGTGA TCAAAATGCC TCGGAACCGG GACCCTCTCC AATTGTTCAG GGTTCTTCAT CTACTACCTC GGCAGTTTCT GGAGCTAATA GAATTTCTAG TCAAGCAACA ATAACTCCGC TGGAACAAGA GAGAACTGTA ATGTTTGAAA AGTATCGGAT TGAGTCTTCG GTGACGTCTC CTATCAACGA TTTGGATACC GCTTCGAAAC ATTTCATTTC GTATTTGGTG ACTACCACAA CAAACCATCC GGCAGTGGTG AAGCTTTCGT CTCACAGGTC GGCTCCAGAA GACGAGTACG TCACCATTAG TGTTCGGCGT AGATACGGTG ATTTTTCCCT CCTTCACGAG TGTTTATCGA ACGATATTCC TACAGCCATG ATTCCTCCAT TACCCTCTAA GTCCAACTTC AAATACTTAA CGGGTGATAC ATTCAGCACC GAATTCGTCA ACAAACGGTT GCATTCGTTG GACAGGTTCA TGAAGTTCAT CTTGCAGCAC AAACGATTAT CTCAGGAGTC AGTGTTCCAT CTTTTCATCA GTGACTCTAA TGACTGGGGC AACTTCACGA AGAACTTGAA ACTTCGCGAC ATTAACTACG ACGAATCTGG GGCTGGTTCC TCAGCCAACG GATTTGTCAA CAAAGTTGTC AACGAAGACA TGATTACGGA GAAGGTGATG AACTACTTTA CTTCATCTAA ACACAAACGA GAAACGAACA AAGATATTCT AGAAATCAAC GACAAGTTGA AGAAGATCTA CGAAAACCTC ATGAAGTTAG ACAAGATATT TGTCAGATTG AACAAGAAGA ACCACGACTT AAGCATCGAT TACGATCAAT TCCTGGCACA AATCATGAAG CTATCCTTGG TACAAACTAC AGATATGAAC TCGACCTCTG AACCATCGAC ACCCTCCAAA AATGGCGACG CTGCTTCTGT TGTTGTTGAT TCTACTATCA CCAACAATTT CAAGATATTT GCTGATTCCC TCGACTATCT CCTGAAGAAC TGGTCAGAAT TGCACAAATA CGTAGATGAA ACATTTCTTG TTTCATTACG GGACTGTTCC AAATACATCA TAAGTTTAAG TAACCTCATC GAATTTCAGC ACAACAAGAA AATCGATCTT CAGGTTTTAC AGGATTACTT GGCCAAAGCC AGGAGCGAGT TGGCTAGTTT TGGCGGTTCG ACGGCAAGTA CAGGTGCTCG TCACCCACCT CCTGTACCAG TATTGAATAG TCATTCTGGA GGCGGAATAG TGAATAATAC CACCCAGTTG ATTAAAGACA CTTTGTCTAC ATCTGCGACT CCACACATAG GATCTACAGC CACGGATGGA AAGATCGGCC GATTAGAAGA AAGGATCAAC AAATTGGAAA GTGAAATCAC CGTGCAAACG AAGCTTGTCA ACGACCTCAC TTATAGGATA ATAAACGAAG AATATCCCAA CTGGGACAAG TTTAACCGCA ATCAGTTGAA GGAGTCGATG GTGGGGTTAT GCGACCAGGA GATCAAGTTT TATAAGGGGT TGGTTGACAA CTGGAGCGAT GTTGAGACCA AGTTGTTGCG GCGGTTGGAC GAATTGAAAT GA
|
Protein sequence | MSLEDQFSSI QLDRDEIDRD KNSNHDDDKP STIAEEDALS ESQTRTEHEQ SASAIDPTTE FNSEAQIQGI DGENGNVGSP NFDSDNANSE SVGQSKGSNE SAGTTRTVMF EKYRIESSVT SPINDLDTAS KHFISYLVTT TTNHPAVVKL SSHRSAPEDE YVTISVRRRY GDFSLLHECL SNDIPTAMIP PLPSKSNFKY LTGDTFSTEF VNKRLHSLDR FMKFILQHKR LSQESVFHLF ISDSNDWGNF TKNLKLRDIN YDESGAGSSA NGFVNKVVNE DMITEKVMNY FTSSKHKRET NKDILEINDK LKKIYENLMK LDKIFVRLNK KNHDLSIDYD QFSAQIMKLS LVQTTDMNST SEPSTPSKNG DAASVVVDST ITNNFKIFAD SLDYLSKNWS ELHKYVDETF LVSLRDCSKY IISLSNLIEF QHNKKIDLQV LQDYLAKARS ELASFGGSTA STGARHPPPV PVLNSHSGGG IVNNTTQLIK DTLSTSATPH IGSTATDGKI GRLEERINKL ESEITVQTKL VNDLTYRIIN EEYPNWDKFN RNQLKESMVG LCDQEIKFYK GLVDNWSDVE TKLLRRLDEL K
|
| |