Gene Information |
Locus tag | PICST_56684 |
Symbol | |
ID | 4838212 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009043 |
Strand | + |
Start bp | 913889 |
End bp | 916195 |
Gene Length | 2307 bp |
Protein Length | 768 aa |
Translation table | 12 |
GC content | 44% |
IMG OID | 640389527 |
Product | predicted protein |
Protein accession | XP_001383460 |
Protein GI | 150864581 |
COG category | [R] General function prediction only |
COG ID | [COG1409] Predicted phosphohydrolases |
TIGRFAM ID | |
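
The coordinate and length fields above are mutually consistent, and the arithmetic can be checked with a short sketch. It assumes that Start bp and End bp are 1-based inclusive positions on the replicon, that the CDS is unspliced (as the record states), and that the gene length includes the terminal stop codon. The function names `gene_length` and `protein_length` are illustrative only and do not come from the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes one stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # subtract the stop codon


length_bp = gene_length(913889, 916195)
print(length_bp)                  # 2307, matching "Gene Length"
print(protein_length(length_bp))  # 768, matching "Protein Length"
```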
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATCGGAC TCCCGCGGCT TTGGGTCCGC CAGTTGGGCT ACCTATGCGT GTTTTTTGTA CTAGCAGTGG TGCTTCTTAT TGTAGCCAAC TCACAACAGA TCATCAAGAT CCAGGACTAT ATTCCACTGA GCCTTACACC AACTTTCTTC CAGCCTGCTG CCGACCACTA TATAATCGAT ATCGTAATCC GTAATTGCTA TGGCTACAAA TCAAAGCTTC CAGGTTGTGG AAAACCCGTA GACAGTGAAG GTGAATTGGG CTACCTCGGA ATGTACGGTG AATGGACCAA AGTCGACAAA GACTTGTCAC TTGGCTCTGG ATGGGTCAAA CAACAATACC TCTCGTATAA GATGTTGAAA GCGGACGTTC TGGATACCGA ACTCAACAAG GTTGCTGGAA AAGATGCCAC ATCGACTATC AACAGAAGGG TAATTCTTGA TCTCACTGTA GCAAACCCTA GCAAAGATGC GAAGATCAAG GGCAACGAGA GGTCCAAGTA TCCAGCCAAA ATTATAAAGG AGTACAACTC AAACAAAGTA GCTGGTGAGC TGGATATCGA ATTCTTAAAA GAACAGGGCA AGAAAGAACA CGGTGCTGTG ACAGAAGCAC TCACAGTAGA CAAGGATAAA GCTGCCAGCC GTAAGATCAG CGAACTAAAT GCAAAGCTCC ACAAAGAGAG CCAAAGGAAA CAACAGGGTG ATAAACCAAC TAAGGAATTC GAAGAAGATA TCAGCCCGCA AGACCCTGAA GCTGGTTCAA AAGAGGTTCA TGCTGAAGCA AATCCGGAAG ACGAACTAAA GGCAGCGGAA GCAAAAGCTG AAAAGAAACT CGAAAAGGAA GAGATAGAAA AAGAAGAAAA AGAGCAAAAG AAACAGAGCG AGGAGGCACA GAAAGCACAG AAAGAACAGT TTGAACAGGA GCAAACTCAG AAAGAAAAAC AGGAACAAAA TGAAAAGAGA GTGCTAAACA AGCCACTTGA CAAGAGAGTT GTAGAAGAAA GTAGACATGG ACTTAATAGT GTGGTCTACA TTCCAAGTAA GGAAGACGTA AAGAACAGTG GCTGGGTAGA AAAGTCAAAT GGAATTTGGG TGAAATACGG TGCTCCTAGA CACAACGCTG TCACAGCGAT TGATATTCTC TTTGGTGAAG ATGCTGTGGA ACCTCGGCCC AACTGGGAAT TGGTAGATAG TCCTTTGACG GGGACAGCTA CACAGTCAGA CTTGCCAGCA TATTTGACCT ATAGAAAGGG CCCCAAAGCA GACTATAGAA TCAAGGAATA CCAACCAGTT CTCAAGGTCA ATAAGAATGG CAAGTTCAAG ATTTTGCAAG TTGCCGACTT GCACTTCTCC ACTGGATATG GTAAATGCCG CGATCCCTCT CCAGCTCTGA CTACAAAGGG TTGTCAAGCC GATCCAAGAA CGTTGAAGTT CTTAGGTAGG GTTTTAGATA TTGAAAAACC AGATTTCGTT ATATTGACTG GAGATCAGAT ATTTGGCGAT GCGGCACCAG ATGCTGAAAC CGCAGTTTTC AAGGCATTGT ACCCGTTCAT AAAGAGAAAG ATACCGTATG CGGTGACAAT GGGGAACCAT GATGACGAAG GATCGTTGTC CCGTAATGAG ATCATGAGTC TTTCGGCTAA TTTACCATTT TCTAAGGCAG AACTAGGACC AGAAGATATT CAAGGTGTAG GCAATTACTA TTTGACAGTG GAGGGTCCAG CTTCACACAA TCCAGCGTTG TCGTTGTATT TCTTGGATAC ACATAAGTAT TCGAGTAATC CTAAGATCAC ACCAGGCTAC 
GACTGGATTA AAGAGAATCA GTTGAAGTGG CTAGAGGCAA CCGCAGCTAG TTTGAAGAAG TCCATAGCAG CATACACCCA TATTCACTTG TCTATGGCAT TCTTCCACAT CCCATTACCA GAATATAGAA ATCTCAAGCA GCCATTCATT GGTGAAAACC GAGAGGGAGT GACTGCTCCT AGATATAATT CTAATGCTAG ATCTGTACTA AGCGATATTG GCGTGAAAGT TGTCAGTGTT GGCCATGACC ATTGCAACGA CTACTGTTTA CAAGACTTCC AGAAAAAGGA TGGAGTCACC GAGTCGAAGA TGTGGCTCTG TTATGGTGGA GGTTCAGGAG AGGGTGGTTA CGGCGGATAT GGAGGATATA TCAGGAGACT TAGAGTTTTC GACATCGATA CCCAGAACGG AGAAATCAAG ACATGGAAAA GAGCGGAAAA CGACCCTGAC AAGGAAATTG ACCGCCAGAC CATAGTTCAA GGTGGAGAGG TTGTCAACTT TGCATAA
|
Protein sequence | MIGLPRLWVR QLGYLCVFFV LAVVLLIVAN SQQIIKIQDY IPSSLTPTFF QPAADHYIID IVIRNCYGYK SKLPGCGKPV DSEGELGYLG MYGEWTKVDK DLSLGSGWVK QQYLSYKMLK ADVSDTELNK VAGKDATSTI NRRVILDLTV ANPSKDAKIK GNERSKYPAK IIKEYNSNKV AGESDIEFLK EQGKKEHGAV TEALTVDKDK AASRKISELN AKLHKESQRK QQGDKPTKEF EEDISPQDPE AGSKEVHAEA NPEDELKAAE AKAEKKLEKE EIEKEEKEQK KQSEEAQKAQ KEQFEQEQTQ KEKQEQNEKR VLNKPLDKRV VEESRHGLNS VVYIPSKEDV KNSGWVEKSN GIWVKYGAPR HNAVTAIDIL FGEDAVEPRP NWELVDSPLT GTATQSDLPA YLTYRKGPKA DYRIKEYQPV LKVNKNGKFK ILQVADLHFS TGYGKCRDPS PASTTKGCQA DPRTLKFLGR VLDIEKPDFV ILTGDQIFGD AAPDAETAVF KALYPFIKRK IPYAVTMGNH DDEGSLSRNE IMSLSANLPF SKAELGPEDI QGVGNYYLTV EGPASHNPAL SLYFLDTHKY SSNPKITPGY DWIKENQLKW LEATAASLKK SIAAYTHIHL SMAFFHIPLP EYRNLKQPFI GENREGVTAP RYNSNARSVL SDIGVKVVSV GHDHCNDYCL QDFQKKDGVT ESKMWLCYGG GSGEGGYGGY GGYIRRLRVF DIDTQNGEIK TWKRAENDPD KEIDRQTIVQ GGEVVNFA
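
The record lists translation table 12 (the alternative yeast nuclear code), in which the codon CTG is read as serine rather than leucine. This is visible in the sequences above: the gene fragment ATT CCA CTG AGC CTT ACA corresponds to residues IPSSLT in the protein. A minimal sketch of that translation follows. It builds the standard code from its usual compact string and, as an assumption of this sketch, models table 12 only by overriding CTG; the helper names `codon_table` and `translate` are illustrative.

```python
BASES = "TCAG"
# Standard genetic code, ordered by first, second, third base over "TCAG".
STANDARD = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def codon_table(ctg_as_serine: bool = True) -> dict:
    """Map each codon to an amino acid; optionally read CTG as Ser (table 12)."""
    table = {}
    for i, b1 in enumerate(BASES):
        for j, b2 in enumerate(BASES):
            for k, b3 in enumerate(BASES):
                table[b1 + b2 + b3] = STANDARD[16 * i + 4 * j + k]
    if ctg_as_serine:
        table["CTG"] = "S"  # the amino-acid difference between table 12 and the standard code
    return table


def translate(dna: str, ctg_as_serine: bool = True) -> str:
    """Translate a DNA string (whitespace ignored) up to the first stop codon."""
    seq = "".join(dna.split()).upper()
    table = codon_table(ctg_as_serine)
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        residue = table[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


fragment = "ATTCCACTGAGCCTTACA"  # codons 41-46 of the gene sequence above
print(translate(fragment))                       # IPSSLT (table 12)
print(translate(fragment, ctg_as_serine=False))  # IPLSLT (standard code)
```

Translating with the standard code instead would place leucine at every CTG codon and would not reproduce the protein sequence listed in the record.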