Gene Information |
Locus tag | PICST_66897 |
Symbol | |
ID | 4837503 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009042 |
Strand | - |
Start bp | 927483 |
End bp | 931285 |
Gene Length | 3803 bp |
Protein Length | 993 aa |
Translation table | 12 |
GC content | 43% |
IMG OID | 640388818 |
Product | predicted protein |
Protein accession | XP_001382949 |
Protein GI | 150864215 |
COG category | |
COG ID | |
TIGRFAM ID | |
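The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, though it matches the 3803 bp listed); the function names `feature_length` and `orf_length_nt` are hypothetical helpers, not part of any database API.

```python
def feature_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature whose coordinates are 1-based and inclusive (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def orf_length_nt(protein_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode a protein, optionally counting one stop codon."""
    return 3 * (protein_aa + (1 if include_stop else 0))


# Values taken from the record: Start bp, End bp, Protein Length.
gene_len = feature_length(927483, 931285)
orf_len = orf_length_nt(993)

print(gene_len)  # 3803, matching the "Gene Length" field
print(orf_len)   # 2982, i.e. 993 codons plus one stop codon
```

A 993-residue protein plus its stop codon accounts for 2982 bp, which is shorter than the 3803 bp gene length; the record itself does not explain the difference.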
| |
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.384478 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCGACC TACGACCCAT CAATCAGCCA GAGGGACCTC TAGGGGTTTT CTCTGGTGAA AGCTCTTCAA ATCAAGCCTT AAACGAGCCA AAACCACCGG ATATTGACCG AAATCCACAC GACCATGTCG CACCCCCTGA TCTGATGGAC CTCGATGGCG AGTCCACAGA TGCCGATGAA AACGGCGATT TAGAGCCATC TTTCGAGACA GCATTGTCAA CTACTTCCGA GGAACCAGCC CAACTACGGG ATAATCCCAT GGACGGACTG GTTAGTCAGA ACCAGACCCA ATTAAGCTCC ACTATGGACA GTTTTGAAAC TTCGGTTTCG AAAAATTTTC ACGATCACGT GATCGGACAG GTACACGACC AGGAAATGGC TGCAAATGAT GAAAATGAAA TGGTTTCCAC TTCTTTAGAA TCTACCAGCA GATCAACTGA TTTACCAGAA CAAGAAGACT CTCTTCTTAT TCACCATCAA CATGAAAACG CCAAAACCCA AAAAAACAAA GAAAATTTAC AAAACTCCCA AAAAAATACA GGAAACTTAC AAAACTCAAA AAAAAACTCA AAAAACTCAA AAAATTCAAA AACAACTCAA CCAAATGTTG ATCAGATCTT CCCTATCTTA GGAACTGCCA GCAAATCAGC CAAAACAGGT TTACGTACCT TCAATATTGC CAAACAAGTA CCAATCCCAA TACTCAATCC AAAAAATGGC CCATCTGCTA GCCAACTCCA AAAGGTAATT GTGGACAGAG ACTCCTCACC TATTCTTCAA GACGCAAAAA CTAGACGTAA GCAATTGACC GAACTACACG AAACTACCGG TCACCTTTCA GAAGAACAAT ACCGTCAATT AGCAAACACC TACTATCTCC AAAAATATGC CAACTCCCTT TCAGATCTCA ATTGGGCGGG ACAAGAAGAA AGACTTAAGG CATGGAATGT GCCCCAAGAT TTCTGCCTTA CCGCACTCGG TGAGATTGCT CGAAATTCAA ACGAGAAAAG ATACGTACGT CTTCAAATAG ACGCTTTCTA TAACAAAAAT GACCATCTCG ACCAAAGGCA TTCAATGCGG GCCGAGGAAA TGGCCAAAAT AATTGAAAAA CACCTTACAG AGACTATCCC AAACAAATGG CCCATGGTCT CCCAAAACAA TAACAAAACT TTAAATGAAC TCCATAAATC CTTGGAGTTC TTAAAAAGTC AATTCGACCC CGGCTCAAAT GACGAATTTG AAGCCACTAA GGACATAAAA GACAAAATGC AGCAATTGTC TAGAGAAATA AGTTTCAGCG CTACAATGAG AGATGTTAAA GACAAGTTCC ATACAATTGT AAGAGATAAC ACTCATGTGG ATTTTCGTTT CGGAAGCATA GTAACTCAGG ATCGGATCCC AGAACTGGCT CAACAGAAAT CAACACCCAT GACAACTTGG CTTGAAAGAT TTCAACAACT TTTCCACTCA CCTTACGTGG GCAGTGAAGA CACCTTTGAA TTTCAGCTCA GTCTTGTAAG GCCAAAGCAA CTATCCAACA TGTATCTAGT TGGCATAAAA GCTTCGTCGG ACTTCCCAGA TCCGCGAGAC ATCCTAGACA TCATGTTCCA CACAAAAGAT TTTGAAATTC CTCAATACAT CGATACATCC TTACCTTCTC GTAAACCCTC GAGACGACAG GACCGAATCC CGTACGAAAT TCTACATCAA TTCATAAGAA AGGCACCAAT CTTAAATTAC ACTGACAAAA AAGCAGACTT TACACACGTC CTGTTCTTCT TGATTGGCAG CAACTCAGAC 
AATATTCCCA ACAGAAAGAG TATCTTCTTG GAAAACGTCC AACTTGACAT CATCTCTTCA TACCAATTCT GTTTCAGATG CCACAACAAC AAGCACACTA CCAAAAGATG TCCAGTTCCC AAATCAACCA CATTGTTCCA AAGTCGGCCC TTAACACAAT GGCCCACTAA GGTACCATCC CCAAACAAAC AGAACCATCC AATAACCCAC ACCACGGGTC CCAGAACTAC AAATCAAGAC GGGGATGGCT TTAGTCGACC CACGAAAAGA TCTAGACAAA AGACAACCCC CAGTCCACCA ACTATCCCAC AACAACAGAA TAGCTTCGAG GTGCTCCCGA TAGAGGACCT CACAACACAA GAGGTTACAG CAGAGGAGAC CGAAGCCACA CGCAACCGGC CAAACACTGT TTCATCTACA CCACAAGCAC CATCACGTCA CGAAACCCCG AAAGCCATTA ACAAGAACAA CGATAAAGCC CCTGAAATTT CAACACAAGA CGATGAAATG GTGTACTACA CCGATGACGA AGAACCCTCG ACTATAAATG ACGAGGACAC CCAACTCGCT CCATCTACAT TGATTGAGCA ACAACAAAAA AATTCAAAAG TGGATACCAC CCCAAATACT CCACAATACA CTACTGACAT GCTCCCTCTG AAGCATGCAC CTAAATCAAA TTCAAGATCT CAACCGGTAA CTCCCTCCCG GCCTACTAGC ATGCTCCCTC TGAAGCATGC TCCCATAACT AGTTCCAAAT CTCAACCGGT AACCCCTGCC CGGCTTGGAT CCAAGGTCCT CAAACCTGCG TCAACAGGTA AGAAACCCTG GACACCGACA CCCACTCCCA GGACCACGAA TGGCCCTAGT GGGTCCCCCG CAAGGACTCC GACCACGTCA ATACGACTAC CTTCTCTCCT AAACAACAGT AGAACAAACT ATAGTGAATT AAGGGAATCA CAAATGGGAT CGACCATTAG TTTTCCTGAG TCTATGAGGA CATCCAACCT CAACATCCCC GACTCTTCAC TAATTCTACA AACTCAAATT GAGGAAGCAA CTCAAGTAGA TTCTCCAGGC CAACAACAGG CACCAGGTCA CAATCCATTA ACAGATGACA ACCTGGACCT AAGTATGGAC ATAGAGGACA TCAACGTTCA TTCTGATAAT ACTAATTATT AAGATGTCTA GATTAGTAAA TTTTCGGATA ACCAACCCGC TGTTAACGGA ACACCGGGTA GCAAATCCAA CTAATCTTCT CAAGATCCGT ACTAAAAATA TTCAAAAAAA CACACAGATC AATAAGTTTC GAGAACTCGC GACTTACTGT GATGTCTTAC TCATTCAAGA AACTGATTTT AATTCAAAAC AACCAGCCAA CTCCCAGAAC AGGCGGGGAG CCAGGAGGGG AGCAAGACGG GCAATGCAAC AACAACCCCC AAACTACTCT CAGGATCCCA CCCCAGATTG GATTACCTCA TTACAAAAAC AATTGAACCA AGCAAATCAA GAGCTAATCT ATACAGATAC ATTAGCTCGC TCGGGAATCA TACTTAATTT TCAACACCAA CACTTTGAAA AGATTTCGTC TAATAACCTC CAGCTCGACT CTGAAATCGC GCGGTATGCG ACCGACGTAA TCATTCAACT CAAAGAAACA AAAGAATATA TCTTAGTTAT CTCAGTGTAC GGACCGAGTG GGAATCATCG TTCTCAAGAG CAATTATTCC ACTCTCTTTA TACTCTGATT AACACACTCA TCACAAATTT CGAAATTGAC AATAACAATC ACAAGCTCCA CCTCTGTATA 
GGGGGAGACT TCAATATGAT ACAAAATCCA GAACTAGATT CTACTGCAAG AGAAAGCTCC AGTCGAGAGC ACGCCTCCCG ACAAGCCTTC AACTCTCTTT GCAACGAATT TCAACTACAT GACTCTCTTA GAGGATTAGA ACCGACTATC AAGGTACCCA CAAATACCAA CACAAATAAT TGCAGAAGAC TCG
|
Protein sequence | MADLRPINQP EGPLGVFSGE SSSNQALNEP KPPDIDRNPH DHVAPPDSMD LDGESTDADE NGDLEPSFET ALSTTSEEPA QLRDNPMDGS VSQNQTQLSS TMDSFETSVS KNFHDHVIGQ VHDQEMAAND ENEMVSTSLE STSRSTDLPE QEDSLLIHHQ HENAKTQKNK ENLQNSQKNT GNLQNSKKNS KNSKNSKTTQ PNVDQIFPIL GTASKSAKTG LRTFNIAKQV PIPILNPKNG PSASQLQKVI VDRDSSPILQ DAKTRRKQLT ELHETTGHLS EEQYRQLANT YYLQKYANSL SDLNWAGQEE RLKAWNVPQD FCLTALGEIA RNSNEKRYVR LQIDAFYNKN DHLDQRHSMR AEEMAKIIEK HLTETIPNKW PMVSQNNNKT LNELHKSLEF LKSQFDPGSN DEFEATKDIK DKMQQLSREI SFSATMRDVK DKFHTIVRDN THVDFRFGSI VTQDRIPESA QQKSTPMTTW LERFQQLFHS PYVGSEDTFE FQLSLVRPKQ LSNMYLVGIK ASSDFPDPRD ILDIMFHTKD FEIPQYIDTS LPSRKPSRRQ DRIPYEILHQ FIRKAPILNY TDKKADFTHV SFFLIGSNSD NIPNRKSIFL ENVQLDIISS YQFCFRCHNN KHTTKRCPVP KSTTLFQSRP LTQWPTKVPS PNKQNHPITH TTGPRTTNQD GDGFSRPTKR SRQKTTPSPP TIPQQQNSFE VLPIEDLTTQ EVTAEETEAT RNRPNTVSST PQAPSRHETP KAINKNNDKA PEISTQDDEM VYYTDDEEPS TINDEDTQLA PSTLIEQQQK NSKVDTTPNT PQYTTDMLPS KHAPKSNSRS QPVTPSRPTS MLPSKHAPIT SSKSQPVTPA RLGSKVLKPA STGKKPWTPT PTPRTTNGPS GSPARTPTTS IRLPSLLNNS RTNYSELRES QMGSTISFPE SMRTSNLNIP DSSLILQTQI EEATQVDSPG QQQAPGHNPL TDDNSDLSMD IEDINVHSDN TNY
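The record lists translation table 12 (the alternative yeast nuclear code), in which CTG is read as serine rather than leucine. The sketch below illustrates the effect on nucleotides 121-150 of the gene sequence shown above. The two amino-acid strings follow the codon order of the NCBI genetic-code listing (bases in T, C, A, G order) and are written out by hand here, so they are worth checking against that listing; `build_table` and `translate` are hypothetical helper names.

```python
from itertools import product

BASES = "TCAG"  # NCBI codon ordering: TTT, TTC, TTA, TTG, TCT, ...

# Amino acids per codon, in NCBI order. Table 12 differs from table 1 only at CTG.
STANDARD_AAS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
YEAST_ALT_AAS = "FFLLSSSSYY**CC*WLLLSPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def build_table(aas: str) -> dict:
    """Map each of the 64 codons to its amino acid."""
    codons = ("".join(c) for c in product(BASES, repeat=3))
    return dict(zip(codons, aas))


def translate(dna: str, table: dict) -> str:
    """Translate in frame 0, ignoring whitespace, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = table[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


TABLE_1 = build_table(STANDARD_AAS)
TABLE_12 = build_table(YEAST_ALT_AAS)

# Nucleotides 121-150 of the gene sequence, copied with the record's spacing.
fragment = "GACCATGTCG CACCCCCTGA TCTGATGGAC"

print(translate(fragment, TABLE_12))  # DHVAPPDSMD (matches residues 41-50 of the protein)
print(translate(fragment, TABLE_1))   # DHVAPPDLMD (standard code reads CTG as Leu)
```

Translating with the standard code would therefore give a leucine wherever this protein has a CTG-encoded serine, which is why the translation table field matters when re-deriving the protein from the gene sequence.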