Gene Information |
Locus tag | PICST_81171 |
Symbol | |
ID | 4851745 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009068 |
Strand | - |
Start bp | 2724606 |
End bp | 2727924 |
Gene Length | 3319 bp |
Protein Length | 997 aa |
Translation table | |
GC content | 43% |
IMG OID | 640393453 |
Product | predicted protein |
Protein accession | XP_001387089 |
Protein GI | 126275470 |
COG category | |
COG ID | |
TIGRFAM ID | |
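
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming 1-based inclusive coordinates and a protein length that excludes the stop codon; the function names `span_length` and `coding_length` are hypothetical and are not part of any database API.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a 1-based, inclusive coordinate span (assumed convention)."""
    return end_bp - start_bp + 1


def coding_length(protein_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode a protein, optionally counting the stop codon."""
    return 3 * (protein_aa + (1 if include_stop else 0))


# Values copied from the record above.
gene_len = span_length(2724606, 2727924)   # compare with "Gene Length"
cds_len = coding_length(997)               # 997 aa plus one stop codon
non_coding = gene_len - cds_len            # remainder of the span that is not coding

print(gene_len, cds_len, non_coding)
```

Under these assumptions the span length agrees with the listed Gene Length, and the remainder is what is left for non-coding sequence within the span, which is consistent with the gene being marked as spliced.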
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.150116 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.378562 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | CCAGTTAAAG TATTCACCAG TAACAACAAC CCCTATTCGT GTAAGCCGTT AAGCGTTATA GGTTCAGACT GGAGAAGATC TGATCTCCAA CTGTTGCAGA TATAATCGCA TTATCTGCAG GATTCCAGAT AAGTAGTCGC ACTACGAAAA ATCCAAATTC CATACAGAAA TTAAACAATC TACAGTATGA AGATGAAGAT TGGCTGGTTT CGAATACTTT CTCAATTGAC TCCCACTGTC CACTACAATT CCTTCGCTAA ACCACATATG ACAATATAGA ATGAGTGGTT CAATTGTGCG GGCTTTGAAC TGGGACTCGG GCTACGAGCA GCAGTTTCTT GCTGTGAATC CAATTGGAGA CGAAGTCCTT CTCTACCAGA CTAATCATGA AGACCCAGGG ATCGAGTCCA ATGACTTGAT CAAGTTGAAC AGCCGAACGG GGTTCGAAAA CATACAATGT TCGTCGTACT CGATGATTAA CCGAGGGATA ACTGGAGTCG GTTCCATTTC CGGAAACATC TCTATCTTCG ACATCAATTC TAACAACTCG TCGATTCTCA AATTGAGACC CAAACAGAAT CGTCCATGTA ATGCGATTTC CTTCAACAGC AGTAACTTAA TAGTAGCAGG TTTTGACAAG GGCCGTCAGG ACAATTCGTT GCAAATTTGG AACATCGAGC ATTACTCACG AAACAGTACT AATGAGCATA TAAAAAGACC TGTGGCTACA TACTTACCCA ATGAGGCTAT CTTGCTGTCT ATATTCTATC CTGACAGAGA AGGCAGTATA CTCTGTGGTT CATACAAATT TTTGCGTGAA ATAGACCTAC GTGTGGACCA GCCGATTTTC CAGATGGCGA CAAAATGTAC GTTGGGTTTA GCAGTAGACC ATTTCCGAAC CCATTTGTTT CTGTCGTTCA GTGAAGATGG TTCTTTAGCC ATTTGGGATA GACGGAAATT GACATCGAAT ACGGCGGTTA AACCTAAAGG TCCTCTCACG TCAGGAAATG TCATCACCGA AACTCCAGTA TTGCAATTCG TAAAATTACT CAATGATTCA ACTTCTCGAA AGAACCAGAA TCCTTGTGTA CGATATTCTA CTATCAGAAA AGGTGAGTTT TCAGCTATAT TCAACGGAGA TTTGATTAGA AGATGGAATA CAGGCATAGT TCCAGCAACC AGCTCAACCT CAATCTCGGA GAAGAGTTCT AGGGAGAGTA ATCCTACTCT TGCCAGCTTG CAACAACAAT CGCAACAACT ATACAAACCT ACTGATGAGT CTCTATTTGT GTCACTAGTA TTGGATGTAA AGACAGACTA TGAACGTGTA GTTTCGTTTG ATTATTCGCC AGATATCACA TCTGCTACTT CAACTAATTT CGTATGTATG CGCCAATCCG GTTCGGTGTT TAGAATGCCA GCTGTAGAAT GCATTGAGTC CCTTGATTTC AACTCTCTTA ACGAGTTTAC AATCGCTGGA CCTGAGGGAA CATTGACCAA GTTTTCGGAT CAGGAGGAAC TAGCTAAGGT GCGAGCAGAA GCTGCTGCTA TGGCTCAGAC AGCATCGAAT CGTGGAAACT CCGTAGTTAA CAAACTCGCT GATCTAGGCA TTATTGACGA AACAAGAAAG TACAGCGAGG CAGAGTTTTC CGAAGATATT GAGTCCACTG TAGATGATGA AAGTGCCATT GCTCCTTATG AAGCTGATAA TGCCAATAGA TACAACTTTG GCGATCTTGA AGTAGATATA CATTTAAACG ATATTCTTGA TGCATCGGCT GTTATACATA GCGACATCTG 
CTCCACAATA AGAAAAAGAG CAATACTAGG CTATGGTGTC GATTGCGACA GAAACATTCG CGTTTTGGAA GACTTGGACT CTCTCAACAG TCAACTTTTC TTGAGGAACA CCTGGAAGTG GTTGGGGTTG GCTAAGAAGT CCTTGGAAAA GGGTACCATG ATCTCCGAAG GGATTGACTT GGGGTATCAG GGAGTATTGG GAATCTGGGA AGGAGTAAAA GAAATGGATA ACCAGAAACG GTCTGTGCCT GAAGCGGGTC TAATAACCGA TGGCTGGTTT TCCCATGCTG TCAAGTCTAT TGTTTCATCT AAGGGAAAGA AGACAGCTGG TATCAATATC GCTAGTAACA GCGAAAAAAA GGCGCAACGG AAACTTTGTT TAATTGTCTC TGGGTGGTAT TTGGCAGATA GTGAATTTGA GGAGAAATTG AATATTTTGA TTTCTTTAGG ATACTCAGAA AAAGCTGCTG GTTGGGCAGT TTTCCATGGC GATGTGCCTA AAGCTATTGA AATTCTTGCC AATGCGAAAA AGGAAAGATT GCGATTGATG TCTACGGCTG TAGCTGGTTA TTTGGCATAC AAAGATTCCA ACGTTAATAG TCCGTGGAAA GACCAATGCC GGAAGATGGC TTCAGAGTTG GACGATCCAT ATCTCAGAGC CATTTTTGCG TTTATTGCAG ACAATGACTG GTGGGACGTG CTTGATGAAC ATTCGTTGCC GTTGAGAGAA AGACTAGGTG TAGCCCTCAG GTTCCTTTCA GATAAGGACT TGAATGTTTA CTTACACAGA ATCGCCGATA CTGTAGTCAA CAAAGGCGAA TTGGAAGGGC TCATTTTGAC CGGAATTACA CCTCGAGGAA TCGACTTGTT GCAGAGCTAT GTAGATAGAA CCAGCGATGT CCAGACTGCA GCATTGATTG CCGCGTTTGG GAGCCCCAGG TATTTCTCCG ACGAACGAGT AAGGCATTGG ATTGATTGTT ACAGAAGCTT GTTGAACAGT TGGGGACTCT TTAGTGTACG AGCCAAATTT GATGTAGCTC GTACTAAGCT TTCCAAGAAT GCCGCTGGCA CTTCGACCAT TAAACCTTCG CCAAAGCAGG TTTACTTACA GTGCTCCAGA TGTAACAAGA ATTTATCAAA ATCGAAGACA ACTAACTCCA ACAGTCTTCC TGGTTCGAAC CCTCAGGCAA TCATCAAACA ATTCAACAAA ATGAACCACC ACAACAATAA TAGTAGCAAA CTGGCTACAA ATGACATTGC TGCTTGTCCT CATTGTGGTG CTCCGCTTCC TCGTTGTTCT GTTTGTTTGC TTACCTTGGG TACACCTCTT CCATTGGAGC CATCCGAGAA AATCCAGGAA GTCACATTGG CCAACAAAAT CGAAAACAGA TTTAGAGAGT GGTTCAGTTT CTGCTCTAGC TGCAACCATG GCTGCCATGC ACATCATGCT GAGGAGTGGT TTTCCAAGCA CTACGTTTGT CCTGTTCCGG ATTGTAATTG TAGATGTAAC AGTAAATGA
|
Protein sequence | MSGSIVRALN WDSGYEQQFL AVNPIGDEVL LYQTNHEDPG IESNDLIKLN SRTGFENIQC SSYSMINRGI TGVGSISGNI SIFDINSNNS SILKLRPKQN RPCNAISFNS SNLIVAGFDK GRQDNSLQIW NIEHYSRNST NEHIKRPVAT YLPNEAILLS IFYPDREGSI LCGSYKFLRE IDLRVDQPIF QMATKCTLGL AVDHFRTHLF LSFSEDGSLA IWDRRKLTSN TAVKPKGPLT SGNVITETPV LQFVKLLNDS TSRKNQNPCV RYSTIRKGEF SAIFNGDLIR RWNTGIVPAT SSTSISEKSS RESNPTLASL QQQSQQLYKP TDESLFVSLV LDVKTDYERV VSFDYSPDIT SATSTNFVCM RQSGSVFRMP AVECIESLDF NSLNEFTIAG PEGTLTKFSD QEELAKTASN LNKLADLGII DETRKYSEAE FSEDIESTVD DESAIAPYEA DNANRYNFGD LEVDIHLNDI LDASAVIHSD ICSTIRKRAI LGYGVDCDRN IRVLEDLDSL NSQLFLRNTW KWLGLAKKSL EKGTMISEGI DLGYQGVLGI WEGVKEMDNQ KRSVPEAGLI TDGWFSHAVK SIVSSKGKKT AGINIASNSE KKAQRKLCLI VSGWYLADSE FEEKLNILIS LGYSEKAAGW AVFHGDVPKA IEILANAKKE RLRLMSTAVA GYLAYKDSNV NSPWKDQCRK MASELDDPYL RAIFAFIADN DWWDVLDEHS LPLRERLGVA LRFLSDKDLN VYLHRIADTV VNKGELEGLI LTGITPRGID LLQSYVDRTS DVQTAALIAA FGSPRYFSDE RVRHWIDCYR SLLNSWGLFS VRAKFDVART KLSKNAAGTS TIKPSPKQVY LQCSRCNKNL SKSKTTNSNS LPGSNPQAII KQFNKMNHHN NNSSKLATND IAACPHCGAP LPRCSVCLLT LGTPLPLEPS EKIQEVTLAN KIENRFREWF SFCSSCNHGC HAHHAEEWFS KHYVCPVPDC NCRCNSK
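
The sequences above are displayed in space-separated blocks of ten residues. The following is a minimal sketch of how such a field could be normalized and a GC percentage computed from it; the function names `clean_sequence` and `gc_percent` are hypothetical, and this is not necessarily how the listed GC content value was derived.

```python
def clean_sequence(field: str) -> str:
    """Remove all whitespace from a block-formatted sequence and upper-case it."""
    return "".join(field.split()).upper()


def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not sequence:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in sequence if base in "GC")
    return 100.0 * gc_count / len(sequence)


# Two ten-base blocks taken from the gene sequence above (illustration only).
example = clean_sequence("ATGAGTGGTT CAATTGTGCG")
print(len(example), gc_percent(example))
```

Applying the same two functions to the full Gene sequence field would give a value to compare against the GC content listed in the record.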