Gene Information |
Locus tag | PICST_29930 |
Symbol | |
ID | 4837404 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009042 |
Strand | + |
Start bp | 1523885 |
End bp | 1525828 |
Gene Length | 1944 bp |
Protein Length | 575 aa |
Translation table | 12 |
GC content | 43% |
IMG OID | 640388719 |
Product | predicted protein |
Protein accession | XP_001382514 |
Protein GI | 150863884 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
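The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive (the record does not state this, but the result matches the listed gene length). It also assumes the coding region includes one stop codon after the 575 residues. The variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 1523885
end_bp = 1525828
protein_aa = 575

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the coding sequence is the protein codons plus one stop codon.
coding_bp = (protein_aa + 1) * 3

# Whatever remains is non-coding sequence inside the gene span,
# presumably intronic since the record lists the gene as spliced.
non_coding_bp = gene_length - coding_bp

print(gene_length, coding_bp, non_coding_bp)
```

Under these assumptions the span is 1944 bp, matching the Gene Length field, and the 216 bp not accounted for by codons is consistent with "Is gene spliced | Yes".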
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.470261 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAGAGCC CAGAAGACAA GCTGATTTTT AAGCTCTTGT TAACGATACC CACCGACGAC TCATACTATA ACGCTCATTT CCAGGTTGGA AAGTCTGAAA ACCAGGTAGA CTTGCGGCTA GATTTGATCC AGCCGGAAGT CTGGGTCATG AACGATAATG CGTTCTTTGA CTGTGACCAT ATTGATGAAT GGTGGAGTTC AGAAGAAAAG GCATACAGTC AATCCTCAGA CTTGCCAGCA CTGATAACCA CTGAATTAGA ATACTTGGCT ACAGTATGTG GGCAAGGGGG CCTCTACACA CTGTCTACTG GCACTGCCAT GCCTACTGCC ACAGTTGATG GCCTAGAAAA CGGCGACCCG TACTTAATTC CGTATATCAA CGTAATTGAA GCATCAGGAG TATTTGCTAC CGACGATATA CGATTCAATC TCTCGACGGG TGCCCTGTTT TTGATGCCTA ACTTTACGTT TCTCAACGTT AACCACACCA ACATGTATTT CGGAGGATTG GGAGTGGCAG GAAACCCACG AGGCTCAGGT TTTCTCGATA CTCTAACTGA GAGAGGAATC ATCAAGTCTT CAGGTTACTC CCTTTGGTTC AATAACCAGA CTGGTACGGA TGCTCTTGGG CAGTTGATAC CAGGTGTTGT AGATTCCAAA TACTATGACG GCGACTTCTA TGTGTTTGAC ATGTTGCCTC ACTCTGGTAT CCGTTTTCCT GTGGCAGAGC AATGGGCCAA CAATGTGTTG GATGGGTTGA TTTTGCCAAC GTTGCAGATC GACGATGTTC GTGTAGTAAA TCTGAACAGC AAACAGAGTC TCTCGTTAAA GTCGCTGAAC GAACCCATTC CGATCATATT GGACTCAAGG TCTACGTATA ATTACCTTCC CCTAGATGTT ATTGTAAATT TGGCCATCCA GCTCAATGCT TACTACAGTA ACGAAGCTGG AAGATGGTTG GTAGAATGTG ACACTGTAGC TGATTCGGGA GGATTGTTCA GTTTTGTATT CGAAGGTTTG CAAATCAGAA TCCCACTTTC GGAGTTCATG TCTGAAGCCT ACTTTCAGGG AAGTCTCTTG AAATTCTCCT CTGGTGAAAG AGCCTGCTAT TTGTCAGTCT TACCTACAGA TCTGAACGGT TTCAACCTGC TCGGATTGCC GTTTATCAAG AACATCTACC TTGCTGTGGA TAACGCTGGA CAGCAGATAG CCTTGGCCAA TTCAAATAGA AATCTTGTAT TGGAGAAGGA CGATTTCTCT GCAGTAGATA GCACATTCTC CCAGACTACA ACGGTTGCAG CTGGAGGTTC TTCAAGCGCA CGAAACGCGT CAGTTTCGAT TGCATACATT GAATCCGGAT TCATTCCGTT TGCTACTAAA GTCAACAATA CTTCTGAGCA AGAGTTGACC TTCACCTTTT CTACAGTTAG CGACAGTCTG AATAATGCAG TATTGGATAT TCCAGCAAGG TTCTCTGGTG CAATCATCAG AAGTGGGGAG ATCATCGTGA CTGGAGTTCA GACAGGGACA GGTAGTGGTG CATTCAGCAC TACCACGATC CTTCTACCAG GAATGGCTAG TGCTGCAAGT GAAAGAACAA CCAGTACCAA TGCTGCTCGC AGCTTAAAGG GAAATACTAT TGGAAACAGC ATTCTCCACT CACAGCATAG TCTCAGCATA ACAGTGGTGT TAGTGTTTTC GTTGATCATA GGCATTTGCA TTTTATAGTG CAACAAAAGT CGTTTGTAAG ATTTGTAATG AGCAATAATT AGTGTATGAT AATATAGTAA TAATAGAAAT 
AATAGTAATA TATTAATAGT CATAAGTTCA CTATTTAACC AGGACTATAA TTTGCGCAAC TCTATGTCGT CACCATCATC CGTAGTGTCA TTGGTGGGAT CAATGTTTTC TATATCGTCA TTTCTGGCCT TGAGACTCAC ATAA
|
Protein sequence | MESPEDKSIF KLLLTIPTDD SYYNAHFQVG KSENQVDLRL DLIQPEVWVM NDNAFFDCDH IDEWWSSEEK AYSQSSDLPA SITTELEYLA TVCGQGGLYT SSTGTAMPTA TVDGLENGDP YLIPYINVIE ASGVFATDDI RFNLSTGASF LMPNFTFLNV NHTNMYFGGL GVAGNPRGSG FLDTLTERGI IKSSGYSLWF NNQTGTDALG QLIPGVVDSK YYDGDFYVFD MLPHSGIRFP VAEQWANNVL DGLILPTLQI DDVRVVNSNS KQSLSLKSSN EPIPIILDSR STYNYLPLDV IVNLAIQLNA YYSNEAGRWL VECDTVADSG GLFSFVFEGL QIRIPLSEFM SEAYFQGSLL KFSSGERACY LSVLPTDSNG FNSLGLPFIK NIYLAVDNAG QQIALANSNR NLVLEKDDFS AVDSTFSQTT TVAAGGSSSA RNASVSIAYI ESGFIPFATK VNNTSEQELT FTFSTVSDSS NNAVLDIPAR FSGAIIRSGE IIVTGVQTGT GSGAFSTTTI LLPGMASAAS ERTTIISSLF NQDYNLRNSM SSPSSVVSLV GSMFSISSFS ALRLT
|
| |
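The record lists translation table 12 (the alternative yeast nuclear code), in which the codon CTG is read as serine rather than leucine. The sketch below illustrates the effect on the first 30 bases of the gene sequence. It builds the codon table from the standard NCBI amino-acid string in TCAG order and overrides CTG. The helper names `codon_table` and `translate` are hypothetical and not part of any tool referenced by this record.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code (NCBI table 1), codons ordered TTT, TTC, TTA, TTG, TCT, ...
STANDARD = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def codon_table(ctg="S"):
    """Return a codon -> amino acid map; ctg='S' mimics table 12, ctg='L' table 1."""
    codons = ("".join(c) for c in product(BASES, repeat=3))
    table = dict(zip(codons, STANDARD))
    table["CTG"] = ctg
    return table


def translate(dna, table):
    """Translate complete codons of a DNA string, ignoring spaces between blocks."""
    dna = dna.replace(" ", "").upper()
    usable = len(dna) - len(dna) % 3
    return "".join(table[dna[i:i + 3]] for i in range(0, usable, 3))


# First three 10-base blocks of the gene sequence in this record.
first_blocks = "ATGGAGAGCC CAGAAGACAA GCTGATTTTT"

table12_peptide = translate(first_blocks, codon_table("S"))
standard_peptide = translate(first_blocks, codon_table("L"))
print(table12_peptide, standard_peptide)
```

With CTG read as serine the fragment matches the start of the listed protein sequence (MESPEDKSIF); with the standard code the eighth residue would be leucine instead. Translating the full gene sequence this way would also require removing the intron first, which this sketch does not attempt.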