Gene Information |
Locus tag | PICST_55209 |
Symbol | |
ID | 4836789 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009042 |
Strand | - |
Start bp | 2667068 |
End bp | 2669974 |
Gene Length | 2907 bp |
Protein Length | 912 aa |
Translation table | 12 |
GC content | 43% |
IMG OID | 640388104 |
Product | predicted protein |
Protein accession | XP_001383264 |
Protein GI | 150864447 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
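The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes 1-based inclusive coordinates (which is what the reported Gene Length implies) and assumes the stop codon is counted inside the gene span; the function names are hypothetical and not part of any database API. Because the record marks the gene as spliced, the bases left over after accounting for the protein are interpreted here as intronic, which is an inference rather than an annotated value.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def non_coding_bp(gene_len: int, protein_len: int) -> int:
    """Bases in the gene span not explained by the protein.

    Assumes the span includes one stop codon, so the coding part is
    (protein_len + 1) codons of 3 bases each.
    """
    coding_bp = (protein_len + 1) * 3
    return gene_len - coding_bp


length = gene_length(2667068, 2669974)
print(length)                      # 2907, matching the Gene Length field
print(non_coding_bp(length, 912))  # 168 bases not accounted for by the 912 aa protein
```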
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACAGCG CAAAGTACGC TAAAGACATC ATCCAAGACA GAGTCAGTCT TTACAGCACA TCGTCGAATG TGCCCATCGC TTCGGTGAAG CCTCTCAGAA GAAAGTTTCC TCCTAACCAG CCCATCCCAG AAGCGCAAAA CTACTCGAGG TCCCAGAGCA TCCTCGAATG TACACCCCTA AAGACAGATG GAAAAGGAAA TTTTGTGGAT GAACAGGGCA GAAAAACCAC GCTAAGAGGC ATCAACGTGG ATGGTGCCAT GAAGTTGCCC ATTAACATCC CCTCATACAT AGGAGATGGA AGCAACCCAG ACTCTTTCTT CTTCGACGGA GAAAACGTAA CCTTCGTTGG TCGTCCCTTT CCTCTCGAAG AGGCAAGCCT GCATTTCCAA AGAATCAAGT CCTGGGGCTA TAACACCATC AGATATTTGT TGACGTGGGA AGCCATAGAG CATGGAGGAC CAGGAGTGTA CGACCACGAG TATATTGATT ATACTGTGAA GATTCTTAAA ATACTTCATG AAGTAGGAGG ACTCTATGTA TTGCTTGAGT TCCACCAGGA CGTGTGGTCT CGTTTCTCAG GTGGAAGTGG GGCTCCTATG TGGACTCTCT ATGCTGCTGG CTTCCAACCT CAAAGATTCA CAGAAACAGA GGCTTGTATC TTGCACAACG AGTCGAGATT CCACGATGAC GATGACCCGG AACACTACCA CAAGATGCTC TGGACGTCCA ACTACAAACG TTTGGTGGCT TTCACTATGT TCACCCTCTT CTTTGGAGGC AGAAATTACT TTCCAGACTT AATTATCAAC GGCCAGAACA TTCAGGACTT GTTACAGAAT CACTACTTCA ATTCGGTAAG ACTTATCTGG AAGACAATCA ATAATAAATT GCCAGACATG ATCAAGGACG GAACAATTCT CGGGTTTGAA CTGATGAATG AACCAAATGC TGGTTTATTC GGCCACGAGA ACCTTGGTCT CATTCCAGGA AACCAGCAGT TGAGAGTTGG AACAACACCA ACAGTGTTCC AAGCTTTGAA ATTGGGAATG GGGTTTGCTT GTGAAGTCGA TGAGTACAGG ATCACTATTG CAGGACCTCA GAAGTTCTCC ACAAGAGTTA TAGATCCCAA GGGAGCTCGT GCCTGGCTCA CTGTAGAAGA AGCAGCGGAA ATAGACACCA AGTACAAGTG GAAAAGAGGT GAAAAATGGA AGCTTGGTGA ATGCATTTAT GCTCTGGAGG GCATTTGGAA GTGGAACGAA AACATTGACT TTGACAACTT TCCATTCTTA ATCGAGGACC AACGTGTAGC TCTTACTCAG ACTGAATGTC AATTACTTGA CGAGCATTAT TTTAACAGAG TTGACGAACG TCACAACTTC ACGACATACC ATGGTGTTGT TCCCGATAAA ATTGACATCG ACTACTTCAT CAACAATAAC TTCATAGACT TTTACTTGAG ATTCAAGAAG GTAATCCGTG ACATATGTCC TACAGCTATT CTTTTCATAC AGCCACCAAC ACTTGAGCTA CCACCTGATA TTAAGAACGA TTCTCGTGGC ATTATAGATG ATAGAACCGT TTACTGCCCT CACTACTACG ATGGGATGTC GCTTATGTTT AAGAGCTGGA ACACTAAGTA TAATGTGGAC ACTTTGGGAA TCATGCGTGG TCGTTACTTG AATCCGGTAT TAGGTATTGT TTTTGGTGAA AGGGCCATTC GTAACAATAT CAAGAAACAA TTCGCCGAGA TGAAACGAGA ATGCCACACT TACTTGGGAA ACATTCCCAT CTTGATGTCG 
GAGACAGGTA TGCCATTTGA CATGGACGAC AAGAAGGCTT ACAAAAACCT TAAATACCAC TCCCAAACAG CTGCATTGGA TGCTATCTGT AATGCCATCG AAGCCAATAA CTTGAACGTG ACCTACTGGT GCTACACAAG TATTAACAAT CACAAGTGGG GAGACAATTG GAATAACGAA GACTTTTCTT TTTGGAGTTC TGACGACAGA AATCGGCTTA TGGACGATGA CAATCAAAGT TTGAGATCTT CCAAGTCATA CCAGCCTTCG TTGTCGCTTT TTTCTTCTAT TGGACAGGCA AGATCAACTA TCAGATCCAC AATCAGACAT GTAAGAACAA GGAGAATTGA AGCAGTCAAT TTGGTCAGGT CAAAATTATA TTATACGCAA ATCGAAGAAA CAGATATCCA GGAAATCACC CAAAATGGTG ATGTAGTGAA GAATGGAAAA AACAAGAATA ATCTTGCTTC CAAAAGCAGC AATGGAACAA GTGACACATT GATAGAAAAT GGAGTTGTGG AGGATCTTTC TTCTGAAGGT AACAGCTGTA TCTTTGATAA GACTGCCCAC GATGAATCCG ACGCCCAGTC AGTTCAGGCA TCGTCTGTTA TATCAGCTAG GTCTGACAAC ATGAAGTTCA AACATGCACG TAATTGCTTT CCAAGTCCAG ATGGTGTTAG AGCTGCTGGT GCTGTATTAC GTCCTACAGT GTTAGCAACC AAGGGTGAAA TTCAAGTCAT GGAATTTGAC TTGAAGTCAG TTAAATTTGG TGTATGTTTA TTGATCGATA GAACCGATTC TGACTTGGCC ACAGTGCCGA CCATTATCCA TCTTCCCAAG TGGCACTATC CCTTCTTGAG CTACAGGGAT ATTTTCATAT CTTCAGGACA TGTGAAGTAC CACAGCGAAT TGGAGTATTT GGAATGGTAC CACACTGAAG AGGTAGTCAG TGAACTGGAT GAGGCTATAG AAGTTCCCAT AGCCACAGGC GTCACCAAAG AATCGTTGAT TATCAAGAAC TACAGTGGAT CCTTAGATGA CGCTGTAGTG TCCGAAGAAA GAGGCATCTT TCCATGTGCT GGAGAACTCA GCTGTCCAGT TCAATAA
|
Protein sequence | MNSAKYAKDI IQDRVSLYST SSNVPIASVK PLRRKFPPNQ PIPEAQNYSR SQSILECTPL KTDGKGNFVD EQGRKTTLRG INVDGAMKLP INIPSYIGDG SNPDSFFFDG ENVTFVGRPF PLEEASSHFQ RIKSWGYNTI RYLLTWEAIE HGGPGVYDHE YIDYTVKILK ILHEVGGLYV LLEFHQDVWS RFSGGSGAPM WTLYAAGFQP QRFTETEACI LHNESRFHDD DDPEHYHKML WTSNYKRLVA FTMFTLFFGG RNYFPDLIIN GQNIQDLLQN HYFNSVRLIW KTINNKLPDM IKDGTILGFE SMNEPNAGLF GHENLGLIPG NQQLRVGTTP TVFQALKLGM GFACEVDEYR ITIAGPQKFS TRVIDPKGAR AWLTVEEAAE IDTKYKWKRG EKWKLGECIY ASEGIWKWNE NIDFDNFPFL IEDQRVALTQ TECQLLDEHY FNRVDERHNF TTYHGVVPDK IDIDYFINNN FIDFYLRFKK VIRDICPTAI LFIQPPTLEL PPDIKNDSRG IIDDRTVYCP HYYDGMSLMF KSWNTKYNVD TLGIMRGRYL NPVLGIVFGE RAIRNNIKKQ FAEMKRECHT YLGNIPILMS ETGMPFDMDD KKAYKNLKYH SQTAALDAIC NAIEANNLNV TYWCYTSINN HKWGDNWNNE DFSFWSSDDR NRLMDDDNQS LRSSKSYQPS LSLFSSIGQA RSTIRSTIRH VRTRRIEADL SSEGNSCIFD KTAHDESDAQ SVQASSVISA RSDNMKFKHA RNCFPSPDGV RAAGAVLRPT VLATKGEIQV MEFDLKSVKF GVCLLIDRTD SDLATVPTII HLPKWHYPFL SYRDIFISSG HVKYHSELEY LEWYHTEEVV SESDEAIEVP IATGVTKESL IIKNYSGSLD DAVVSEERGI FPCAGELSCP VQ
|
| |
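The record lists translation table 12 (the alternative yeast nuclear code), in which the codon CTG is read as serine rather than leucine. The minimal sketch below illustrates this on two short fragments copied from the gene sequence above. The codon table is built from the NCBI amino-acid string for table 12 in the usual TCAG ordering; the names `CODON_TABLE` and `translate` are hypothetical helpers. The sketch does not remove introns, so it is only suitable for fragments that lie within a single exon, not for translating the full spliced gene.

```python
from itertools import product

BASES = "TCAG"
# Amino-acid string for NCBI translation table 12, codons ordered TTT, TTC, TTA, TTG, TCT, ...
# It differs from the standard code at one position: CTG is S (Ser) instead of L (Leu).
AAS_TABLE_12 = "FFLLSSSSYY**CC*WLLLSPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AAS_TABLE_12)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA fragment; spaces are ignored, trailing partial codons dropped."""
    dna = dna.replace(" ", "").upper()
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First 30 bases of the gene sequence
print(translate("ATGAACAGCG CAAAGTACGC TAAAGACATC"))   # MNSAKYAKDI

# Fragment containing a CTG codon: read as S here, where the standard code would give L
print(translate("CCTCTCGAAGAGGCAAGCCTGCATTTCCAA"))     # PLEEASSHFQ
```

Both outputs agree with the corresponding stretches of the protein sequence in the record (`MNSAKYAKDI` at the start, and `PLEEASSHFQ` further along).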