Gene Information |
Locus tag | PICST_32673 |
Symbol | TIP39 |
ID | 4840302 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009046 |
Strand | - |
Start bp | 370409 |
End bp | 372649 |
Gene Length | 2241 bp |
Protein Length | 746 aa |
Translation table | 12 |
GC content | 37% |
IMG OID | 640391617 |
Product | Tuftelin-interacting protein TIP39, contains G-patch domain |
Protein accession | XP_001385756 |
Protein GI | 150866231 |
COG category | |
COG ID | |
TIGRFAM ID | |
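The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end coordinates are 1-based and inclusive, and that the reported gene length includes the untranslated stop codon. The variable names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 370409
end_bp = 372649
gene_length_bp = 2241
protein_length_aa = 746

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS length counts the stop codon, which is not translated,
# so the protein has one residue fewer than the number of codons.
codons = span_bp // 3
expected_protein_aa = codons - 1

print("span matches gene length:", span_bp == gene_length_bp)
print("span is a whole number of codons:", span_bp % 3 == 0)
print("protein length matches:", expected_protein_aa == protein_length_aa)
```

Under these assumptions all three checks agree with the record: 2241 bp is 747 codons, or 746 residues plus a stop.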
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.541012 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.0285832 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGTCGTTTC TGGGACTCGC ATTTACCAAA GCAAGTAATA ATAATTCCAA TAATGATACG ACTAATAATA ATGTGGGTAG TAAACCCTTC AGTATGATGG CAGCAGCAGT TAGAGATGGA TATTCAGATT ATGAAGAGGA GATGGAGGAA GATGTAGTTG TACATAACAA TGAGAGAGAC CTGAAGTCAA AGTTTGCGTC ACCTACAGAA TCATTTAACA AGAATGCCGC TTGGCTCAAC CCACCGACGG CAGTGAATTC CAATTTTCAG AAGTATGGAA TAGGTGCACA ACTCTTGGTT AAAATGGGAT ATCAAGTAGG CAAGGGTCTC GGAGCTAATG AAGAAGGTAT TGTTAATCCA ATAGAGACTA TGTTGCGGCC AAAGGGATTG GGTGTAGGTG GTATACGAGA AAGAAAGGAT TCAAATCGAA AGAAGGGGAA CGATGATGTC GATATGGATA TCGAAAGCAG TGACGATGAA CTGACATCAT TACAAAAGAG AGAATCGATT AACTTATACA GTATTGTAGA AGAGTTGGAA TTGAAAAATA TGACTGTTCC CAAAAAGTAC ATCGAATTGT CTGATCAGTA CTCTCAAAAT CAGTACTCAG AGGATCTATA CGATCAAGTT AAAGAGGCAT ACAATCAACT CAGCAGAATA TCAGAAGAAC TCAATTCGCT TGATCAACAA GAAAAGTTCA TAACATATCA GTTAAAGGAT ATAAATCTCA AGTCAAATAC CCTTGAATCA GAACTTTCTT CCACTGTCGT CACGTTAGAG AAAATTGAAG AAACATTACC GATTCTACAG AACACGAAAG ATTCCATCGC AATAATTAAA GAAGTGGATA TACTACTTCG TTCAATATTG GAAACACCAC TGAAATCATA TCCCAACATT AGTTCATTGT GTGGGAGCAT AGTTGCTGTG GCTATACCAT TACTCTTCAA TGATGGAAAC ATTGATGAGA AAAGTTCGAG CTTCAATACA TTAGCCGAAT GGTCGATCAT CTATAGAGAA GTAGAAAAAA GTGCACTGTC AGAATTGTCG TTTTTTGATT CCTTGATCTA TCCCCAAATT GCAAATCAAA TATCGAAGAC TATTTCCCTG ACGATACCTG CAGAAGAAAG GAGTATTCAA ATATTGAACT ATTTGCAATT CTGGCTTGAG TCACCGATTA TCATAAACCC ACAATTTTGC ATTCAAGAAA AGCTCATGCT GGAACTAGTA ATTCCCTTCA TATCTGAATC AATAGATCAA TGGAACCCGC TTGATCCTAA AGATTCGTCT CCGACCTTTC TTATTGATTA TCTTGCCGGA GTAGTTGACG GTGATATTTC TTTATTTGAA GAGATTTTAC AAAATGTAAA TCAAAAATAT TTGGACTTCA TAAACTATAA CAATCCCAAA TCTATTTGGC AAAACATTTG CAAAGCAACT CTGATTGAAG AAATTGCTTG GATAGAAGAA ATCAACAAAT TTACCAAAGT ATGGATTCCT CTTTTCAAGG AATACCCACA ATTAGATTTT GCTGCATCCA ATGGAACAGA ACTTTTGGAA GCCTTGACAT GTGGACTACA GTTTCCTTTT AACATTCGCT CGGATCTTCG AGTAAAATTA TTATTAGCAT TTGAATTGCT TTCTTTGATA GAAGAAAATG ATTCTGTCAT ATTACTTCAG TTCACCTGGG GCAACAATTG GATTGAATCA CTCTTAAGTA TGATTGAGGA AAATCCATCG AGAGTCGCAG AAAGTTACAG AGAGTGGTTT AAGTATTTTC AAGAAACAAA GAACTCGTAT 
CTGGAAAGAA TACTTGATGT GATTAAATGG TACCTAGATC TTGTATTGAA TATTATTGAG AATGGGAGGG ACGTTGCAAA TAAACTTCCA AATATTAATG GAAATGCACA CCCAGAAAAG TCTGAAGTTT TGAAGTTGAT CCAAAATAGC ACTTCGAACG AAAAGGAAGA GCATCGCTCT GTTCATGGTA TTCCCGCTTA CCGACTAATG ACATCATTCA AAGATGTCGT GGCAGATTAC TGCTTACAGA AGGGATTACT TTTTGGCTCC GAAAAGAACA GAGTACATCC GACATTAGGG TATCCACTAT ACACTATAAA GAGTTCGAGA GGTACAAAGC TCTTCTGCTA TGCCGATCAG GATGTGCTTT GGATTGCACA AAGTGGCGAA GCTATGGAAT ATGATCCTAT TTCGCTTGAC GACTTACTAC TGTATTTATA G
Protein sequence | MSFSGLAFTK ASNNNSNNDT TNNNVGSKPF SMMAAAVRDG YSDYEEEMEE DVVVHNNERD SKSKFASPTE SFNKNAAWLN PPTAVNSNFQ KYGIGAQLLV KMGYQVGKGL GANEEGIVNP IETMLRPKGL GVGGIRERKD SNRKKGNDDV DMDIESSDDE STSLQKRESI NLYSIVEELE LKNMTVPKKY IELSDQYSQN QYSEDLYDQV KEAYNQLSRI SEELNSLDQQ EKFITYQLKD INLKSNTLES ELSSTVVTLE KIEETLPILQ NTKDSIAIIK EVDILLRSIL ETPSKSYPNI SSLCGSIVAV AIPLLFNDGN IDEKSSSFNT LAEWSIIYRE VEKSASSELS FFDSLIYPQI ANQISKTISS TIPAEERSIQ ILNYLQFWLE SPIIINPQFC IQEKLMSELV IPFISESIDQ WNPLDPKDSS PTFLIDYLAG VVDGDISLFE EILQNVNQKY LDFINYNNPK SIWQNICKAT SIEEIAWIEE INKFTKVWIP LFKEYPQLDF AASNGTELLE ALTCGLQFPF NIRSDLRVKL LLAFELLSLI EENDSVILLQ FTWGNNWIES LLSMIEENPS RVAESYREWF KYFQETKNSY SERILDVIKW YLDLVLNIIE NGRDVANKLP NINGNAHPEK SEVLKLIQNS TSNEKEEHRS VHGIPAYRLM TSFKDVVADY CLQKGLLFGS EKNRVHPTLG YPLYTIKSSR GTKLFCYADQ DVLWIAQSGE AMEYDPISLD DLLSYL
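The record lists translation table 12 (the alternative yeast nuclear code), which differs from the standard code in reading CTG as serine rather than leucine. The sketch below shows how that affects the first 30 bases of the gene sequence. It is a minimal illustration, not the database's own pipeline: the helper names `build_table` and `translate` are hypothetical, and only the CTG reassignment is modelled (alternative start codons are ignored).

```python
BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
STANDARD_AAS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def build_table(overrides=None):
    """Return a codon -> amino acid dict, optionally with reassigned codons."""
    codons = [a + b + c for a in BASES for b in BASES for c in BASES]
    table = dict(zip(codons, STANDARD_AAS))
    table.update(overrides or {})
    return table


# Translation table 12: CTG is read as Ser (S) instead of Leu (L).
TABLE_12 = build_table({"CTG": "S"})


def translate(dna, table):
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the 10-base grouping spaces
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = table[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base groups of the gene sequence above.
prefix = "ATGTCGTTTC TGGGACTCGC ATTTACCAAA"
print(translate(prefix, TABLE_12))       # MSFSGLAFTK
print(translate(prefix, build_table()))  # MSFLGLAFTK
```

The table 12 result matches the first ten residues of the protein sequence in the record, while the standard code would place a leucine at position 4, where the fourth codon is CTG.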