Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_65985 |
Symbol | |
ID | 4840396 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009046 |
Strand | + |
Start bp | 1370936 |
End bp | 1372993 |
Gene Length | 2058 bp |
Protein Length | 685 aa |
Translation table | 12 |
GC content | 43% |
IMG OID | 640391711 |
Product | predicted protein |
Protein accession | XP_001385615 |
Protein GI | 150866124 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.00221694 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.861424 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTCTTAA AGTCAACTTC TGCGGGTAAC GTCTCAGTGT ACCAAGTTTC TGGGACCAAC GTATCGCGGT CGTTACCAGA CTGGATTGCC AAAAAGAGAA AGAGAGCATT GAAAAATGAT ATTGAGTATC AAAACCGTAT AGAGTTAATT CAGGATTTCG AGTTCAGTGA AGCTTCCAAC AAAATTAAAG TCACACCCGA TGGACAATTC GCCATGGCTA CTGGTACTTA TAAGCCCCAG ATCCATGTGT ATGATTTTGC CAACTTGTCA TTGAAATTCG ATAGACATAC TGATTGTGAG AATGTAGACT TTTTGATAAT GTCTGACGAT TGGACCAAGT CTGTGCATTT GCAGAATGAC AGATCCATAG AATTCCAGAC CAAGGGCGGA ATTCACTACA GAACGAGAAT CCCCAAGTTT GGGAGAAGTT TGGCCTACAA CAGTATGTCA TGCGATCTCT ATGTAGGTGC TTCGGGCAAT GAGTTGTACA GATTGAACTT GGACCAGGGT CGCTTTTTGA ACCCCTTTGC ATTGGACACT TCTGCTGGTG TCAACGCTGT AACTGTCAAT CCAGTTCACG GATTGCTAGC TGTAGCTTTG GAAGAAGGAG CGGTAGAGTT CTGGGACCCC AGAAGTAGAG TAAGAGCAGC TAAATTATTT GTAGAAGACC AACTTAAAGA ACAAGTTCAA GTGACAGCAG CTTCATTTAG AAATGACGGT TTGAACTTTG CCTGTGGTAC GTCTAATGGT AAGGCTCTTA TCTACGATTT GAGAACAGCT GTGCCTACCA TAGTTAAAGA TCAGGGTTAT GGCTTTGATA TCAAGAAGAT CATCTGGATA GACGATAACG AGTCTGATGC TGACAAAATA TTGACTACAG ACAAGAGAAT AGCCAAGATC TGGGACAGAA ACGACGGAAA GCCATTTGCA TCTATGGAAC CAAGTGTAGA CATCAACGAC ATAGAATATG TTAAAGGTTC TGGAATGTTC TTTATGGCCA ACGAAGGTAT ACCTATGCAT ACATACTACA TTCCTAACTT GGGTCCAGCA CCCAAGTGGT GTTCTTTCTT GGACAATGTC ACTGAGGAGT TGGAAGAAAA GCCTTCAGAC TCAGTCTACT CCAATTACAG ATTCATAACA AGAGACGACG TGGTCAAATT GAACTTGTCT CACTTGATTG GGACCAAGGT GTTGCGTTCG TACATGCATG GTTTCTTTAT TGACAACGAG TTGTACGACA AGGTAAACTT GATTGCCAAT CCAAACTCGT ACAGAGACCG CAGAGATCGG GAAATCAGAA AGAAGATTGA AAAGGAGAGA GAGTCGAGAA TCAGATCTAC TGGTGCTATC ACAAACACTA AAATCAAGGT CAACAAGGAC TTGGCAGCCA AATTGCAAGA AAAGCAAGGT TCTAACGCTG CCGAATCTGT CATCAATGAT GACCGTTTTA AGGAGTTGTT CGAGAACCCC GACTTTGCTG TAGACGAAGA GTCTCACGAC TATAAACAGC TTAATCCTGT TAAGGCTGTG AAGGATGTCA CTAACAGCAG ATCTCGTGGA TTGACTGCTG CTGAAGAGTC TGATGAAGAG AGGAATGCAG AAAGCGGCAC TTTATCGGAA CTGTCTGAAG AATCTGAGGA AGAAGAGGAA GAAGATGAAG AAACAAAGGC ATACAAGAAG ATGAGAGTGG AAAAGGAAAT GGAGAAGTTG CGTCGTAAGA AGAAGGAACA GGAAGAAGCC AAGAGATTTA TGAATGAAAT GAAGGTTGTT TCTGAAGAGG GACCACAACA AAAGGCATCG GAATCGTTTG GTTCGCAAGT TAGAAAAATC AATAAGGTCA CTAAAGAACA AGTCGGCGAC AAGGATTCGA GATTGCGTCG CCATGCTCGT GGTGAGGCCG AGTTGACGTT TGTTCCAGCA AAGAAGGAAA AGAGAAAGGT TCAGTTCAGA GCCGATGACG AGGAGGAAGA TCCAGAGAAG GTCAAGAACA GCGGAAGAAC TAAACAGAGA TTTGATGGCC GTAGAATAGC TTCAAGAAAC AAGTTCCGTG GTATGTAA
|
Protein sequence | MVLKSTSAGN VSVYQVSGTN VSRSLPDWIA KKRKRALKND IEYQNRIELI QDFEFSEASN KIKVTPDGQF AMATGTYKPQ IHVYDFANLS LKFDRHTDCE NVDFLIMSDD WTKSVHLQND RSIEFQTKGG IHYRTRIPKF GRSLAYNSMS CDLYVGASGN ELYRLNLDQG RFLNPFALDT SAGVNAVTVN PVHGLLAVAL EEGAVEFWDP RSRVRAAKLF VEDQLKEQVQ VTAASFRNDG LNFACGTSNG KALIYDLRTA VPTIVKDQGY GFDIKKIIWI DDNESDADKI LTTDKRIAKI WDRNDGKPFA SMEPSVDIND IEYVKGSGMF FMANEGIPMH TYYIPNLGPA PKWCSFLDNV TEELEEKPSD SVYSNYRFIT RDDVVKLNLS HLIGTKVLRS YMHGFFIDNE LYDKVNLIAN PNSYRDRRDR EIRKKIEKER ESRIRSTGAI TNTKIKVNKD LAAKLQEKQG SNAAESVIND DRFKELFENP DFAVDEESHD YKQLNPVKAV KDVTNSRSRG LTAAEESDEE RNAESGTLSE SSEESEEEEE EDEETKAYKK MRVEKEMEKL RRKKKEQEEA KRFMNEMKVV SEEGPQQKAS ESFGSQVRKI NKVTKEQVGD KDSRLRRHAR GEAELTFVPA KKEKRKVQFR ADDEEEDPEK VKNSGRTKQR FDGRRIASRN KFRGM
|
| |