Gene Information |
Locus tag | PICST_63396 |
Symbol | |
ID | 4840645 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009047 |
Strand | - |
Start bp | 329456 |
End bp | 331192 |
Gene Length | 1737 bp |
Protein Length | 551 aa |
Translation table | 12 |
GC content | 41% |
IMG OID | 640391960 |
Product | predicted protein |
Protein accession | XP_001386264 |
Protein GI | 150866610 |
COG category | |
COG ID | |
TIGRFAM ID | |
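The coordinates, gene length, and protein length above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the reported gene span covers only coding sequence, the stop codon, and any introns (no untranslated regions); neither assumption is stated in the record itself. The function names are illustrative only.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a span given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def coding_length(protein_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode a protein, optionally counting the stop codon."""
    return 3 * (protein_aa + (1 if include_stop else 0))


# Values taken from the Gene Information table.
gene_len = span_length(329456, 331192)   # 1737, matches "Gene Length"
cds_len = coding_length(551)             # 1656 = 3 * (551 + 1)
non_coding_bp = gene_len - cds_len       # 81 bp not accounted for by codons

print(gene_len, cds_len, non_coding_bp)
```

Under these assumptions, the 81 bp difference would be intronic sequence inside the gene span, which is consistent with "Is gene spliced | Yes".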
| |
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.28283 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCATCAG AAAAACCAGA CCAAGTTGAG ACTCTCCGTC GAGATCTCCT CAAAGCTTGT AACACTCTTT CCAATGTCAA TTTGTTTCAA GCAGCCAAGT GGTGTGCAGA GGCCCTCAAT GGACTTCCAG AGCCCAAGCC AGAAACAGCA GACCTACAAC CAGAGATGCT AATACTAGAT GAGGATGCAC TTTTGGATCA AAACAAATAC TTATTGGCCA AAGCGTACTT CAACTGTAAG GAATTTGACC GAGCAGCTCA TATTTTGAAA AATTGCAAGA CAGGTGACGC GTTGTTTCTT AGATTGTATT CCATATTGAT ATCCGTGGAC AAAAGAGCCA CAGAAGAAAC CGATGGATCC ATCAACATAG GCTCGGTGAA TGATCTCACA GACCAAAGTG AACCTCAACA GCACAGAAAC AAGGATGTAG TCGATGATAT GAATAACCGA CTATCAAAAA TCATACAGGA GAGTGAAAAC TACCTCACAA AAAGCAAGCC CAACTCCTTT CTATACTACT TGAATGGAGT GATATATAAC AAGAAGAAGA AGTATGCATT GGCACAGAGC AATTTGTACA ATTCATTGAA GTTATTTCCC TACAATTGGT CGGCATGGCA AGAACTCATA TCGTCATTAA TTACTTTCGA AGAGGCTATC AACTTCATCA CCAAAGTAAA GGCAGCCAAA AGTTCTCTTT CTTCAAGCAT CATGTTTCAA TTCTTTGAAG TCGTAGTTCT TCAAGAATTC TATCAACAAC TGAGCTCATT ATTTGACTCA CTCAATCATC TCATAACAAT CTTCCCATCA TTTACATTTC TCAAAGTTCA ACAGTTCCTA ATTTCATACC ACAGTCTAGA CTATTTCCAG GCCGAGTCAA CCTTCGACCA GATCCTCGTG GACGACCCTT TGAGGCTAGA TGATCTAGAT ACCTATTCCA ATATGTTATA CGTAATGGAA AAACGTTCAA AACTTTCCTT CTTAGCACAG TTTGCCTCCA TGATAGACAA GTTCAGGCCA GAAACGTGTT GTATTATAGC CAACTACCAT TCTATGAGAA GTGAACACGA GAAAGCCATT ATGTACTACA AACGTGCATT GACTCTAAAT AAGAATTGTT TAAGCGCCTG GACTCTAATG GGGCACGAAT TTGTTGAATT GAAGAACTCG CATGCAGCTA TCGAATCCTA CAGACGCGCA GTAGATACCA ATCCAAAGGA CTTCAGAGCT TGGTATGGTT TAGGTCAAGC ATACGAAGTC TTGGACATGC ACTTGTACGC ATTATATTAT TACCAAAGAG CCACAAATTT ACAGCCATTG GACAAGAGAA TGTGGCAGGC ACTTGGAAAT TGCTATGAAA AGATCGATAA ACTTGAAGAG GCTGTTAAAT CCTTTGAAAA AGCTTTGACG ATAAATAGTT ATACAAATGA TGAAGGTGAG GCTTATGGAG GTGCCGAGCC TCATATTTGC TACCGTTTGG CACTCATTTC TGAAAAGTTG GGGGATGTAA AGGAGACGTA CAAGTATATG AAACTTTGCT TCGAACAGGA ACTTGATTGG GGCGTCAATG ACGAGACCTC GAAGGCAAGA TTATGGCTAG CACGTAACTC TCTCGAAAGT AGACGATTTG AAGAAGCCTA TGAGTTGGCA AAGGATCTCA GCCATAGCAA TGCTCACGAC ATTGAAGAGG CGAGATCAAT TGCTAGAGAG GCAAGGAATA GAATGCTGAA GAATTGA
Protein sequence | MSSEKPDQVE TLRRDLLKAC NTLSNVNLFQ AAKWCAEALN GLPEPKPETA DLQPEMLILD EDALLDQNKY LLAKAYFNCK EFDRAAHILK NCKTGDALFL RLYSILISVD KRATEETDGS INIGSDVVDD MNNRLSKIIQ ESENYLTKSK PNSFLYYLNG VIYNKKKKYA LAQSNLYNSL KLFPYNWSAW QELISSLITF EEAINFITKV KAAKSSLSSS IMFQFFEVVV LQEFYQQSSS LFDSLNHLIT IFPSFTFLKV QQFLISYHSL DYFQAESTFD QILVDDPLRL DDLDTYSNML YVMEKRSKLS FLAQFASMID KFRPETCCII ANYHSMRSEH EKAIMYYKRA LTLNKNCLSA WTLMGHEFVE LKNSHAAIES YRRAVDTNPK DFRAWYGLGQ AYEVLDMHLY ALYYYQRATN LQPLDKRMWQ ALGNCYEKID KLEEAVKSFE KALTINSAEP HICYRLALIS EKLGDVKETY KYMKLCFEQE LDWGVNDETS KARLWLARNS LESRRFEEAY ELAKDLSHSN AHDIEEARSI AREARNRMSK N
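The record lists translation table 12, the alternative yeast nuclear code, in which the codon CTG is read as serine rather than leucine. The sketch below illustrates the effect on the last 18 nucleotides of the gene sequence. The codon table is built by hand from the standard genetic code with a single override, and the `translate` helper is a hypothetical minimal implementation, not the tool used to produce this record.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
STANDARD_AA = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

STANDARD = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), STANDARD_AA)
}
# Translation table 12 differs from the standard code only at CTG (Ser, not Leu).
TABLE_12 = dict(STANDARD, CTG="S")


def translate(dna: str, table: dict) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    dna = dna.replace(" ", "").upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = table[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# Final 18 nt of the gene sequence above: AGA ATG CTG AAG AAT TGA
tail = "AGAATGCTGA AGAATTGA"
print(translate(tail, TABLE_12))  # RMSKN, matching the end of the protein sequence
print(translate(tail, STANDARD))  # RMLKN under the standard code
```

Because the gene is spliced, translating the full gene sequence in one pass would not reproduce the listed protein; the non-coding segments would have to be removed first.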