Gene Information |
Locus tag | PICST_68235 |
Symbol | |
ID | 4840452 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009047 |
Strand | - |
Start bp | 39594 |
End bp | 41662 |
Gene Length | 2069 bp |
Protein Length | 449 aa |
Translation table | 12 |
GC content | 44% |
IMG OID | 640391767 |
Product | predicted protein |
Protein accession | XP_001386220 |
Protein GI | 150866575 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
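The length fields above can be cross-checked against the coordinates. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (the record does not state the convention, but the reported gene length is consistent with it). The variable names are illustrative only.

```python
# Values copied from the Gene Information table above.
start_bp = 39594
end_bp = 41662
protein_length_aa = 449

# Assumption: coordinates are 1-based and inclusive, so both end points count.
gene_length_bp = end_bp - start_bp + 1

# Nucleotides needed to encode the protein, plus one stop codon.
coding_bp = protein_length_aa * 3 + 3

# Bases in the reported gene span that the coding region does not account for.
non_coding_bp = gene_length_bp - coding_bp

print(gene_length_bp, coding_bp, non_coding_bp)
```

Under these assumptions the span reproduces the reported 2069 bp, while the 449 aa protein needs only 1350 bp. Since the record marks the gene as unspliced, the remaining 719 bp would most plausibly be flanking untranslated sequence included in the gene span.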
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | CGATATTTCA ACAACCAATA AATTCCACCC CGATTGAAAG ATATTTATCG GCGTCTGTAT TATCCATTGC AACGCAGGTC CTCATCTCAA AACTCTAATT TCCCTCGCGA AATTACCCAA TAAAAACTTC TTGTTGCAGT GTGTGCAGTC GAAAAAAAGA TACTTTTTGT CGTTGCTCGC CAAAGCCAGT CCCTAAACTT TCCTTTGTTC TCCCTAGCGT CGCCATTCTT TGCTGAAAGC AATTTAGTGA TATTCTTGTT CCAAGAGAAA GTTGATATTA CGAGCCACAC TAAGACGTAT TCTGTGTCAG ATCACTACTG CTACTGAACG TCTATATAAC CAACCTATTT CCTCCAGCCA TACTAGCAGG ATTTTTCGCT TCTCATCGAG GGACATTCTC ATTCCCCCTA GAACCGCTAG CGAAATCTTC CCCACGCAGG TTGACGTTAA CTGATAAATT TAATCATCAG TTATCACATT TTTTTCATGA TGGACACTTG AACCACTACC AAAGAGTAAG AATTCCAAAG TAATAATTAG TCAATACTAC AGACCACTAA CCCATCGTCC ACACACCACC ACCTATACGC CACTAGTAGC ACAACCCTAG AACTTCACTG CTTATAAACG CTAGTTCTCC TTAACCCAAC TATTTACCAT AGTCGTTGAC AATATGAACT ACATTTCGTC CAACGACGTT CTCTACGAGA ACCCCCTTGC AGCTAACCTT GATCCTTCCA CCACTGGAGG TGATCGTGTT GATGCTGACA CAGCTGACTT CCTAAACGAA CTCCGCCATC CCTCATCGCA AGACCTTGCC GAAGTCCCGG CTCCCTTGCA GCACGCTACA GCATCGTCTA CGTCGAGTTC CACAAACTCG GTAGGTATGC AAATAGACTC TTTGGGTATT GATCCCGTCC AACAGAACTC CGTTAGCCAG AGCCAGAATC AAAATCAAAT CCAAAGCCAC AGGAATGGTA ACCATGGTTC GCTCACTTTT TCCAAACCAT TTCTGGCAGA TGACTTGGCT ACTCTCGCAA ACGTTCACCA GATGAACAAC AATACCAATA ACAACAACAC GAACTCCAAT TTGAATAGTA ATAACAATAA CAGCAATAAT GCCACATCGT CGCTTTTTGC GTTCCACAAC CCGTTCGACT TTAAGTCATA CCCCATTACC AACCCACCCA TATTCGACTC AACCCTTCTC CTCCCGCTCT ATTCAAACGA TGGGGTCCCA CGTAGAAGAA GAATATCCAT CTCCAATGGG CAGATTGGCC AGATCGTCAA CCACGAAGCT CTCTTTGAGG ACGACTCTGG TTTCGATACA GACTTGGGAG TCACTGGCTT CGGCAACAGT CAAGGACATT CGCAAATTAG CTCGCCTCCT CAGGCGCAAC TCAGCAGTAT GAATTCCAAT ACGCTGATTG CTGCCATCGC CGATCAACAA CAACAACAGC CGCAACAAGT ATTCCCCGGA AACTTCAACA GCCAGGCGGT TCCGCCTCAG TATCAACCGC AGATTACATT TGCTCCAGCT TCGACATCCA TATCCGTGTC TCCAAATCCC CAAACACCTG TTGCAAGTCC ACAGAAATCA CTCTCTAAAT CCCACTCTCG CAAAAACTCC ACCGCTGTGC CAGAACTCAC AGGTGTAGCC GGAGTGCCTC CTCCCAACCA CCAGCTCATA TACAACAACG AGGTTATCTA CAACCCTAAC AATGGACCTA TTCCCGGTAC TGCAGCCTGG AAGAAAGAAA GACTTTTGGA AAGAAATAGG ATAGCGGCCT CTAAATGTAG 
AGAGAGAAAG AAGCAGGCAC AGCTAGAGCT CCAAGGCAAC ATCTCGAAGA TGAAGAGCCA ATACAAACGT GACCAGGAAA AGATAAAGAA ACTCAACAAA CTTGTAGAGT TCTACAATAA GACTATAGTC AAACACCTCA ACGACGGAAA CCAGGAATTA TCGGTATTAC GGAAGTTTAT CAACAAGGAT ATAGATGAGA TCGACATTAA AGATATCTCA TGATTGATAG ATTGTATTTT ATAGAATGAA TGAATGTTAC AGTACGGAA
|
Protein sequence | MNYISSNDVL YENPLAANLD PSTTGGDRVD ADTADFLNEL RHPSSQDLAE VPAPLQHATA SSTSSSTNSV GMQIDSLGID PVQQNSVSQS QNQNQIQSHR NGNHGSLTFS KPFSADDLAT LANVHQMNNN TNNNNTNSNL NSNNNNSNNA TSSLFAFHNP FDFKSYPITN PPIFDSTLLL PLYSNDGVPR RRRISISNGQ IGQIVNHEAL FEDDSGFDTD LGVTGFGNSQ GHSQISSPPQ AQLSSMNSNT SIAAIADQQQ QQPQQVFPGN FNSQAVPPQY QPQITFAPAS TSISVSPNPQ TPVASPQKSL SKSHSRKNST AVPELTGVAG VPPPNHQLIY NNEVIYNPNN GPIPGTAAWK KERLLERNRI AASKCRERKK QAQLELQGNI SKMKSQYKRD QEKIKKLNKL VEFYNKTIVK HLNDGNQELS VLRKFINKDI DEIDIKDIS
|
| |
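The record lists translation table 12 (the alternative yeast nuclear code), in which the codon CTG is read as serine rather than leucine. The sketch below illustrates the effect on a short fragment taken from the gene sequence above (`AATACGCTGATTGCTGCC`), read in the frame that matches the listed protein. The helper names `build_table` and `translate` are hypothetical, and the amino acid strings follow the NCBI codon ordering (bases in the order T, C, A, G); this is an illustration, not the pipeline used to produce the record.

```python
BASES = "TCAG"

# Standard genetic code (NCBI table 1), codons ordered TTT, TTC, TTA, TTG, TCT, ...
STANDARD = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

# Table 12 differs in its amino acid assignments only at CTG (index 19): Leu -> Ser.
YEAST_ALT = STANDARD[:19] + "S" + STANDARD[20:]


def build_table(amino_acids):
    """Map each of the 64 codons to its one-letter amino acid."""
    codons = [a + b + c for a in BASES for b in BASES for c in BASES]
    return dict(zip(codons, amino_acids))


def translate(dna, table):
    """Translate a DNA string codon by codon, ignoring spaces and any trailing partial codon."""
    dna = dna.replace(" ", "").upper()
    usable = len(dna) - len(dna) % 3
    return "".join(table[dna[i:i + 3]] for i in range(0, usable, 3))


# Fragment copied from the gene sequence above, in the protein's reading frame.
fragment = "AATACGCTGATTGCTGCC"

standard_peptide = translate(fragment, build_table(STANDARD))
table12_peptide = translate(fragment, build_table(YEAST_ALT))

print(standard_peptide)
print(table12_peptide)
```

With table 12 the fragment translates to `NTSIAA`, which matches the `NT SIAA` stretch in the protein sequence above; the standard code would give `NTLIAA` instead.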