Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_83067 |
Symbol | |
ID | 4838900 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009044 |
Strand | - |
Start bp | 1444140 |
End bp | 1445430 |
Gene Length | 1291 bp |
Protein Length | 363 aa |
Translation table | 12 |
GC content | 44% |
IMG OID | 640390215 |
Product | predicted protein |
Protein accession | XP_001384575 |
Protein GI | 126136102 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.00421755 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACTCAAA AAGAAAGACT TGCCGAATTG GAGCGGGATG CCCAAGAGTC GTATTCGTTT TTGGTATCTG ATGAGAAGTC ATCGACATTC CGCAGTCAGG AGGAGTTCAA AGAATTTGTA GTAGACAAGA AGCAATCTCC AGTAGAAAAG ACACCAGTGA AGCCATGGGT CCATTTTGTA GCTGGTGGTA TCGGGGGAAT GGTCGGTGCC ATAGTAACGT GCCCTTTAGA TGTGGTGAAA ACGAGATTGC AATCAGACGT CTACCATGCC ATGTACAACA AGACACCTAA GTCTGCGAAC CCTGTAATCA AGATGTTTCA GCATTTGAAG GAAACAGGCT CCGTTATTAG GGAATTGTAT GTGAGCGAAG GTTCTAGGGC CTTGTTCAAA GGTTTGGGAC CAAATTTGGT CGGTGTGATA CCTGCTCGTT CTATCAACTT CTTCACATAC GGCTCTACCA AAGAGTTCTT GACCAGCAAC TTCAACCAGG GCCAGGAAGC CACCTGGATT CATTTGGCAG CCGGTATAAA CGCCGGTTTT GTCACCTCGA CAGCTACCAA TCCAATCTGG TTGATCAAGA CCAGATTACA GTTGGACAAA ACTAAGGGCA AACACTATAA AAGCTCTTGG GATTGCCTCA CTCATGTGAT CAAGCACGAA GGATTCAGTG GCCTTTACAA GGGTTTGAGT GCTTCATATT TGGGAGGTGT AGAATCGACG TTGCAATGGG TGTTGTACGA ACAGATGCGG ATGTTTATCC ACAGAAGATC GTTGGCTCTA CATGGAGATG ATCCTAGTAG TAAAACTACT AGAGACCACA TCATAGAATG GTCTGCCCGA TCTGGTGCTG CCGGTGCTGC CAAGTTCATA GCATCTTTAA TTACGTATCC TCATGAAGTG GTCAGAACTC GTTTGAGACA AGCTCCGTTG GAGTCCACAG GTAAGCCGAA GTACACGGGC TTGATCCAAT GCTTCAAATT GGTGTTGAAG GAAGAGGGTC TTGCCAGCAT GTATGGAGGT TTGACTCCAC ACTTGTTGAG AACAGTGCCC AACTCCATCA TCATGTTTGG CACCTGGGAG CTTGTAGTTC GTTTATTGTC ATGAGCATTT GAGAGACATG TTGCTCTTTC TAACGTAATA TCTTGTACAT ATTTGCTTGG GTTCGTTCTT CATGAAGGTT GGTTTTGTTA TATATATCTA TATATATCTA TCGTTTCTTG TTCATAAGAG ATTATCATCT CTTTGTATAT AATTCATACA AGTTTCGGTT TCTTCGACTG TTTTTCTGTT A
|
Protein sequence | MTQKERLAEL ERDAQESYSF LVSDEKSSTF RSQEEFKEFK QSPVEKTPVK PWVHFVAGGI GGMVGAIVTC PLDVVKTRLQ SDVYHAMYNK TPKSANPVIK MFQHLKETGS VIRELYVSEG SRALFKGLGP NLVGVIPARS INFFTYGSTK EFLTSNFNQG QEATWIHLAA GINAGFVTST ATNPIWLIKT RLQLDKTKGK HYKSSWDCLT HVIKHEGFSG LYKGLSASYL GGVESTLQWV LYEQMRMFIH RRSLALHGDD PSSKTTRDHI IEWSARSGAA GAAKFIASLI TYPHEVVRTR LRQAPLESTG KPKYTGLIQC FKLVLKEEGL ASMYGGLTPH LLRTVPNSII MFGTWELVVR LLS
|
| |