Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_31994 |
Symbol | |
ID | 4839687 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009045 |
Strand | + |
Start bp | 417244 |
End bp | 418659 |
Gene Length | 1416 bp |
Protein Length | 471 aa |
Translation table | 12 |
GC content | 39% |
IMG OID | 640391002 |
Product | predicted protein |
Protein accession | XP_001384742 |
Protein GI | 150865501 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTATACAT TCAACGTTTC CGAGATGCTC TCCTTGGATT TCAAACAAAT AGTGACCGTA GCCTTGATGT TCTTCATCAT CCGTGAAATC AACTACTTGA TCAACAATCT CAGTCGGATC CTCAATAAAA ACCAATGGAG GAATATTCTG AATCAAGAGT CGCTGGAGAG GGATCTTCCG AACCCAGTTT TTTTGCACCC TAATTCTACC GCTGAGCCTG TAAGTTTTCC GGACGAAATC ATATTGGAAA TCCTTGAGTA CGCTCCTCAA CACGATGTCT TGAGAATGGC CCGTGTCAGC AAGAGATTTG CTAGAATTTG TAGAATGAAA CTCTTCAAAA ATATATACGT TGGAAGCCCG ACATCCAATG TCTTACACCC AGAGTACAAC ACTCCATTCT ACCAAAAATA CACCATCATA AAGTACGAAA ACTTTCTTAT AAACAGTGGG TCATTGTTTT TCACAAGACG CCCAATCCAA GAGTTTGTGT TCAAAGATCC TAAATTCTCT ACTGGTTTTT TTGACAAATT GAAGTCTTTT CACCCACAAG CAGCTTTCTA CATCGAGAAC AAACCCAGGA CAAAGTCTCC TTTTAAAACC CTTCGACATA ACTTGTTGAT TCTGGACATA AGGAGGTTGG ATATAGTTCC AGAAGAAATT GATTCTTTGA CAAGTTTCCC TGATTCTATC AGACATTTGT CGATTGACTT CACTGACCTT CAAGAAAATG GGGCAGCATT GAATAGGTGT AGAAATACCT TTGCTGGGTT GACTTCTCTA AAGTTGAAGA ATGTGGACAG CCATATGATT CTAGCACTTT TTGCTGGAGA GAAGATCAGT GTCAGAAAGC TTTCTCTTCT GACCAGTAAT AGTGAATTTG GCTTTGATAC GATAGAGAAG TGTTTCGACT TGAGTACCAT TTCAAGCTTC GAGTTATTAG ATAGGAATAT CAACCGAAAG AATGAATCCT ATAAGCAGTT TATAACCAAG CTTGCTTCGG TTCGCCTGTT AACACTTTCG TGTCCACAGT CATTTCTCAG GGACATTATA ACCTCTTTTA AAAAGAATAC ACTTGAAGAG ATCAGTTGTC TAATTGACAC AAGCCACGAC GTATCTATGT CATTTATTCA AGGATTAATC GAAGATCATG CACAGTCTCT AGTTCGTATC AGCTGCTGTT CTTCCAATGA GTGTTTTATT TTGACAGACC TGGGCTCCTT AGATAAGTTC ACAAGCATTC ACACAGACAG GTCTTCAGAA TACTATATTG ATATGGCCAA AGAATTGCAC AGGAATTCAG ATGATTATCC CAAGTTGAAA TTGTTCGAAT TAAATGGAGT TCCAATTATA TTGGATAAAA CATATGGCGA ATTAACTGGA ATAACTCCTC TAGTTCCCAA CCGCATCAGT CAGTAA
|
Protein sequence | MYTFNVSEML SLDFKQIVTV ALMFFIIREI NYLINNLSRI LNKNQWRNIS NQESSERDLP NPVFLHPNST AEPVSFPDEI ILEILEYAPQ HDVLRMARVS KRFARICRMK LFKNIYVGSP TSNVLHPEYN TPFYQKYTII KYENFLINSG SLFFTRRPIQ EFVFKDPKFS TGFFDKLKSF HPQAAFYIEN KPRTKSPFKT LRHNLLISDI RRLDIVPEEI DSLTSFPDSI RHLSIDFTDL QENGAALNRC RNTFAGLTSL KLKNVDSHMI LALFAGEKIS VRKLSLSTSN SEFGFDTIEK CFDLSTISSF ELLDRNINRK NESYKQFITK LASVRSLTLS CPQSFLRDII TSFKKNTLEE ISCLIDTSHD VSMSFIQGLI EDHAQSLVRI SCCSSNECFI LTDSGSLDKF TSIHTDRSSE YYIDMAKELH RNSDDYPKLK LFELNGVPII LDKTYGELTG ITPLVPNRIS Q
|
| |