Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_33076 |
Symbol | |
ID | 4840250 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009046 |
Strand | - |
Start bp | 1375821 |
End bp | 1377767 |
Gene Length | 1947 bp |
Protein Length | 648 aa |
Translation table | 12 |
GC content | 38% |
IMG OID | 640391565 |
Product | predicted protein |
Protein accession | XP_001385960 |
Protein GI | 150866382 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.759857 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATATGT CGTCATTTGA TGTAGGAACT GGGTCAGTTC AAAAACCAAG GAAGACCCAG CGAGCTCCCA AAAGTTGTTA CCAGTGCTCC AAAAAACGTG TGAAGTGCAA CAAACAAATA CCTTGCCAAA ACTGCATCAA GCGGGGACAA GAATGCTTTC AGGAGGCCGT AATTGTCAAA GGAGTCATTT TAAACGACAC CAAGTTCGAC CTAACTGAAA AATTGAAAAC AGAGAACGAA TTCTTGCATG AGAAAATCCA ACGCTTGGAA GCCAAGCTAT CCCGACAGGA TGTAAAACTG ATGCAATCAA TGGGATCCGT TGTTAATAGA GATTATGTAG ACAAGCTTGG TACCGGGGCT CGATTGGTTT CTCGTGATCT TTTACCAGGC TCAGATGTTA TCGACACAGA TACAGAGAGA CTAACCAGTG CAAAATTAGA AAAACTATCG AGGTTCGTCA CCCGAGATGT GTCAAGGAAA TTGGTAGAGT TCAATTTGGA GAATCTTTAT TTGGTTCATT CGGCAGTACA TCCAAATTCC TTTCTAAAGG AGCATGAATT ATATTGGAAC GACAATTCAA GACCTAAGCA TTTGAACTAC GAGGTTAACC TCTCACAAAA CCAGTATTTA TGGATGGCTA TTTGGTATGC GATGATAAGC GGCGCACTAT ACACTTTGGA CACTGATTTA GAGCTGTATT TGGGCTTAAC TTCGGAGGGA TATTTTGAGA TGGCTAAAAT TTCTTCTCTT GTATCTCTAG AGTGTCTACA TAGAGGGCAG TTTTTGAGAA TTCCTAATAT TCGTTCAATC CAGGCATTTT GCGTATTAGC TTCATGCTTC CATGGTTTTT CCGGAATACA CTTACAGAAC TCTTTACTAT CTTGCATGAT ATACATTGGC CAATCCTTGA ACTTACACAG GCTTAGCTTG CTGCAAGCAG AAAGTTTAGT AGACTATGAA GTATCTTGCA GATTGTGGTG GATACTTGTT GTTATAGATT TCCTCGAGGA TGTCCATAGG CAAACAATTC TTTCAGATAA TTTCCAAACA CCAATACCAA GAAATATCAG TGAAGATGAT CTCAATTCTG GAGATTTAAA CGTTACAGAG ACCGACGAAT TTACATGTAT TACTTACAAT CAGATGATCA TGAAATTATC AAGAATAAAA AAGAGTTTGT ATTATGAGGA TAACGCAGAA ACTAGCAAGT TCACCTTCAA CCAATTGAAC TTAGCAGATT TGGAATTGTT AAAGTTGCAA AGCACGATTT CAAACCAAAT TTTGAAACTC AAAGATCCAA AGCGATCAAC TAGATTTGCC ATATTTTTGA CGGAAGTGAA ATTAGCTCAT GAAAGATTAC TTGTCAATAG AATGGTTATT AGCCATGTAA GCAAGGAAAA ATGGTTATCG GAATATAGAT ACAAATGTGT ATCTTTCGCT ATAACGGTGA TCAGCAAATT TAACGACAAG AGCCTACCTT TTTATTTCAA AAAGTACTGG ATGACTAGTG AACATTCGAT AAATGCAATA GTGTTCTTAA TCTTGGACCT AGTATTGCAC CAACTGCCCA GAGGTGAGCG TTCCTACAGA CTAAAATTGA TCAACGAATG TATTAATATA TTGGTGTTGC TAAAGAGAAC TCATACGACT GTTTCTCGGG GATTGAGAAT AGTGGAAGCT TTGCTCATAA TGTTACAACA AAGCGGCCGT TCAAAATTTA CTAATATGTC AGAGACTGCA GAGATCAGCA ACACCATTAG CGCCTTAAAA TCGACACCTC GTATTTACGG TAGTGTTGAA GTTAAGGGCC CATTAAAACC AGAGAATGAA CAATTGATAT TCAAAAATTA TGATGAAAGT TCAGAGACTA TTTTAAATGA CTTGTTGCAA GACAATAATT GGCAGCAGTT TCTTGAATGG ATTAATTCAA ATAGTATGAA GCAATGA
|
Protein sequence | MNMSSFDVGT GSVQKPRKTQ RAPKSCYQCS KKRVKCNKQI PCQNCIKRGQ ECFQEAVIVK GVILNDTKFD LTEKLKTENE FLHEKIQRLE AKLSRQDVKS MQSMGSVVNR DYVDKLGTGA RLVSRDLLPG SDVIDTDTER LTSAKLEKLS RFVTRDVSRK LVEFNLENLY LVHSAVHPNS FLKEHELYWN DNSRPKHLNY EVNLSQNQYL WMAIWYAMIS GALYTLDTDL ESYLGLTSEG YFEMAKISSL VSLECLHRGQ FLRIPNIRSI QAFCVLASCF HGFSGIHLQN SLLSCMIYIG QSLNLHRLSL SQAESLVDYE VSCRLWWILV VIDFLEDVHR QTILSDNFQT PIPRNISEDD LNSGDLNVTE TDEFTCITYN QMIMKLSRIK KSLYYEDNAE TSKFTFNQLN LADLELLKLQ STISNQILKL KDPKRSTRFA IFLTEVKLAH ERLLVNRMVI SHVSKEKWLS EYRYKCVSFA ITVISKFNDK SLPFYFKKYW MTSEHSINAI VFLILDLVLH QSPRGERSYR LKLINECINI LVLLKRTHTT VSRGLRIVEA LLIMLQQSGR SKFTNMSETA EISNTISALK STPRIYGSVE VKGPLKPENE QLIFKNYDES SETILNDLLQ DNNWQQFLEW INSNSMKQ
|
| |