Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_83827 |
Symbol | |
ID | 4839240 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009045 |
Strand | + |
Start bp | 937150 |
End bp | 939542 |
Gene Length | 2393 bp |
Protein Length | 789 aa |
Translation table | 12 |
GC content | 41% |
IMG OID | 640390555 |
Product | predicted protein |
Protein accession | XP_001384848 |
Protein GI | 150865577 |
COG category | [R] General function prediction only |
COG ID | [COG2319] FOG: WD40 repeat |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.0497188 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | CAGACCTTTG CCCTTCCTCA AGCATGGATA CCTTGAAAAC TACCTATAGC CACCGGGATA TAGAACCTAT TTATGTGGGT GGAACCTCGG CTACCATTTC TGCTGACGGG CTGATCTTGG CCACGCCTTT GAACGAGGAC GTAGTTATCA CCAGTTTAGA TACTAACGAG ATCCTCCACA AGATCGAGGG AGATGGTGAA ACCATTACAA ATCTCGTTAT TACACCCGAT GGATCCAAGT TGGCAATTTT GTCGCAGTCC CAGCAATTGC GTATATTTGA CTTGGACAAG CTGGAAATCA CTAAAACCTT TAAAATGCCA TCTCCAGTAT ATATCTCTTC AGTCGATTCC ACATCTTCCC TATTTGCTTT CGGAGGTTCG GATGGTGTTA TCACTGTTTG GGATATCGAT GGTGGATATG TGACGCATTC TCTCAAAGGA CACGGAACCA CCATCTGTTC GTTAATATTC CACGGCCAGT TGAACTCTAC CGAATGGAGA TTGGCTTCTG GTGATACTAT GGGTACAGTT AAAGTATGGG ACTTGGTAAA AAGAAAATGT CTCACAACAG TTAATGAACA TAATACAGCA GTCAGAGGCG TTGGCTTCGA CAGTTTGGGC CAACATTTCA TCACTGGTGG AAGAGACAAT GTAGCAATCA TCTATAACAC GAAAAACTAT AAGCCAGTGA ACACCTTTCC AATCAACGAA CAGATCGAAT GTGCTGGTTT CATCACCATT TATGACAGGG AATTCTTCTA CACCGCTGGT TCTGAAAATA TCTTGCGACT CTGGAGCATT GCTACTGGTA CATTGGTGGC CAGCTCAAAA GCTTCGTTAA AGACAAATGA AGAATTGATC ATAATAGATG TCTTAAAGTT GGAAAACAAT GACTTGGTCT TAGTAGTAAG TGACCAGACG TTGATTCATT TGGATTTACA AGAGCTTGAT TTCGACAATG GTGAAACTGT AGAAATTCCT GTAGCCAAAA GAATTGCAGG AAATCATGGT ATCATTGCCG ATATCAGATA CGTGGGAGAA AAGTTCAACT TGATAGCATT ATCCACCAAT TCGCCAGCAT TGAGAATAGT GGACCCATTA AAGCCATTGG AATTGAGATT ATACGAGGGC CATACCGATT TGCTCAATGC TATGGATGTC TCTACTGATG GAAAGTGGAT AGCTACTGCT TCTAAGGATA ACGAAGCCAG ATTATGGAAA TGGGATGAAG AACAAGATGA CTTTGTTCCT TTTGCAAAAT TTCAAGGTCA TGCTGGATCA GTCACAGCTG TGGCATTATC CAAAGCAGAA AACACACCCA AGTTCTTGAT TACTGGATCC AGTGATCTTA CTATCAAAAA ATGGAAGATC CCAGCTACTG CCGGATCTAC TGTCAAGACA TCCGAATATA CCAGAAGAGC TCACGATAAG GATATCAACT CTATAGACGT AGCGCCAAAC GATGAGTACT TTGCATCTGC ATCATATGAT AAGTTCGGTA AAGTATGGAA CACTGCTAGC GGAGAAACTA TAGGTGTTTT GAAAGGTCAC AAGAGAGGAT TGTGGGATAT TAACTTCTAC AAATTCGACA AGCTCATTGT CACTGCAAGT GGTGACAAGA CTCTCAAGGT ATGGTCCTTG AATGACTTCA CCTGCGTCAA AACTTTTGAA GGCCATACCA ACTCCGTGCA GAGAGCCAAA TTTTTCAATA GATTCAGCCC ACAGTTGCTT TCAACTGGTG CAGATGGTTT AGTTAAGGTT TGGGACTACA AGAGCGGAGA AATCATAAAA ACGCTCGATA ACCACGAAAA TAGAATTTGG TCTATCGATA TTAAAGAAGA TGGTAATACT TTTGTCACTG CTGATGCTGA TGGTAAATTG AGTGAGTGGG ACGACAATAC GGCAGAAGAA ATCAGACTTA GGGAACAACA AGACAAGTTC AAGGTTGAAC AAGAACAAAA CTTGTCCAAT TATATCAGTA ATAGAGACTG GCCAAATGCT TTTTTGTTGG CATTGACGTT GGACCACTCT ATGAGATTGT ACAATGTCGT CAAGTCTTGT ATTGAAGCCA ACGAAGATCC TAATTCAGCT ATTGGCTCAG AGCCATTGGA AGAAACTATC ATTCAGTTAT CTGACGAGCA ATTGTTGAAG TTATTCAAGA AAGTCAGAGA TTGGAACACC AACTTCAAGT TCTTTGAAAT TAGTCAAAAA TTGATTTCTG TTCTCATGTC CAACATCGAA ACTGAAAGAT TGATAGAAAT ACCAGGCTTG ATGAAAATCA TCGAGGCACT CATTCCATAC AACGAACGTC ATTACAATAG AATCGACGAC TTAATAGAGC AAAGTTACAT CTTGGATTAC GCTGTGGAGG AGATGAACAA ATTGATAGCA TAG
|
Protein sequence | MDTLKTTYSH RDIEPIYVGG TSATISADGS ILATPLNEDV VITSLDTNEI LHKIEGDGET ITNLVITPDG SKLAILSQSQ QLRIFDLDKS EITKTFKMPS PVYISSVDST SSLFAFGGSD GVITVWDIDG GYVTHSLKGH GTTICSLIFH GQLNSTEWRL ASGDTMGTVK VWDLVKRKCL TTVNEHNTAV RGVGFDSLGQ HFITGGRDNV AIIYNTKNYK PVNTFPINEQ IECAGFITIY DREFFYTAGS ENILRLWSIA TGTLVASSKA SLKTNEELII IDVLKLENND LVLVVSDQTL IHLDLQELDF DNGETVEIPV AKRIAGNHGI IADIRYVGEK FNLIALSTNS PALRIVDPLK PLELRLYEGH TDLLNAMDVS TDGKWIATAS KDNEARLWKW DEEQDDFVPF AKFQGHAGSV TAVALSKAEN TPKFLITGSS DLTIKKWKIP ATAGSTVKTS EYTRRAHDKD INSIDVAPND EYFASASYDK FGKVWNTASG ETIGVLKGHK RGLWDINFYK FDKLIVTASG DKTLKVWSLN DFTCVKTFEG HTNSVQRAKF FNRFSPQLLS TGADGLVKVW DYKSGEIIKT LDNHENRIWS IDIKEDGNTF VTADADGKLS EWDDNTAEEI RLREQQDKFK VEQEQNLSNY ISNRDWPNAF LLALTLDHSM RLYNVVKSCI EANEDPNSAI GSEPLEETII QLSDEQLLKL FKKVRDWNTN FKFFEISQKL ISVLMSNIET ERLIEIPGLM KIIEALIPYN ERHYNRIDDL IEQSYILDYA VEEMNKLIA
|
| |