Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_78790 |
Symbol | |
ID | 4840121 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009046 |
Strand | + |
Start bp | 271432 |
End bp | 273346 |
Gene Length | 1915 bp |
Protein Length | 405 aa |
Translation table | 12 |
GC content | 42% |
IMG OID | 640391436 |
Product | predicted protein |
Protein accession | XP_001385390 |
Protein GI | 126137734 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.372401 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.242751 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGTCCCCAC TTGTGCTCGC AACTCAACCA GTACTTGCGC CGACTTGCCA GTCGACCTCA TCCTTAGAGG CTCCGTACCA ATTTGCCAAA TCTATATAGG TTCCGTTAGC TAATTTTCAT ATCTTGTGAA TTTGACATCT GTGTTTCCAA CTCTGAACAC GGACTTTCGC TATAGCTTCC ATCTTTGCCC TGCAAAAGTT AAGTGTTGTC TGTTGTTGGC CAATCACCTT TTTTCGCTGA TTTGGTCATA AGAGTGTTAA CATAGTGGGA ACAAAACTGT TCATATAGAT TTGATTTTAC ACTGAACTTT ACCCAGAACA TTTCCATTTT ATTTCGGCAA AGGCATATAG ATTACTCCAT TTTCTCTTCT TGAACATTTA CCTGTGTCCT TTACTCATTG TATCTAAAAA TGTCTGTCCT ATTCCTAAAG TACGCCCTAA AGCCGTTCCT ATTCGTTTTC CAGTTGCTAA ACAGAATTTT CTGGTCCGGA TTAAACGGCC GTACCGTTTT CCAGCTAGTG CTAAACTTCT GGCTCAACTT TTCACCAGTC TTTATTTGGT TACTTATGTT TAAGAATGCC GGAATTATAC CCAAGGAAAT CCGTCCTAAG ATTTACGTCG CATTGGCTAT GCATGTTGAC GACTATATGT TCAACTTCGT CGGTCATCCG CTCATCTCCA CTGTAGCTCT TGTGAGCTTA GTGTCTGGGG CTTGGTTGAT CTACTACGTG TTTTATAGAA CCCCCACCTC CAAAAAGCAA GAACAGTCAT ATTCGGCTCT TTCCAATGTT TACAAAAATG AACTCCATAA TGGACATTCC ATCGATTCCG ACGATCCTAC AGCTGTCGGA TCGTCGTCCG AGACTTCTTC CGATTTAGAA GATTTTAATG AATACGAATT AACAGATATG AATTCAAGTT CGTCTGATCT CGGGGACGTG TCTATTTTCA ATTCTCCAGG AGATTGGCAA GACGACACCC ATCACAGTTC GATTGAGTTT TTCAAAAATT TATCGTCCAA TGCCATTTCT ACACAGACCT CAGAAACCAA CCGTAGGATA TGGAGAACTA TCAAAACCAG AGGATACGGG CCATTAAACT GCTGGAACTT GTCTCCACCA ATTCTTATGG CATTGAGTTG GTTCCTACTT AACATTGACT ACTGGTTCAA GGACCCAATT AACACTCCTA AGGACTTACT TGCATGGACT TCTTATGTTT TGTTTCATTT CTTTGTTCCT TTATTCACTG CCATATGGTT ATATGTATTC CACGCCCCTG GAGCTTTGAG ATTGTTTTCA TTTGGACTTG GAATGCAAAA TATAGCAGGT GTTTGCACCC ACTTGCTTTT CCCCAATGCT CCACCTTGGT TCATCCACTT ATACGACGAA GATGCGGAAG CAACTTATGA CTTGCCTGGT TATGCCGCTG GATTAACCAG AGTCGATATG GCCATGGGAA CCCATCTCAA TTCCAACGGT TTCCATGCTT CACCCATTGT GTTTGGAGCT TTGCCATCTT TGCATTCAGC CATGGCAGTG ATGGCTTTCT TCTTTGTCTC GTACTACTCG AGATGGACAA CCCTAAAATT GCTTGCCGCA TCTTTTGTAG CATTACAATG GTGGGCGACA ATTTACTTGG ACCACCACTG GCGTTTAGAC TTGGTTGTTG GCATGTTGTA TGCGATTACC AGCTTCACGT TGTTATATTG TTGGCCCAGG GGAATTAAAA AAGTTGATTC AGATTTCATG AAAGCTAGAC TACGATTTGA TTTCAAGAAT GGATCGACTA TGGGAATGAG AGTTTTCAGG AATACCCGCT TACAGAACTT TTTCGATCCT TTAGCATAGA CATATAATAC ATTTACCTAC GCCTTTAATC TACGAATGCA TATCGGTCTA CGATCGATCT TATAA
|
Protein sequence | MSVLFLKYAL KPFLFVFQLL NRIFWSGLNG RTVFQLVLNF WLNFSPVFIW LLMFKNAGII PKEIRPKIYV ALAMHVDDYM FNFVGHPLIS TVALVSLVSG AWLIYYVFYR TPTSKKQEHS IEFFKNLSSN AISTQTSETN RRIWRTIKTR GYGPLNCWNL SPPILMALSW FLLNIDYWFK DPINTPKDLL AWTSYVLFHF FVPLFTAIWL YVFHAPGALR LFSFGLGMQN IAGVCTHLLF PNAPPWFIHL YDEDAEATYD LPGYAAGLTR VDMAMGTHLN SNGFHASPIV FGALPSLHSA MAVMAFFFVS YYSRWTTLKL LAASFVALQW WATIYLDHHW RLDLVVGMLY AITSFTLLYC WPRGIKKVDS DFMKARLRFD FKNGSTMGMR VFRNTRLQNF FDPLA
|
| |