Gene Information |
Locus tag | PICST_83776 |
Symbol | |
ID | 4839503 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009045 |
Strand | - |
Start bp | 277857 |
End bp | 279669 |
Gene Length | 1813 bp |
Protein Length | 572 aa |
Translation table | 12 |
GC content | 44% |
IMG OID | 640390818 |
Product | predicted protein |
Protein accession | XP_001385072 |
Protein GI | 150865737 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
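The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the coding sequence includes one stop codon; neither assumption is stated in the record itself. The function names `span_length` and `coding_length` are hypothetical helpers introduced only for illustration.

```python
def span_length(start_bp, end_bp):
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def coding_length(protein_aa, include_stop=True):
    """Bases needed to encode a protein, optionally counting one stop codon."""
    return 3 * (protein_aa + (1 if include_stop else 0))


# Values taken from the Gene Information table.
gene_len = span_length(277857, 279669)
cds_len = coding_length(572)

print(gene_len)            # 1813, matching the Gene Length field
print(cds_len)             # 1719
print(gene_len - cds_len)  # 94 bases of the span that are not coding
```

Under these assumptions the 94-base difference is consistent with the record marking the gene as spliced, although the record does not say how those bases are divided between intron and any untranslated flank.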
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCTTGA CCAGCAGGGA ACTCAACTAT TTGGTGTGGC GCTACCTCCA GGAAGCGGGA TTCGAACTAG CGGCGTTCGC TCTCGAGAAA ATTGCTTCCT GCCTGCATTA TGAATCAGAA ACGTCGTCTG CCATAATTTC CAAGATTGAA CCAGGGTGTC TCGTCAACTT GGTGCAAAAA GGCATTTTGT TCTCGCTAGC TGAAGAAGAA GCCGAAAGAG ACAAAGATGA TACCGATGGG TCAAAACTAT CGTTATTTGG CGCCTTGCTT TCTGACCATC TCGAACTAGA CACAAGTAGT GAAGAAAAGG AAGAAGTCAG CAAACGCTTT TTGCTGAAGG CAGAAGCCAA ACTCAACGGC ACTCATGAAG ATGAAGACGT AGAAATGAAA GACGAGACAT CGAATGACAA CCAAAGTGAC AGTAATCACG AAAATCAAAT ACAGAATGAT CTTGCTGAAC CTGAAGATTC TCAAATGATT CCTAAGAGCT TCACTACCCA GTCGTTGCAG CCTATAATCA CTTATGGCCA GAGTCTAACG TCCGAATGGC ATCCAAGTAC GTCAGTGTTT GCCTACGGGA AGAGCAACTC TAGTGCCGTC ATCAATGCTA TCAAAGACGG CGCCATTGCT GAATCAGTGA CGTTGGTGCA TCCCAATATT CTGAACTTGA AAAACGAAAT CAACATTGTT TCTTGGGCTC CACAAGGAAA CTTGATAATC ACAGCCGGGA TAAACGGAGA ACTACGTGCA TGGTCTCCGG ACGGAAAGTT GCGCAACATT GCAAACTTGC TCTCGGACGA TATACCCGTA TCCGCGTCTG ATGTTGAACC AATAGAAAGG ACGTCTTCAG TGATAACGAA TGTGCTCTGG AACGAGAATG GCCAACTTCT TCTTTCTTTG GACAGTAACA ACCAGGTCAG TCTCTGGGAC GGAAACACGC TTTCCTTGAT TAAACAAATC AATCCACCTG TAGCTCCAGC AGATAACATA TCCGTAATTA CGGCATGCTG GCTCAATGAA GATAAATTTG CACTTTCGAC AGTGAAAAAC TCGATAAAGA TATACTCAAT TACACCACAA CAGTTTGGCA GCTTGAACCA GCTCGATGTC CAAACTATAG GATTTCTCCA TGGCCATGAA AACAGCATCT CCTTGCTTAA ACTTAATAAC AAATCGAAGT TGCTAGCTTC TTGTTCTGAC TACGACTACC TCATAAAAGT GTGGATCCGA GGATCTTCCC AGGAGAGTTT GGATTTGAAT ACCACGAAAG ACGAAACCTC AAAACTCCAC ACATCGCCTA TTGTAGCCTT GGAATGGCTC GAGGGGTTTG ATGATGTCAG TACCTTGCTC AGTGTTTCAA TGGAAGGTAT ACTTAACATT TGGAATTGCT CTACCGGAGA AACCATCAAG AGTTCAGATC TCTTTAGCTA CAAGGACAAC TTTAAACTCG ATAGCGACGA TGATTTCCAC CATTCACACG ACTTGTTGGT GTTTGATGCT TCTTTATCGC CCAACCAAGA ATATTTAGCC TTAGCTGATG ATTTGGGACG AGTCACTGTC TGGGACGTTT CCTTGAGTCG CTACTCCAAG AACGATCCTC GTGACTTTGT CAGATGTCTA GCAGTGTATA ACTTCCAGAT TCCAGATCAT GTCGACAATG ACGACACCAA GGTAGGTATT TGTGACGTCA AGTGGGATAA CGAGTCGTCC AACATCTCTG TTTCCTACAG TGGAGCTGAT AGCGTGGTGC TAAAATGGAA ATAGTGGGAC TAGAGGCTGT AGAGCGTTAA ATTATATATT CTACAAGTGT ATT
|
Protein sequence | MSLTSRELNY LVWRYLQEAG FELAAFALEK IASCSHYESE TSSAIISKIE PGCLVNLVQK GILFSLAEEE AERDKDDTDG SKLSLFGALL SDHLELDTSS EEKEEVSKRF LSKAEAKLNG THEDEDVEMK DETSNDNQSD NSQMIPKSFT TQSLQPIITY GQSLTSEWHP STSVFAYGKS NSSAVINAIK DGAIAESVTL VHPNISNLKN EINIVSWAPQ GNLIITAGIN GELRAWSPDG KLRNIANLLS DDIPVSASDV EPIERTSSVI TNVLWNENGQ LLLSLDSNNQ VSLWDGNTLS LIKQINPPVA PADNISVITA CWLNEDKFAL STVKNSIKIY SITPQQFGSL NQLDVQTIGF LHGHENSISL LKLNNKSKLL ASCSDYDYLI KVWIRGSSQE SLDLNTTKDE TSKLHTSPIV ALEWLEGFDD VSTLLSVSME GILNIWNCST GETIKSSDLF SYKDNFKLDS DDDFHHSHDL LVFDASLSPN QEYLALADDL GRVTVWDVSL SRYSKNDPRD FVRCLAVYNF QIPDHVDNDD TKVGICDVKW DNESSNISVS YSGADSVVLK WK
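The record lists translation table 12, the alternative yeast nuclear code, in which CTG is read as serine rather than leucine. The sketch below illustrates that difference on the first 120 bases of the gene sequence above. Only this leading stretch is used because the gene is marked as spliced and the record does not give the intron position, so translating the full genomic sequence directly would not be expected to reproduce the protein. The names `CODON_TABLE_12` and `translate` are hypothetical and introduced only for this example.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for table 12 in TCAG codon order (TTT, TTC, TTA, ...).
# It matches the standard code except that CTG encodes S instead of L.
TABLE_12_AAS = "FFLLSSSSYY**CC*WLLLSPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE_12 = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), TABLE_12_AAS)
}


def translate(dna):
    """Translate DNA in frame 1, ignoring spaces and any trailing partial codon."""
    dna = dna.replace(" ", "").upper()
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE_12[dna[i:i + 3]] for i in range(0, usable, 3))


# First 120 bases of the gene sequence, copied from the Sequence table.
first_120 = (
    "ATGTCCTTGA CCAGCAGGGA ACTCAACTAT TTGGTGTGGC GCTACCTCCA GGAAGCGGGA "
    "TTCGAACTAG CGGCGTTCGC TCTCGAGAAA ATTGCTTCCT GCCTGCATTA TGAATCAGAA"
)

print(translate(first_120))
# MSLTSRELNYLVWRYLQEAGFELAAFALEKIASCSHYESE
```

The output agrees with the first 40 residues of the protein sequence, including the serine at position 35, which comes from a CTG codon and would be leucine under the standard code.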