Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_80641 |
Symbol | |
ID | 4851404 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009068 |
Strand | - |
Start bp | 1728821 |
End bp | 1731226 |
Gene Length | 2406 bp |
Protein Length | 606 aa |
Translation table | |
GC content | 44% |
IMG OID | 640393112 |
Product | predicted protein |
Protein accession | XP_001387971 |
Protein GI | 126274512 |
COG category | [S] Function unknown |
COG ID | [COG5373] Predicted membrane protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATTGCCCATC AATTGTACCT AAAAGCCTGT TTTCAATCCG GTAATACGCT CCTGTCTTTC TCTAGGGTGC ATTGACCTCA TTATCCACTC TGTCATCGTT TGATTTACCA GATCATCATT TTCATATTTA GATCAAAGTA CTAAACTGAG ATCTTATATT ACTGAAAATT ACAACTACAG TCTCAAAGTT ATCAATTGCT TAGTCTATAC TGCTATAGTC TATCATTGCA GAGATATCAC ATTGAACTAT ATCTTTTATC ATTGAGTCTA CTTGGTCTGA GTTAGTCTTT CAATAATTCG ATATCAATAC TCACTTCGTC TTCAGTCTTT CACTTTATCA CTAAGTCTAT TAGTTAAAAT TTAAGATGGA AAAAATCGAC TTATCTTCCA ACTCCAGAAA GATCCAAGAC GCCTACGATA AACTCGTACG TGGTGATCCT TCTGTCAACT ACGTCGTCTA CTCTGTCGAT GCCAGTCTGA CTCTTGAAGT CTCACAAACT GGAAATGGAT CACTTGAAGA ATTCGTAGAG AATTTCAGTG ACGGTCGTAT CCAGTTTGGT TTGGCCAGAG TCACCGTTCC AGGCTCGGAT GTTTCCAAGA ACATCTTGCT AGGCTGGTGT CCAGACAATG CTCCTTCCAA GTCGAGATTG TCGTTTGCTT CTAACTTTGC AGAGGTTTCC AAAGTCTTGA GTGGCTACCA TGTTCAAATT ACTGCTAGAG ACCAGGATGA TTTGGACATC GACGACTTTG TCCAGAGAGT CCGTGCTGCT GCCGGAGCCT CTTATTCACT CAACACTGGA ACTGTCAAAG CCGCTGCTGC TCCTGTTCCT AAACCTGTTG TAGTCAAACC AATCGTAGCC AAGCCTTCCA AGCCTGCTAG TGTCAGTGGA ACATCTTTCA TTCCAAAGTC AACTGGTAAG CCTGTGGCAC CTGTCAAACC GAAGCCCGTA TTGCCTGCTC CTAAAGCCTT TGGCCAGCCA AAGCCCGTAG CTTCTTCCAA CGACGGCTGG GGCGATGCCC AGGATGTCGA AGAAAGAGAC CTCGACTCCA AACCTTTAGA GGATGTTCCT TCGGCCTACA AGCCTACGAA AGTTAACATC TCGGAATTGA GATCTCAGAA ATCTGACACT ATTTCTTCGA CTCCAAAGCC ATTTAAGGCT GAGCCAAAGC CAGCTGAAAA GGATGACGAC AATGAGCCAA AGTCTTTATC TGATAGAATG AAGACGTACA AGTCGTTTGA ACCTTCTTCC GACGGAAGAT TGACAAGTTT GCCAAAACCA AAGGTATCTC ACTCGGTCAA TACTCGTTAT AAGCTGGAAG CTCCTTCTTT CGGTGCTAAG CCTACTTTCG GTCAATCTGA CGACTCCAGA AAGGACAAAG TCGTTGGTGG ATTATCCAGA AACTTTGCTG CTGAAAACGG CAAGACTCCA GCGCAAATTT GGGCTGAGAA GAGGGGCCAG TACAAAACTG TAGAAGCTGG GGAATCGGAA TCGGGCGAAG TCCATGCTCA CAGCTCTGAT TTGGCTCATA AGTTCGAGGA ACAAGTCAAA TTGCACGAAC AGGAAGAAGA GGAAGAATTG GCTAAGCAAC ATGAACCTGT AGTTATTAAG CCTTCTACCT TCCCCAAGAA GCAATTTGAT GAACCAGAGG AAGAAGAGGA AGAAGAGGAG GAGGAAGAAA AGCCTACTCC ATCTTTACCT GTTCGTTCTT TGCCTCCTCC ACCAGCTAGA GTTGTAGAAC CTGAGCCAGA AGCAGAGGAA GAAAAGGAAG AAGAGGCTCC AGCTGCTTCT TTACCTTCTC GTAGTTTACC TGCTCGCAAC TTACCTCCTC CTCCAGCTCC AGCCGCTGAA CCTGAAGAAG AAGAGGAAGA AGAAGCACCA GCTCCATCGT TGCCTTCCAG AGAAGCTGAA CCTAAGAAGG ATGGTGCTAG TGCTGTAGCT GAGTATGACT ACGTCAAAGA TGAAGACAAC GAAATAGGCT TTGCAGAAGG CGACTTGATC GTTGAAATCG AGTTCACTGA TGAAGAGTGG TGGACTGGTA AGCACTCCAA GTCGGGCGAA GTTGGCTTAT TCCCGGCTGC TTATGTGTCG TTGAAGAAAG AAGAAGAAAA GGCTACTGAA CCGGAAACTA AGGCTGAACC TGTCGTTGAA AAGAAGTCTG AAGGAAGAAG TGCTACTGCT GAGTACGATT ACGAGAAGGA CGAAGATAAT GAAATAGGAT TCGCTGAGGG TGACGTGATT GTTGAAATCG AGTTTATCGA CGATGATTGG TGGTCTGGAA AACACTCTAA ATCCGGTGAG GTAGGTTTGT TCCCAGCCAA CTACGTCAGT TTGATCTGAT ATGTAAATTT CTTTAAAACT TTATCTATAT GTGTCCTGAG ATCAATGCAA AAATGAAATG ATAACT
|
Protein sequence | MEKIDLSSNS RKIQDAYDKL VRGDPSVNYV VYSVDASLTL EVSQTGNGSL EEFVENFSDG RIQFGLARVT VPGSDVSKNI LLGWCPDNAP SKSRLSFASN FAEVSKVLSG YHVQITARDQ DDLDIDDFVQ RVRAAAGASY SLNTGTVKAA AAPVPKPVVV KPIVAKPSKP ASVSGTSFIP KSTGKPVAPV KPKPPKPVAS SNDGWGDAQD VEERDLDSKP LEDVPSAYKP TKVNISELRS QKSDTISSTP KPFKAEPKPA EKDDDNEPKS LSDRMKTYKS FEPSSDGRLT SLPKPKVSHS VNTRYKLEAP SFGAKPTFGQ SDDSRKDKVV GGLSRNFAAE NGKTPAQIWA EKRGQYKTVE AGESESGEVH AHSSDLAHKF EEQEEEEKPT PSLPVRSLPP PPARVVEPEP EAEEEKEEEA PAASLPSRSL PARNLPPPPA PAAEPEEEEE EEAPAPSLPS REAEPKKDGA SAVAEYDYVK DEDNEIGFAE GDLIVEIEFT DEEWWTGKHS KSGEVGLFPA AYVSLKKEEE KATEPETKAE PVVEKKSEGR SATAEYDYEK DEDNEIGFAE GDVIVEIEFI DDDWWSGKHS KSGEVGLFPA NYVSLI
|
| |