Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_28943 |
Symbol | |
ID | 4851683 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009068 |
Strand | - |
Start bp | 2533282 |
End bp | 2534490 |
Gene Length | 1209 bp |
Protein Length | 383 aa |
Translation table | |
GC content | 43% |
IMG OID | 640393391 |
Product | predicted protein |
Protein accession | XP_001387059 |
Protein GI | 126275249 |
COG category | [R] General function prediction only |
COG ID | [COG5415] Predicted integral membrane metal-binding protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGTGTGT TCGGTTTCTT CAGAAGAGGA TTCGACCCCG ATTCCTTTGA GAAGGAGTTG ACGTCGGTGA CAGAATCGAT TTCGAGTACC AACCAGCAGA TTTCGCGATT GAGATTAAGG TCCAGGTCTA TCCGTAGATC GTTAGCAATA TATCTGCTAA TCGTCTATGT TGGCATCATT TCGTACGATT ACACTTCATT GCCGTCGCAG GTCGTTGGAC CGGACAAATT GCGTAAGTTT TTGGCGTATC AAACTAGAAA CCAACTTCTT GTACTCTTGT TATTTCCAGT GGCGGCGTTC GCTTTGGTCA GAGTAGTTCG TATCTTATTT GATTTTCTCA TCAGAAGTCG AGAAAACAAA TTGAAGTGGT TGAAGAAGAA GCACAAGGAG AAAATCGAGG AATTGAAGAA AATCACCAAC TTCACCACAA CTGAAAAACT CTTGAATAAG TACGGAGACA CCCCTAAAAA GCCCTCAATT ACTCCGAACA ACACCAATAA GAACATTACT GCCAAGCCCA AACAGCTTCC AGCAACTGCA AACGTCGCCA AACCACCGGC AGGCCAATTG AGTCAACAAG AATTGAACAA ATTGAATCTC AACATCACTC CTCGGGATGT TGTAGCACCA ACACACACAC CTCAATTTCA ACAAAACAAG CAGCCGCAGC ATCCACAACA ACAGAAATCG GTGGCTCCTA TTCGTAGATC GATTCAGGAT CGCCTTTTAG ATATGTTAAT CGGCTCTGAG AACAACGAAT CTGTGGAGTC TCGATATGCT CTCATCTGCT ACAATTGTTT TACCCATAAC GGTCTTGCCC CACCAGGTAC TAGCGATCCT GCCACTGTGG TGTATATCTG TATGAAATGT GGAGTCATGA ATGGCGAACT AAATGAAGAA AAGCTGATAC AAGAGGACCA TGTTGATACG GCTCTGGTTA GTCCGGTTGA CAAGACTCTG CAATTGATAG ATAACGAAGC AAAGTCGAGT CTGAAAGACA AGTTAGAACA AGTACAAGCA GAAGTTGAGG AGCGTAAGGA AGAAGTTCAA CAGGAACAGA AGCAAGAAGA AAAACAATCT GAAGAAGTTC AACAAGAGCA AAAGCAAGAA GAAGAGCAAC CGGAAGACAA AGAAGTAGTT AAACAAGTGC AACAGCACGA AGAACAAGAA AAACAGGTAT CAGAATCAGA GTCAAAGTCA GACTCTTAG
|
Protein sequence | MGVFGFFRRG FDPDSFEKEL TSVTESISST NQQISRLRLR SRSIRRSLAI YLLIVYVGII SYDYTSLPSQ VVGPDKLLAA FALVRVVRIL FDFLIRSREN KLKWLKKKHK EKIEELKKIT NFTTTEKLLN KYGDTPKKPS ITPNNTNKNI TAKPKQLPAT ANVAKPPAGQ LSQQELNKLN LNITPRDVVA PTHTPQFQQN KQPQHPQQQK SVAPIRRSIQ DRLLDMLIGS ENNESVESRY ALICYNCFTH NGLAPPGTSD PATVVYICMK CGVMNGELNE EKLIQEDHVD TALVSPVDKT LQLIDNEAKS SLKDKLEQVQ AEVEERKEEV QQEQKQEEKQ SEEVQQEQKQ EEEQPEDKEV VKQVQQHEEQ EKQVSESESK SDS
|
| |