Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_32667 |
Symbol | |
ID | 4840081 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009046 |
Strand | + |
Start bp | 357941 |
End bp | 359487 |
Gene Length | 1547 bp |
Protein Length | 507 aa |
Translation table | 12 |
GC content | 40% |
IMG OID | 640391396 |
Product | predicted protein |
Protein accession | XP_001385418 |
Protein GI | 150865977 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.0357822 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCACCCAA TGATATCACG AGATTACACG CCGAACAAGG CTCAGCCTTC GGTACCTACG GGTACTAGTG CCCCTAATAA TACCCCAAAG GGAAAGGAAG TACCCTTTGA GAACCATTTA AAATCAGTAG AGGACCGTAG TAATACGATT GTTGAAATGG AATTGGATCC GATATCATCT TCCAATATCG CATATAACAA AGAATATGCT CGTTCAATGG ATACGAATCT TCTTACCTCG GATATTTCAC GAGAGTCTTC AACAACAAGT GTACAAAGTG CCTTCATGAA ACGTGGTAAA GACTCGGGAA GACGGAAGTC TACTCCTTTT GAATCAAGTC TGTCGTTGTA TAGCACAGAA TCTACACCAG TGCGTTCAAC AGTAGGAGGT GGTTCTATTA ATAATGGTAA CGGTTCATTA CTTTCATCGC CAATGTCACC TCGACAGTAT ACTACCCATC AGGCTCAACA GCGAACTATC AGTGGTAGCA GTAGAAGTCC CATGACATCT AATCGTAAAT TTATCGGAGC CTTGACTCCT TTGGTGGCTG GATCTGTACA ATCTATATCT GGCTCTGATG CCAAGGCCGC GTTAGCTTCC AGTAGCGAAG CCGATAAAAA GAGACTTGTA GATCAATTCT ATAATTCCGT GAATGAATCG AATCCCGTAA GTCGAGCTTC AAGTAGTTCT GAGTTGGCGC AGCATTTCAA GAGTACTTCC ACCAACTTGC AATCGCCTAC AGCTGAAAAA GCGTTGAAAT GGATAGAGAA CTCCAGTTCA AATGAAAGTG ACCAGAAACC ATTCAAAATA CGAACCACTC CCAAAATCAT TAACTATACA GAAGAGAAGT TTAATAGCAT AATCAACAAT GGAGGGTTCA CATACACTTC TAACAGTTCA AAGTTGATAG AGGACCAAGA GCTTACAAAT TACTTATTAA ACATTGACAC AATTCTAGAG GACCCCAATA AAGAGAACGA CAACAATGGA AAAAGCAAGC GGCACGACCA CCATAAACAG AACAAGGATC TGTTGGAACC ATTGGCTACG ATAATGAAGT CACTTTCTCT GACATTAGTA GAAAAATCGA AGTTAAACTT AATTGACGAT GACGACGATC ACATTGAATC GTTGGCACAA TTAAACGGGT TGAGCAAGTA TTTGCTGGAC TTGCACAACA GTACGAACGA GTTGCTCCAA AGGTTGATAT TGAACCGAGA GGAAATCAAG TCCAATTATA GAAGTGAAAT CAAGGATAGT TTGAACCGAC TCAGTGATCT TTCAGCAGAG CTTAACAACC TAGAGATAAA GCTTTCAGTG ATCAAGAATA AAATAAATGA CAGTAAAACT GTGATGTCGA GAGAAATGGC AGATAAAATA GAGTTGTTGG AGTATGTTAA TGACCGATTC AAGGACTACT CCACACAAAA GAGAAACATG CGTTTCAAGC AGTACAATAT TGCATTAGCT GTCTTAGTGG TGGTGGTAAG TATATACATT GGATATAGAT AGCAGTTATG AGAAATGAAT ACAAAAGGGT TACTTAG
|
Protein sequence | MHPMISRDYT PNKAQPSVPT GTSAPNNTPK GKEVPFENHL KSVEDRSNTI VEMELDPISS SNIAYNKEYA RSMDTNLLTS DISRESSTTS VQSAFMKRGK DSGRRKSTPF ESSSSLYSTE STPVRSTVGG GSINNGNGSL LSSPMSPRQY TTHQAQQRTI SGSSRSPMTS NRKFIGALTP LVAGSVQSIS GSDAKAALAS SSEADKKRLV DQFYNSVNES NPVSRASSSS ELAQHFKSTS TNLQSPTAEK ALKWIENSSS NESDQKPFKI RTTPKIINYT EEKFNSIINN GGFTYTSNSS KLIEDQELTN YLLNIDTILE DPNKENDNNG KSKRHDHHKQ NKDSLEPLAT IMKSLSSTLV EKSKLNLIDD DDDHIESLAQ LNGLSKYLSD LHNSTNELLQ RLILNREEIK SNYRSEIKDS LNRLSDLSAE LNNLEIKLSV IKNKINDSKT VMSREMADKI ELLEYVNDRF KDYSTQKRNM RFKQYNIALA VLVVVIAVMR NEYKRVT
|
| |