Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_40652 |
Symbol | |
ID | 4836772 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009042 |
Strand | - |
Start bp | 604215 |
End bp | 605507 |
Gene Length | 1293 bp |
Protein Length | 430 aa |
Translation table | 12 |
GC content | 45% |
IMG OID | 640388087 |
Product | predicted protein |
Protein accession | XP_001382878 |
Protein GI | 150864161 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.470816 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATGAAAT CAAGTCTTTG CAAGCAGGCT TTGACGGCTT CGTCTATGAA TTACAAGCTG GTACAGGCAA CAGCCATCAG AAGCTTCCAC GAGTCACAGA TCCATTTCAA CAAAGAACAA GCACCACAAG GATCGCCTTT AAAGGTCTTT TTCGATACCT TCAAGAACGA AGTCAAGAAA TCGAACGAGT TGAAGGAAAA TATCAAGGCT TTACAGGATG AGTCTGGAAG AATGGCTGAA TCTGAAGCTT TCAAGAAGGC CAGAGAAGCT TATGAGACAG CACAGAAAGG TAGTAATGCT GCTGGTAAAG TGCTCAAGCT GACTGCAGAC GTTGTAGGCG GTGCTGCTGT AAAAGCATGG GATTCTCCTG TTGGTAAGGG AGTCAGAACT ACAGTACGTG TAAGTGCTGA AGTAGCAGAC AAAGCCTTTG AGCCTGTAAG ACAGACACAA GTCTACAAGG ATGTTTCTGA AGTTATTGAT GATGGTTCGT CGACTTCATA TGGTGGATTT TTGACAAAGG AGCAGAGACA GAGATTGAGA GAGAAGGAAT TGGAGGAAAG AGCCAGAAAG GGAGTCAAGG GTCCTGTTCG TGAGAACGAA GAGGCTGGCG GAGAGTTAGT AGCCACTGAA CATAAGGCTT CAGGTCCATC TGTTGGTGAA AGATGGGAAG AGTACAAACT TAAAACACCT GTGGGCCGTT TCTTTACGTA CTTGCAAGAG AAATGGCAGG ACTCTGAGAA TGGCTTGATT TCACTTATAA GAACCATCAT TGAAAAAGTA ACAGGGTTCT TTGCCGAAAC TGAACAAGCC AAGGTGGTTA AGCAATTTAG AATGATGGAT CCTTCTTTCC GTTTAACCGA CTTCCAGAAG ACATTGACCA ACTACATTGT GCCCGAGATC TTAGATGCCT ACATCAAGAA CGACGAAACC GTGTTGAAGC AATGGTTCTC CGAAGCTCCC TTCAACGTCT GGCAAGCCAA CAATAAGCAG TTCATCCAAC AGGGCTTGTT CCTGGACGGT CGTATCTTAG ATATCCGTGG TGTTGAAGTA GTCACATGCA AGCAGTTGCA ACCTAACGAT ACTCCTGTCA TTGTTGTCAG TTGTCGTGCC CAAGAGGTTC ATTTGTACCG TAAGGCTAAG ACAGGTGACA TTGCTGCAGG TACCGAGGAT CATATCCAGT TGAGCACATA CGCTATGGTT CTTACAAGAG TTCCAGAAGA ATTCGACAAC GCCACTACGG AAGGATGGAA GATCATAGAG TTCGCTCGTG GTGGTTCCAG ACCTTTCCAT TGA
|
Protein sequence | MMKSSLCKQA LTASSMNYKS VQATAIRSFH ESQIHFNKEQ APQGSPLKVF FDTFKNEVKK SNELKENIKA LQDESGRMAE SEAFKKAREA YETAQKGSNA AGKVLKSTAD VVGGAAVKAW DSPVGKGVRT TVRVSAEVAD KAFEPVRQTQ VYKDVSEVID DGSSTSYGGF LTKEQRQRLR EKELEERARK GVKGPVRENE EAGGELVATE HKASGPSVGE RWEEYKLKTP VGRFFTYLQE KWQDSENGLI SLIRTIIEKV TGFFAETEQA KVVKQFRMMD PSFRLTDFQK TLTNYIVPEI LDAYIKNDET VLKQWFSEAP FNVWQANNKQ FIQQGLFSDG RILDIRGVEV VTCKQLQPND TPVIVVSCRA QEVHLYRKAK TGDIAAGTED HIQLSTYAMV LTRVPEEFDN ATTEGWKIIE FARGGSRPFH
|
| |