Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_55211 |
Symbol | |
ID | 4836966 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009042 |
Strand | - |
Start bp | 1734205 |
End bp | 1735200 |
Gene Length | 996 bp |
Protein Length | 331 aa |
Translation table | 12 |
GC content | 42% |
IMG OID | 640388281 |
Product | predicted protein |
Protein accession | XP_001383099 |
Protein GI | 126133148 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG3435] Gentisate 1,2-dioxygenase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.110005 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTCCAG CTGCTACACC ACAAACTCAG GAAACTCAGG ACAATGATGC CTTTTTAAAG AGCATGCCCT TAGAGAATGT CGCACCCTTA TGGCATATCT TGAAGGACTT ATCTCCTCCA AAGCCAAAGC CAACATCTGT GCCTCACCTC TGGAACTACA AAAAATTGAA ACCAATCTTA GATGAATCTG GGCGTTTAGT ACCAACTGAA CTTGCCGAAA GAAGAGTGCT TATGTTGGTT AACCCAAAGT TGACAGGACC ACGTACAACT GAAACTTTAT ACGCTGGTCT TCAATACATT AAACCTGGTG AAGTTGCTCC AGCACACAGA CATGTTGCCT TTGCTTTTAG ATTCATTCTT GAGGGACAAG GTGGATTTAC TAATGTTGAA GGTACAAGAA TGACTATGAA GAGAGGTGAT GTGCTTTTGA CCCCAAGAAA CTGCTGGCAC GATCACGGTA AGGACGGAAG CGGTCCAATG ATCTGGTTGG ATGGTTTGGA CTTGCCAATT TTCCAAACTA TTCCAGTCAA CTACACTAAC CACTACGAAG AAGATAGATT CCCAGCAGTT GACAATGATA ATACACCAAT GAAGTTCCCA TGGCAACCAG TTCAAGATAA GCTTGACAGT ATTAAAGGTG ACTATGCTAT TTTCGAATAC CGTGACCAGG AAAACCCTGA AAAATTTGTA TCTTCCATTC TCGGTGCTGA AGCTTTGAGA ATTTCTCCAA ATGCTTCTAC TCCTGTACGT CAGGAAAACA GTTCCTTCGT TTTCTGTGTC TACGAAGGAA AGGGTCACAC TATCGTCTAC GGTGATGATG GCGAAGAAAA TGTTTTAAAC TGGGAAAACA GTGATGTGTT CTGTATTCCA TGTAACATGC CATTCAAGCA CTTTAACGAC AGCAGCAATG AACAAGCCTA CCTCTTTAAC TTCTCGGACA CCCCATTGCT CAAGAACTTA AATATCCATA GTTCCGATTT GGAAGTTAAG AACTAG
|
Protein sequence | MAPAATPQTQ ETQDNDAFLK SMPLENVAPL WHILKDLSPP KPKPTSVPHL WNYKKLKPIL DESGRLVPTE LAERRVLMLV NPKLTGPRTT ETLYAGLQYI KPGEVAPAHR HVAFAFRFIL EGQGGFTNVE GTRMTMKRGD VLLTPRNCWH DHGKDGSGPM IWLDGLDLPI FQTIPVNYTN HYEEDRFPAV DNDNTPMKFP WQPVQDKLDS IKGDYAIFEY RDQENPEKFV SSILGAEALR ISPNASTPVR QENSSFVFCV YEGKGHTIVY GDDGEENVLN WENSDVFCIP CNMPFKHFND SSNEQAYLFN FSDTPLLKNL NIHSSDLEVK N
|
| |