Gene Information |
Locus tag | PICST_50672 |
Symbol | |
ID | 4841050 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009048 |
Strand | + |
Start bp | 185792 |
End bp | 187081 |
Gene Length | 1290 bp |
Protein Length | 429 aa |
Translation table | 12 |
GC content | 43% |
IMG OID | 640392365 |
Product | hypothetical protein unknown function |
Protein accession | XP_001386433 |
Protein GI | 150866739 |
COG category | [M] Cell wall/membrane/envelope biogenesis [R] General function prediction only |
COG ID | [COG4948] L-alanine-DL-glutamate epimerase and related enzymes of enolase superfamily |
TIGRFAM ID | |
|
|
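The coordinates, gene length, and protein length reported above agree with each other if Start bp and End bp are read as 1-based, inclusive positions and the last codon is a stop codon. Both readings are assumptions; the record does not state them. A minimal Python sketch of that arithmetic, using only values from the table:

```python
# Values copied from the Gene Information table.
start_bp = 185792
end_bp = 187081

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS is unspliced (Is gene spliced = No) and ends in a
# stop codon, which is not part of the protein.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)      # 1290
print(protein_length)   # 429
```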
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.482129 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTGGTAA AAGACTTCCC ACAGATCAAA AGTATCAAAA CTTTCATTTT CAACCAACCC GGCGTCGGCG GAGACTACCA TAATGTTGAA AGGGGCCATT GGTTGATCGA CCATCCCATT GCCAACCCCA TGTCCAAGTT TGAAGAGTAC CGTGCCTCTC GTGTCAGTTG GGGAATCAAT GTTTTGGGTT CTTTCTGTGT TGAAATCGAA GCAACGAATG GCGTTAAGGG ATTTGCTACT GGTTTTGGAG GCCCACCTGC TTGCTGGTTA GTAGCCAATC ATTTCAGACG TTTCTTGATT GGAGCTGATC CAAGAGATAC TACTTTGTTA TGGGATAAGA TGTTTAGAGC TTCCATGTTT TACGGTAGAA AAGGTTTGAC TGTGGCTGTC ATCAGTGTCA TAGATTTGGC TATCTGGGAC TTGTTGGGAA AGTTGAGAAA CGAGCCCGTC TACAAGATGA TTGGAGGTGC TACTAGAGAA AGATTGGACT TTTACTGTAC TGGCTGTAGA CCAGACATAG CTAAGGAAGT TGGTTTCTGG GGAGGTAAAG TTGCTTTACC TTATGGTCCA GCAGAGGGTC ACGATGGTCT TAGAAGAAAT GTCGAGTTTT TGAGAAAGCA TCGTAAGTCC GTAGGACCAG ACTTCCCCAT TATGGTAGAT TGCTACATGT CTCTTAATGT ATCGTATGTT ATCGATTTGG TAAATGCTTG CAAAGACTTG AACATCAACT GGTTTGAAGA GGTCTTGCAT CCAGATGACT TTGACGGTTT CCAGAAGTTG AAGAGTGCCT GCCCGTGGAT GAAATTCACA ACTGGTGAAC ATGAGTACTC CAAGTATGGA TTCAGAAAGT TGATCGAAGG TAGGAATGTA GACATCTTGC AACCTGATAT CATGTGGGTC GGTGGTCTTA CTGAAATCCT CAAGATCTCT CATCAAGCTG CTGCCTACGA TATTCCAGTA GTTCCACATG CTTCTGGTCC ATATTCGTAC CATTTTGTAA TCTCTCAAGA AAATACTCCA TTCCACGAAT ACTTGTCGAA CTCTCCGGAC TCGATGTCTG TGTTGCCAGT ATTTGGGGAA CTTTTCACCG ATGAACCAGT TCCTACAGAA GGTTATTTGC TGATTACGGA ATTTGACAAA CCTGGGTTTG GCTTGACTTT GAACCCAAAG ATCGAGTTGA TCAATGGCGA CTGCTTATTA TCGCCTAATC CAGAAAGACC ATTAAGTATT CAAAATGGAA ATGGACATGC AAAGACCAAT GGCAATGGAA CCATCAAGAA TGGCCATTAG
|
Protein sequence | MSVKDFPQIK SIKTFIFNQP GVGGDYHNVE RGHWLIDHPI ANPMSKFEEY RASRVSWGIN VLGSFCVEIE ATNGVKGFAT GFGGPPACWL VANHFRRFLI GADPRDTTLL WDKMFRASMF YGRKGLTVAV ISVIDLAIWD LLGKLRNEPV YKMIGGATRE RLDFYCTGCR PDIAKEVGFW GGKVALPYGP AEGHDGLRRN VEFLRKHRKS VGPDFPIMVD CYMSLNVSYV IDLVNACKDL NINWFEEVLH PDDFDGFQKL KSACPWMKFT TGEHEYSKYG FRKLIEGRNV DILQPDIMWV GGLTEILKIS HQAAAYDIPV VPHASGPYSY HFVISQENTP FHEYLSNSPD SMSVLPVFGE LFTDEPVPTE GYLSITEFDK PGFGLTLNPK IELINGDCLL SPNPERPLSI QNGNGHAKTN GNGTIKNGH
|
| |
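Translation table 12 is the NCBI alternative yeast nuclear code, which reads the codon CTG as serine rather than leucine. This is why the protein sequence begins MSVK although the second codon of the gene is CTG. The sketch below illustrates the difference on the first 30 bases of the gene sequence. The helper names build_table and translate are hypothetical, chosen for illustration; this is not the pipeline that produced the record.

```python
# Standard genetic code in NCBI order (first, second, third base over T, C, A, G).
BASES = "TCAG"
STANDARD_AA = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def build_table(overrides=None):
    """Return a codon -> amino acid dict, with optional codon reassignments."""
    codons = [a + b + c for a in BASES for b in BASES for c in BASES]
    table = dict(zip(codons, STANDARD_AA))
    table.update(overrides or {})
    return table


STANDARD = build_table()
# Table 12 differs from the standard code in its sense codons by CTG -> Ser.
TABLE_12 = build_table({"CTG": "S"})


def translate(dna, table):
    """Translate a DNA string codon by codon, ignoring spaces and any trailing partial codon."""
    dna = dna.replace(" ", "").upper()
    usable = len(dna) - len(dna) % 3
    return "".join(table[dna[i:i + 3]] for i in range(0, usable, 3))


# First three 10-base blocks of the gene sequence from the record.
gene_start = "ATGCTGGTAA AAGACTTCCC ACAGATCAAA"

print(translate(gene_start, TABLE_12))  # MSVKDFPQIK (matches the record)
print(translate(gene_start, STANDARD))  # MLVKDFPQIK (CTG read as Leu)
```

Translating with the standard code would place a leucine at position 2, so the table number matters when re-deriving the protein from the nucleotide sequence.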