Gene Information |
Locus tag | PICST_44075 |
Symbol | |
ID | 4838193 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009043 |
Strand | + |
Start bp | 786476 |
End bp | 787615 |
Gene Length | 1140 bp |
Protein Length | 379 aa |
Translation table | 12 |
GC content | 48% |
IMG OID | 640389508 |
Product | predicted protein |
Protein accession | XP_001383431 |
Protein GI | 126133813 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0075] Serine-pyruvate aminotransferase/archaeal aspartate aminotransferase |
TIGRFAM ID | |
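The coordinate and length fields above can be cross-checked against each other. The sketch below assumes 1-based inclusive coordinates and an unspliced CDS that ends in a single stop codon (the record lists the gene as not spliced); the function name `cds_lengths` is hypothetical and not part of any database API.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and one terminal stop codon
    that is counted in the gene length but not in the protein length.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1  # drop the stop codon
    return gene_len, protein_len


# Values taken from the Start bp / End bp fields of this record.
gene_len, protein_len = cds_lengths(786476, 787615)
print(gene_len, protein_len)
```

Under these assumptions the result agrees with the Gene Length (1140 bp) and Protein Length (379 aa) fields.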
| |
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTACTC CCTACAAGCA ACCACCCCAC AAATTGACGA TGATTCCTGG CCCCATCGAA TTCTCTGACG AGGTTCTTGG GGCTATGGCC ACACCTTCGC AGGCCCACAC TTCTCCCGAG TTTGTCAAAA CGTTCCAGTC GGTCTTGCAG AACTTGAGAA AGTTGTTCAA GTCTTCTGAC CCCGACGCAC AGGCCTATGT AATTGCTGGT TCTGGAACTT TGGGCTGGGA CATTGTTTCC ACTAACTTGC TTAGCCCAGG AGACAAAGTG TTGGTTTTGT CGACGGGATT CTTTTCCGAT TCATTTGCTG ACTGTTTGAA GATTTACGGA ATCGATGTTG ATGTCGTTAC TGCTCCTGTC GGAGGAGTGG TTCCGGTCGA AACCGTCGCT GAAAAGTTAA AGTCTACTAA GTACACAGCC ATTACCATCA CCCATGTTGA TACGTCGACT TCTGTGGTAA GTGACGTGAA GGCTGTTTCT GAAATCGTAA AGAAGGAATC GCCAGAAACG TTGATTGTAG TCGATGGAGT CTGTTCTATC GGGGTAGAAG ACTTGGAGTT CGACAAATGG GGTATCGATT TCGCCTTGAC AGCTTCACAG AAGGCCATTG GTGTTCCTGC TGGTTTGTCC ATCTCCTTTG CCTCGGCCAG AGCAGTGGCA AAAGCTTTGG CAAGAAAGGA AACTGTCTTC TTTGCCTCGT TGAAGAGATG GACTCCGATC ATGAAGGCTT ACGAATCCGG TAACGGTGCC TATTTTGCCA CGCCAGCCGT CCAGACTATC ACCGCTTTGA AGGTATCGTT AGATGATATC TTGAGTGGTA GCATCGATGA CAGATTTGCT AAGCACGCTG AAATCTCGTC TAAGTTCAAG TCGAGCGTTG AAAAGTTAGG CTTGAAGATA GTTCCTCTCA GCCACGATGT CGCTGCTCAC GGATTGACCG CTGTTTACTT CCCAGAAAAC ATCAATGGTG CCGACTTGCT TGCCAAGTTG AGCTCCAAGG GTTTCACCGT TGCTGGTGGT ATCCACAAGG CTTTGGTAGG AAAATACTTC AGAGTAGGTC ACATGGGCTA CTCAGTCTAC GCTGGACACG TAGACCAGCT CACCAAGGCT CTTGAAGAAT CATTGGACGA ACTCAAATAG
|
Protein sequence | MATPYKQPPH KLTMIPGPIE FSDEVLGAMA TPSQAHTSPE FVKTFQSVLQ NLRKLFKSSD PDAQAYVIAG SGTLGWDIVS TNLLSPGDKV LVLSTGFFSD SFADCLKIYG IDVDVVTAPV GGVVPVETVA EKLKSTKYTA ITITHVDTST SVVSDVKAVS EIVKKESPET LIVVDGVCSI GVEDLEFDKW GIDFALTASQ KAIGVPAGLS ISFASARAVA KALARKETVF FASLKRWTPI MKAYESGNGA YFATPAVQTI TALKVSLDDI LSGSIDDRFA KHAEISSKFK SSVEKLGLKI VPLSHDVAAH GLTAVYFPEN INGADLLAKL SSKGFTVAGG IHKALVGKYF RVGHMGYSVY AGHVDQLTKA LEESLDELK
|
| |
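The record gives translation table 12 (the alternative yeast nuclear code), which differs from the standard code in reading CTG as serine rather than leucine. The following is a minimal sketch of how the gene sequence could be translated and its GC fraction computed with the standard library only. The helper names `codon_table`, `translate`, and `gc_fraction` are hypothetical, and the sketch ignores alternative start codons and ambiguous bases.

```python
from itertools import product

# NCBI codon order: bases T, C, A, G with the third position varying fastest.
BASES = "TCAG"
STANDARD_AAS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def codon_table(overrides=None):
    """Build the standard codon table, then apply per-codon overrides."""
    table = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), STANDARD_AAS)}
    table.update(overrides or {})
    return table


# Translation table 12: CTG codes for Ser instead of Leu.
TABLE_12 = codon_table({"CTG": "S"})


def clean(seq):
    """Remove the spaces used to group bases in blocks of ten; uppercase."""
    return "".join(seq.split()).upper()


def translate(dna, table=TABLE_12):
    """Translate in frame 1 and stop at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = table[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna):
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases and last 15 bases of the gene sequence in this record.
print(translate("ATGGCTACTC CCTACAAGCA ACCACCCCAC"))  # MATPYKQPPH
print(translate("GACGAACTCAAATAG"))                   # DELK
```

The two fragments reproduce the start (MATPYKQPPH) and end (DELK) of the listed protein sequence. Passing the full Gene sequence field to `translate` and `gc_fraction` would allow the same comparison against the Protein sequence and GC content fields.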