Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_57266 |
Symbol | UGA22 |
ID | 4838078 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009043 |
Strand | + |
Start bp | 582347 |
End bp | 583861 |
Gene Length | 1515 bp |
Protein Length | 504 aa |
Translation table | 12 |
GC content | 45% |
IMG OID | 640389393 |
Product | succinate-semialdehyde dehydrogenase NADP+ dependent |
Protein accession | XP_001383394 |
Protein GI | 150864539 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | [TIGR01780] succinate-semialdehyde dehydrogenase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.876007 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTTAAGG TATTCACCCG ACTGGCTCTG TCGAAGCAGG TGTTAACTCA ATTCAAGAAT AAGGAATTGC TCAAGTTCAA AGGATATGTG AATGGAAAAT GGCTAGAAAG TAACGACAAA GCTACTTTCA AGGTATACAA CCCAGCCCAA AAAGATGGCC AATCTGACGA AATTGCTGAA GTTTACTCTT TCTCTCCAGA AGAGTACGGT ACAGCAATTG AGGCTGCCCA GACATCGTTC AATTCCTTCA AGAAGACTAC AGGAAGGTAC CGTTCAGATT TACTTTTGAA GTTGTACGAT TTGATGCTCA AGTACGAAGA CGACCTAGCA ACTCTCATAG TTCTTGAAAA TGGAAAGCCG TATGCTGATG CTCTTGGAGA GGTCCGATAT GCTGCTTCGT TTTTCCAATG GTTTGCCGAA GCTGCACCGC ATGTCACAGG AGATGTAATT CAATCAGCCA ATGCCTCTGC TAGGATACTT ACAATGAAAC AGCCTATTGG TGTGTGCGGA ATTCTTACAC CTTGGAACTT TCCCTCTGCA ATGATAACAA GAAAACTCGG AGCTGCAATT GCTGTAGGAT GTACTAGTGT GATCAAGCCA GCTTCTGAAA CCCCATTGTC TGCTTTGCTG CTTGCCTACT TAGCCCATGA AGCCGGCTTT CCTCCAGGTG TAGTGAATGT CTTGCCTACT CTGGAAACTT CAATGGTAGG AAAATATATC TGTGAGCATC CTTTGATCAA AAAGGTCTCC TTTACAGGTT CCACGAATGT CGGTAAGCTC TTGATGAATC AGCTGTCGTC GACACTTAAG AAATTGACAT TTGAGTTGGG AGGAAACGCG CCATTTATTG TCTTTGAAGG CAGCGACATA GACAAGGCAG TGGATGGCGC AATCAAGGCT AAGTTCAGAT CCAGTGGACA AACCTGCGTT TGTGCCAATC GTATCTACGT CCACGAATCG ATATATGACG ATTTCGCAGC CAAATTTGTA GAAAAGGTTC AGCAAGAAAC CATTCTTGGC AATGGCTTGG ATGAAAATGT AACTCATGGT CCCGTTATCC ACGACAGATC TCTTGCCAAA GTCGAGCACC ACGTAACAGA TGCATTGGAT AAAGGTGCAA AGCTTCTTCT TGGAGGGAAA GCCCGCCCTG ACATAGGTGA CTACTTTCAC GAGTTAACTA TCTTGGGAGA TGTCACTGAA GATATGGCCA TTGCTTCTGA AGAGACATTT GGACCAGTCG CTCCATTGTT CAAGTTTAAG ACAGAACAGG AAGTTCTTGA ACGTGCAAAT AGTGCCGATG TAGGATTAGC AGGCTACTTC TACTCACCAG ACATAAGCCA AGTCTTCAGA GTCGCAGAAG AACTTGAAGT GGGAATGATT GGTGTCAACA CTGGTTCAAT TTCAGAGGCA GCCTTGCCAT TTGGAGGTGT TAAGGAGTCC GGTTTCGGTA GAGAAGGTTC CAAATACGGC TTGAGCGACT ACTTAGTCGT CAAGAGTGTA GTTGTAGGAG TTTGA
|
Protein sequence | MLKVFTRSAS SKQVLTQFKN KELLKFKGYV NGKWLESNDK ATFKVYNPAQ KDGQSDEIAE VYSFSPEEYG TAIEAAQTSF NSFKKTTGRY RSDLLLKLYD LMLKYEDDLA TLIVLENGKP YADALGEVRY AASFFQWFAE AAPHVTGDVI QSANASARIL TMKQPIGVCG ILTPWNFPSA MITRKLGAAI AVGCTSVIKP ASETPLSALS LAYLAHEAGF PPGVVNVLPT SETSMVGKYI CEHPLIKKVS FTGSTNVGKL LMNQSSSTLK KLTFELGGNA PFIVFEGSDI DKAVDGAIKA KFRSSGQTCV CANRIYVHES IYDDFAAKFV EKVQQETILG NGLDENVTHG PVIHDRSLAK VEHHVTDALD KGAKLLLGGK ARPDIGDYFH ELTILGDVTE DMAIASEETF GPVAPLFKFK TEQEVLERAN SADVGLAGYF YSPDISQVFR VAEELEVGMI GVNTGSISEA ALPFGGVKES GFGREGSKYG LSDYLVVKSV VVGV
|
| |