Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_85822 |
Symbol | NUO78 |
ID | 4851514 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009068 |
Strand | + |
Start bp | 2028032 |
End bp | 2030820 |
Gene Length | 2789 bp |
Protein Length | 722 aa |
Translation table | |
GC content | 44% |
IMG OID | 640393222 |
Product | NADH dehydrogenase (ubiquinone) 78K chain precursor, 5-prime end |
Protein accession | XP_001387625 |
Protein GI | 126274713 |
COG category | [C] Energy production and conversion |
COG ID | [COG1034] NADH dehydrogenase/NADH:ubiquinone oxidoreductase 75 kD subunit (chain G) |
TIGRFAM ID | [TIGR01973] NADH-quinone oxidoreductase, chain G |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.28905 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.0501454 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | CTCAACAAGT GGAAACGCTA CAAATTCGGT TCTCTTACGT CGTGTGTGCA GTGGCAGAGT TTGAATCCTT CATTTCTTTC GATTTTGCTA CCAGATTTTA ACGTATCTGA AAGTGAATTG GATAGTAGAG TCTTTTCGTA TCACAGATTT GTATCACCAT AGTCATCTAA TAGTCTTTCG TTGGCTGGAT TTTGACCGAT ATCAACAACA AGTTTGATAC AGCATTATTA AAGCATTATT GTCATCAGTA TTTGCTTTTG TCAATTTTCA AAACTCATAC CTGAAAGGTT CTGAATCTCG AACATAATCT TATTGATTAA AATTCATATT TGAAATCATA ACTTCAACTG ATAGCTGGAA ATTTCATTAC TATAAATCAC AATATAAAAT TTCATAAGAT GCATTCCGTC AGAAACAACC TCTTGAGATC GTCCAGACGT TACCTTTCGG CCTCGATGAG AAGAGCTGCT GAGGTAGAAG TCACCGTTGA CGGCAGAAAA GTGCTGATTG AAGCTGGCTC GTCGATTATC CAGGCTGCTG AGTTGGCTGG TGTAACTATT CCTCGTTACT GTTACCACGA GAAATTGGCC GTTGCTGGTA ACTGTCGTAT GTGTCTTGTA GATGTTGAGA GAATGCCAAA GTTAATTGCC TCGTGTGCCA TGCCTGTTCA GAACGGCATG GTGGTCCATA CAGACTCTGA AAGAATCAAG AAGGCCAGAG AAGGTGTCAC CGAGATGTTG TTGGAAAATC ATCCTTTGGA TTGTCCTGTG TGTGACCAGG GTGGAGAATG TGATTTACAA GAACAATCAC AGAGATATGG ATCAGACAGA GGCAGATTCA AGGAGTTGGT AGGTAAGAGG GCCGTAGAAA ACAAAGCCAT TGGTCCCTTA GTCAAAACTT CGATGAATAG ATGTATCCAT TGTACCAGAT GTGTCCGTTT CATGAACGAT GTCGCTGGTG CTCCAGAGTT TGGAACAGCT GGTAGAGGAA ACGACATGCA AATCGGAACC TACATAGAGA GAAACATCAA CTCCGAGATG TCGGGTAACA TTATCGACTT GTGTCCTGTT GGTGCCTTAA CTTCCAAGCC ATACGCTTTC AGAGCCAGAC CTTGGGAGTT GAAGAGAACT GAAACCATCG ATGTTTTCGA CGCTGTAGGT TCAAACATCA GAGTAGATGC CAGAGGTATT GAGGTAATGA GAGTTTTGCC CAGATTGAAC GACGAAGTCA ACGAAGAGTG GATCTCTGAC AAGTCCAGAT TTGCATGCGA CGGTTTGAAG ACCCAACGTT TGACTTCTCC TTTGATCAGA AATGGCGACA AGTTTGAAGT CGGCACCTGG GACGAAGCCT TATCTACTAT CGCTGCCGCA TACGCCAAGA TTGCACCAAA GGGCGATGAG TTGAAAGCTA TTGCTGGTGC TTTGACTGAT GCCGAGTCCA TGGTGGCACT CAAGGACTTG GTCAACAAGT TGGGATCAGA AAACACGACA ACTGATGTCA AACAGGCTGT AGATGCTCAT GGCGTGGATA TCAGATCCAA CTACATCTTT AACTCGACCA TTGATGGCAT TGAAGATGCA GACCAGATTT TGTTGGTTGG TACCAATCCA AGACACGAAG CTGCTGTGCT CAACACCAGA CTTAGAAAAG TGTGGTTAAG ACAAGAATTG GACATTGCCT CTGTAGGCCA GGAGTTCGAT TCGACCTTCA AGTTGCAACA CTTGGGTGTA GACGCCAACG CTTTAAAGCA GGCCCTTGCT GGAGATGTAG GAAAGAAGTT GTCTTCTGCT AAGAAGCCTT TAATCATCGT AGGTTCTGGT GTCGCAGACT CTGAAGACGC TTCTGCTATC TACAAGTTGG TAGGTGAATT CGCATCCAAG AACACCAACT TCAACTCTGC TGAGTGGAAT GGTGTTAACT TGTTGCATCG TGAGGCTTCT AGAGTTGCTG CCTTAGACAT TGGCTTCCAG ACATTATCAC CTGAAACGGC TAAGACAAAG CCTAAGTTCG TCTACTTATT GGGAGCAGAC GAAATCTCCA ACAAGGATAT CCCCAAGGAC GCTTTTGTTG TCTACCAGGG CCATCACGGT GACTTGGGAG CCTCTTTTGC TGATGTAATC TTACCTGGCT CTGCTTACAC TGAGAAATCT GCCACCTACG TGAATACTGA AGGTAGAACC CAGGTTACAC GTGCTGCTAC CAACCCTCCT GGTGTTGCCC GTGAAGACTG GAAGATTGTC AGAGCCTTGT CCGAATACTT GGATGCCACT TTGCCCTACG ACGACATTGT CTCTGTAAGA ATCAGATTGG GTGAAATTGC ACCTCATTTG GTGAGACATG ATGTCATTGA GCCAGCTTCC AGTGACATTG CTAAGATCGG TTTCGCTGCC TTAGTCTCAA AGAACAAGAG TGCCACCATC TCTGGAACAC CTTTGAAGAA CCCAATCGAC AACTTCTACT TCACTGATGT CATCTCCAGA TCTTCGCCTA CTATGGCCAG ATGTATTTCT TCTTTTGGTG CCAAGGTTGA CAAGATTACC GACGACAAAC CCGACATCAA CTTCTAAACG CCATCCACTT CCAACCATCA ACCCTAAACA CACATTCCTG TTTTTATTGT TCTTGCATAT TCATATTCAT CGTGTTCAGC GGAACATGCT CATGACGTCT TTGACGACAC CAAAACCACC TTCTAACGTT GAACGCTGAA AAGTTGTTAA ACTATTTATT CATTGAAGAT GTTCTTTTTC CCTATTTGTT TCATAGGCAT ACATATACAT AATATTAGA
|
Protein sequence | MHSVRNNLLR SSRRYLSASM RRAAEVEVTV DGRKVLIEAG SSIIQAAELA GVTIPRYCYH EKLAVAGNCR MCLVDVERMP KLIASCAMPV QNGMVVHTDS ERIKKAREGV TEMLLENHPL DCPVCDQGGE CDLQEQSQRY GSDRGRFKEL VGKRAVENKA IGPLVKTSMN RCIHCTRCVR FMNDVAGAPE FGTAGRGNDM QIGTYIERNI NSEMSGNIID LCPVGALTSK PYAFRARPWE LKRTETIDVF DAVGSNIRVD ARGIEVMRVL PRLNDEVNEE WISDKSRFAC DGLKTQRLTS PLIRNGDKFE VGTWDEALST IAAAYAKIAP KGDELKAIAG ALTDAESMVA LKDLVNKLGS ENTTTDVKQA VDAHGVDIRS NYIFNSTIDG IEDADQILLV GTNPRHEAAV LNTRLRKVWL RQELDIASVG QEFDSTFKLQ HLGVDANALK QALAGDVGKK LSSAKKPLII VGSGVADSED ASAIYKLVGE FASKNTNFNS AEWNGVNLLH REASRVAALD IGFQTLSPET AKTKPKFVYL LGADEISNKD IPKDAFVVYQ GHHGDLGASF ADVILPGSAY TEKSATYVNT EGRTQVTRAA TNPPGVARED WKIVRALSEY LDATLPYDDI VSVRIRLGEI APHLVRHDVI EPASSDIAKI GFAALVSKNK SATISGTPLK NPIDNFYFTD VISRSSPTMA RCISSFGAKV DKITDDKPDI NF
|
| |