Gene PICST_85822 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_85822 
SymbolNUO78 
ID4851514 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009068 
Strand
Start bp2028032 
End bp2030820 
Gene Length2789 bp 
Protein Length722 aa 
Translation table 
GC content44% 
IMG OID640393222 
ProductNADH dehydrogenase (ubiquinone) 78K chain precursor, 5-prime end 
Protein accessionXP_001387625 
Protein GI126274713 
COG category[C] Energy production and conversion 
COG ID[COG1034] NADH dehydrogenase/NADH:ubiquinone oxidoreductase 75 kD subunit (chain G) 
TIGRFAM ID[TIGR01973] NADH-quinone oxidoreductase, chain G 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.28905 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0501454 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
CTCAACAAGT GGAAACGCTA CAAATTCGGT TCTCTTACGT CGTGTGTGCA GTGGCAGAGT 
TTGAATCCTT CATTTCTTTC GATTTTGCTA CCAGATTTTA ACGTATCTGA AAGTGAATTG
GATAGTAGAG TCTTTTCGTA TCACAGATTT GTATCACCAT AGTCATCTAA TAGTCTTTCG
TTGGCTGGAT TTTGACCGAT ATCAACAACA AGTTTGATAC AGCATTATTA AAGCATTATT
GTCATCAGTA TTTGCTTTTG TCAATTTTCA AAACTCATAC CTGAAAGGTT CTGAATCTCG
AACATAATCT TATTGATTAA AATTCATATT TGAAATCATA ACTTCAACTG ATAGCTGGAA
ATTTCATTAC TATAAATCAC AATATAAAAT TTCATAAGAT GCATTCCGTC AGAAACAACC
TCTTGAGATC GTCCAGACGT TACCTTTCGG CCTCGATGAG AAGAGCTGCT GAGGTAGAAG
TCACCGTTGA CGGCAGAAAA GTGCTGATTG AAGCTGGCTC GTCGATTATC CAGGCTGCTG
AGTTGGCTGG TGTAACTATT CCTCGTTACT GTTACCACGA GAAATTGGCC GTTGCTGGTA
ACTGTCGTAT GTGTCTTGTA GATGTTGAGA GAATGCCAAA GTTAATTGCC TCGTGTGCCA
TGCCTGTTCA GAACGGCATG GTGGTCCATA CAGACTCTGA AAGAATCAAG AAGGCCAGAG
AAGGTGTCAC CGAGATGTTG TTGGAAAATC ATCCTTTGGA TTGTCCTGTG TGTGACCAGG
GTGGAGAATG TGATTTACAA GAACAATCAC AGAGATATGG ATCAGACAGA GGCAGATTCA
AGGAGTTGGT AGGTAAGAGG GCCGTAGAAA ACAAAGCCAT TGGTCCCTTA GTCAAAACTT
CGATGAATAG ATGTATCCAT TGTACCAGAT GTGTCCGTTT CATGAACGAT GTCGCTGGTG
CTCCAGAGTT TGGAACAGCT GGTAGAGGAA ACGACATGCA AATCGGAACC TACATAGAGA
GAAACATCAA CTCCGAGATG TCGGGTAACA TTATCGACTT GTGTCCTGTT GGTGCCTTAA
CTTCCAAGCC ATACGCTTTC AGAGCCAGAC CTTGGGAGTT GAAGAGAACT GAAACCATCG
ATGTTTTCGA CGCTGTAGGT TCAAACATCA GAGTAGATGC CAGAGGTATT GAGGTAATGA
GAGTTTTGCC CAGATTGAAC GACGAAGTCA ACGAAGAGTG GATCTCTGAC AAGTCCAGAT
TTGCATGCGA CGGTTTGAAG ACCCAACGTT TGACTTCTCC TTTGATCAGA AATGGCGACA
AGTTTGAAGT CGGCACCTGG GACGAAGCCT TATCTACTAT CGCTGCCGCA TACGCCAAGA
TTGCACCAAA GGGCGATGAG TTGAAAGCTA TTGCTGGTGC TTTGACTGAT GCCGAGTCCA
TGGTGGCACT CAAGGACTTG GTCAACAAGT TGGGATCAGA AAACACGACA ACTGATGTCA
AACAGGCTGT AGATGCTCAT GGCGTGGATA TCAGATCCAA CTACATCTTT AACTCGACCA
TTGATGGCAT TGAAGATGCA GACCAGATTT TGTTGGTTGG TACCAATCCA AGACACGAAG
CTGCTGTGCT CAACACCAGA CTTAGAAAAG TGTGGTTAAG ACAAGAATTG GACATTGCCT
CTGTAGGCCA GGAGTTCGAT TCGACCTTCA AGTTGCAACA CTTGGGTGTA GACGCCAACG
CTTTAAAGCA GGCCCTTGCT GGAGATGTAG GAAAGAAGTT GTCTTCTGCT AAGAAGCCTT
TAATCATCGT AGGTTCTGGT GTCGCAGACT CTGAAGACGC TTCTGCTATC TACAAGTTGG
TAGGTGAATT CGCATCCAAG AACACCAACT TCAACTCTGC TGAGTGGAAT GGTGTTAACT
TGTTGCATCG TGAGGCTTCT AGAGTTGCTG CCTTAGACAT TGGCTTCCAG ACATTATCAC
CTGAAACGGC TAAGACAAAG CCTAAGTTCG TCTACTTATT GGGAGCAGAC GAAATCTCCA
ACAAGGATAT CCCCAAGGAC GCTTTTGTTG TCTACCAGGG CCATCACGGT GACTTGGGAG
CCTCTTTTGC TGATGTAATC TTACCTGGCT CTGCTTACAC TGAGAAATCT GCCACCTACG
TGAATACTGA AGGTAGAACC CAGGTTACAC GTGCTGCTAC CAACCCTCCT GGTGTTGCCC
GTGAAGACTG GAAGATTGTC AGAGCCTTGT CCGAATACTT GGATGCCACT TTGCCCTACG
ACGACATTGT CTCTGTAAGA ATCAGATTGG GTGAAATTGC ACCTCATTTG GTGAGACATG
ATGTCATTGA GCCAGCTTCC AGTGACATTG CTAAGATCGG TTTCGCTGCC TTAGTCTCAA
AGAACAAGAG TGCCACCATC TCTGGAACAC CTTTGAAGAA CCCAATCGAC AACTTCTACT
TCACTGATGT CATCTCCAGA TCTTCGCCTA CTATGGCCAG ATGTATTTCT TCTTTTGGTG
CCAAGGTTGA CAAGATTACC GACGACAAAC CCGACATCAA CTTCTAAACG CCATCCACTT
CCAACCATCA ACCCTAAACA CACATTCCTG TTTTTATTGT TCTTGCATAT TCATATTCAT
CGTGTTCAGC GGAACATGCT CATGACGTCT TTGACGACAC CAAAACCACC TTCTAACGTT
GAACGCTGAA AAGTTGTTAA ACTATTTATT CATTGAAGAT GTTCTTTTTC CCTATTTGTT
TCATAGGCAT ACATATACAT AATATTAGA
 
Protein sequence
MHSVRNNLLR SSRRYLSASM RRAAEVEVTV DGRKVLIEAG SSIIQAAELA GVTIPRYCYH 
EKLAVAGNCR MCLVDVERMP KLIASCAMPV QNGMVVHTDS ERIKKAREGV TEMLLENHPL
DCPVCDQGGE CDLQEQSQRY GSDRGRFKEL VGKRAVENKA IGPLVKTSMN RCIHCTRCVR
FMNDVAGAPE FGTAGRGNDM QIGTYIERNI NSEMSGNIID LCPVGALTSK PYAFRARPWE
LKRTETIDVF DAVGSNIRVD ARGIEVMRVL PRLNDEVNEE WISDKSRFAC DGLKTQRLTS
PLIRNGDKFE VGTWDEALST IAAAYAKIAP KGDELKAIAG ALTDAESMVA LKDLVNKLGS
ENTTTDVKQA VDAHGVDIRS NYIFNSTIDG IEDADQILLV GTNPRHEAAV LNTRLRKVWL
RQELDIASVG QEFDSTFKLQ HLGVDANALK QALAGDVGKK LSSAKKPLII VGSGVADSED
ASAIYKLVGE FASKNTNFNS AEWNGVNLLH REASRVAALD IGFQTLSPET AKTKPKFVYL
LGADEISNKD IPKDAFVVYQ GHHGDLGASF ADVILPGSAY TEKSATYVNT EGRTQVTRAA
TNPPGVARED WKIVRALSEY LDATLPYDDI VSVRIRLGEI APHLVRHDVI EPASSDIAKI
GFAALVSKNK SATISGTPLK NPIDNFYFTD VISRSSPTMA RCISSFGAKV DKITDDKPDI
NF