Gene PICST_31688 details


Gene Information

Locus tag: PICST_31688
Symbol: NUO51
ID: 4838517
Type: CDS
Is gene spliced: No
Is pseudogene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009044
Strand:
Start (bp): 1438831
End (bp): 1440285
Gene length: 1455 bp
Protein length: 484 aa
Translation table: 12
GC content: 47%
IMG OID: 640389832
Product: NADH-ubiquinone oxidoreductase 51 kDa subunit, mitochondrial precursor (Complex I-51KD) (CI-51KD)
Protein accession: XP_001384234
Protein GI: 126135420
COG category: [C] Energy production and conversion
COG ID: [COG1894] NADH:ubiquinone oxidoreductase, NADH-binding (51 kD) subunit
TIGRFAM ID: [TIGR01959] NADH-quinone oxidoreductase, F subunit
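
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, not part of the original record; it assumes that the start and end coordinates are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. The variable names are illustrative.

```python
# Values copied from the Gene Information fields above.
start_bp = 1438831
end_bp = 1440285
gene_length_bp = 1455
protein_length_aa = 484

# Assumption: coordinates are 1-based and inclusive, so both ends count.
span = end_bp - start_bp + 1

# Assumption: the gene is an unspliced CDS whose last codon is a stop
# codon, which does not contribute an amino acid to the protein.
codons = span // 3
expected_protein_length = codons - 1

print("span matches gene length:", span == gene_length_bp)
print("span is a whole number of codons:", span % 3 == 0)
print("protein length matches:", expected_protein_length == protein_length_aa)
```

Under these assumptions all three checks agree with the record: 1455 bp is 485 codons, and 485 codons minus one stop codon gives 484 amino acids.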


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.472796
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value:
Fosmid hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTTCGTT TCAAGAGTCA AACGACTTTC AAGCGTGGGT TGGCTACGAT TGCAGATGCG 
GCTGCCAATC CGAACCGTGT CCACGGAGGC TTGAAAGATA CCGACAGAAT CTTTCAGAAC
ATCTACGGCA AGTATGGCCA CGACTTGAAG TCTTCAATGA AGATGGGGGA CTGGCACAAG
ACCAAGGAAA TTATTCTCAA AGGCGACAAG TGGATCATCG ACGAGATGAA GAAGTCTGGA
TTGAGAGGAA GAGGTGGAGC TGGTTTTCCG TCTGGACTTA AGTGGTCTTT CATGAATCCT
CCAGGCTGGG AAAAGAACGT AGGACCCAGA TACTTAGTGG TGAACGCCGA TGAAGGTGAG
CCAGGTACCT GTAAGGATCG TGAGATCATC CGTAAGGACC CCCACAAGTT GGTAGAGGGC
TGTTTGTTGG CAGGAAGAGG TATGAATGCT ACTGCTGCTT ATATCTATAT CAGAGGAGAG
TTCTACAACG AAGCTGTTAT ATTGCAAAAT GCCATTAATG AAGCCTACAA GGCCGGATTT
CTCGGTAAAA ATGCTTGTGG CTCCGGTTAC GACTTTGACA TCTACATCCA TCGTGGAATG
GGAGCCTATG TTTGTGGTGA AGAAACTGCT TTGATTGAAT CTATTGAAGG TAAGGCAGGT
AAACCTAGAT TGAAGCCTCC TTTCCCTGCT GGAGTCGGTT TGTTCGGCAG ACCCACAACT
GTAGCCAATG TCGAGACGGT TTCCGTTGCC CCAACCATCT TGAGAAGAGG CGGAGACTGG
TTTGCTTCAT TTGGTAGAGA AAGAAACCAG GGTGTGAAAT TGTTCTGTAT TTCTGGACAT
GTCAACGAAC CATGTACCGT TGAGGAAGAG ATGTCGATTC CATTGAAGGA ACTCTTAGAA
AAGCACTGTG GTGGTGTTAA AGGAGGCTGG GACAACTTAT TGGGTGTTAT CCCCGGTGGT
TCTTCCGTGC CTATCATGAC CAAGGAAACT TGTGACAACA TTTTAATGGA CTACGATGCC
CTCAGAGACG TTGGCTCTGG TTTGGGTACG GCTGCTGTCA TTGTCATGAA CAAGCAGACG
GATATCATTA GAGGTATCCA GAGATTCTCT CACTTCTACA AGCATGAGTC GTGTGGTCAA
TGTACGCCAT GCCGTGAAGG TACCACTTGG TTGCAGAGAA TGATGGACAG ATTCCAAGAA
GGTCAAGCTA CCGAAAAAGA GATCGACATG ATCTTCGAGT TGACCAAAGA GATCGAAGGC
CATACTATCT GTGCCTTGGG TGATGCTGCT GCCTGGCCCA TCCAGGGATT GATTAAGTCA
TTCAGACCAG TCATGGTAGA CAGAATCAAC GAGTTCAAGA AGAAGAATGA GACCATTGGC
TACGGTGGTT GGATCGACGG AGGTAAGGTG AAAGAGGGTG TTGTAATCGA CAACCCAGTT
CCACACTCCC ATTAA
 
Protein sequence
MLRFKSQTTF KRGLATIADA AANPNRVHGG LKDTDRIFQN IYGKYGHDLK SSMKMGDWHK 
TKEIILKGDK WIIDEMKKSG LRGRGGAGFP SGLKWSFMNP PGWEKNVGPR YLVVNADEGE
PGTCKDREII RKDPHKLVEG CLLAGRGMNA TAAYIYIRGE FYNEAVILQN AINEAYKAGF
LGKNACGSGY DFDIYIHRGM GAYVCGEETA LIESIEGKAG KPRLKPPFPA GVGLFGRPTT
VANVETVSVA PTILRRGGDW FASFGRERNQ GVKLFCISGH VNEPCTVEEE MSIPLKELLE
KHCGGVKGGW DNLLGVIPGG SSVPIMTKET CDNILMDYDA LRDVGSGLGT AAVIVMNKQT
DIIRGIQRFS HFYKHESCGQ CTPCREGTTW LQRMMDRFQE GQATEKEIDM IFELTKEIEG
HTICALGDAA AWPIQGLIKS FRPVMVDRIN EFKKKNETIG YGGWIDGGKV KEGVVIDNPV
PHSH
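
Both sequences above are printed in space-separated blocks of ten characters, so they need to be cleaned before any further processing. The sketch below is one possible way to do that and to compute a GC fraction for a nucleotide sequence; the helper names `clean_sequence` and `gc_fraction` are hypothetical and not part of the record. The example input is the first line of the gene sequence only, so its GC fraction will not necessarily match the whole-gene figure of 47%.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence above, used as a small example input.
first_line = "ATGCTTCGTT TCAAGAGTCA AACGACTTTC AAGCGTGGGT TGGCTACGAT TGCAGATGCG"
seq = clean_sequence(first_line)

print(len(seq))                    # number of bases after cleaning
print(round(gc_fraction(seq), 2))  # GC fraction of this line only
```

Pasting the full gene sequence into `clean_sequence` and comparing `len(...)` with the reported gene length is a quick way to confirm that no blocks were lost in copying.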