Gene Information |
Locus tag | PICST_75518 |
Symbol | NUO20 |
ID | 4851784 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009068 |
Strand | + |
Start bp | 2819996 |
End bp | 2821027 |
Gene Length | 1032 bp |
Protein Length | 207 aa |
Translation table | |
GC content | 43% |
IMG OID | 640393492 |
Product | NADH-ubiquinone oxidoreductase |
Protein accession | XP_001386878 |
Protein GI | 126275592 |
COG category | [C] Energy production and conversion |
COG ID | [COG0377] NADH:ubiquinone oxidoreductase 20 kD subunit and related Fe-S oxidoreductases |
TIGRFAM ID | [TIGR01957] NADH-quinone oxidoreductase, B subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.228639 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATTCGGGCCG TTTTTTTCTA AACTCTTCTC CACAAACATC CACAACATGT TCTCGTTGTC CAGAAGAGCC TTCGTGGCCG CAAAGCCCGT GTCTAGCTCC GTAGCAAGGG CCGTAGCTGT TCGTTACATA AGCTCCAAAG ACTTGACTCC CAAGGCTTTA CCTACAGACT TCCCATTAAT GTCAGAAAAA ACTGCCTCCA GTCCCATCGA CTACGCCTTG ACGTCGTTGG ACTCGATTGC TAACTGGGCC AGAAAGTCGT CATTCTGGCC AGTGACTTTC GGTTTGGCTT GTTGTGCTGT GGAAATGATG CACGTCTCCA CTCCCAGATA CGATCAGGAT AGATTGGGTA TTATTTTCAG AGCATCGCCA CGTCAGTCCG ATATCATGAT TGTAGCCGGA ACTGTCACCA ATAAGATGGC CCCAGCCTTG AGACAAGTGT ACGATCAAAT GCCAGACCCA AGATGGGTCA TCTCCATGGG CTCGTGTGCC AATGGTGGAG GCTACTACCA CTACTCATAT TCCGTTGTCA GAGGCTGTGA TAGAATCGTC CCTGTGGACA TTTACGTTCC AGGTTGTCCT CCTACTGCTG AAGCATTGAT GTACGGTGTT TTCCAGTTAC AGAAGAAGAT GATGAAGACC AGAATCACCA GATTGTGGTA CAGATCGTAG AGAAGAGATT GTATGAAGAA TGAGGATTAT GTGAATGACT AAAATTAAAA ATTAGATTGC TAAATTGTAT ACATTTTGGC AATCGATTTA GTGGATATTG GATGTTGTTC TGATCATTAA CGATAACTTT CATATCCATA GTCATGTTGT TGACATTTAA CATATTCGCG AATATTGTTA TCCATGCGAT ATTCTGATTC TCATTTATAT TCCTATTAGT CACTACCCTA CACTCATAGA TATCATAGTT CACTGTCAAG GTGAGCAACA GTTTCACTTC CGATAGTAGC CGTGTGCAAC TATCACCATT TTCCTTGCAT TGTATTCTAT ACACTTTTTA TACGTTTAAT ATATTCATAG TC
|
Protein sequence | MFSLSRRAFV AAKPVSSSVA RAVAVRYISS KDLTPKALPT DFPLMSEKTA SSPIDYALTS LDSIANWARK SSFWPVTFGL ACCAVEMMHV STPRYDQDRL GIIFRASPRQ SDIMIVAGTV TNKMAPALRQ VYDQMPDPRW VISMGSCANG GGYYHYSYSV VRGCDRIVPV DIYVPGCPPT AEALMYGVFQ LQKKMMKTRI TRLWYRS
|
| |
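The coordinate, length, and sequence fields above can be cross-checked against each other. The following is a minimal sketch, not part of the original record: the helper names (`span_length`, `cds_length`, `translate_from_first_atg`) are hypothetical, and only the first 80 nt of the gene sequence are embedded. It assumes NCBI translation table 12 (alternative yeast nuclear code, CTG read as Ser), since the record's "Translation table" field is blank; the excerpt contains no CTG codon, so the standard code would give the same result here.

```python
from itertools import product

# Values copied from the record above.
START_BP, END_BP = 2819996, 2821027
PROTEIN_LENGTH_AA = 207
# First 80 nt of the "Gene sequence" field (spaces removed).
GENE_5PRIME = (
    "ATTCGGGCCGTTTTTTTCTAAACTCTTCTCCACAAACATC"
    "CACAACATGTTCTCGTTGTCCAGAAGAGCCTTCGTGGCCG"
)

# ASSUMPTION: NCBI translation table 12 (CTG -> Ser); the record leaves
# the translation table blank. Codons are ordered TTT, TTC, TTA, TTG, TCT, ...
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLSPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def span_length(start: int, end: int) -> int:
    """Length of an inclusive, 1-based coordinate span."""
    return end - start + 1


def cds_length(protein_length: int) -> int:
    """Coding length in bp: one codon per residue plus the stop codon."""
    return (protein_length + 1) * 3


def translate_from_first_atg(dna: str) -> str:
    """Translate from the first ATG until a stop codon or the end of input."""
    dna = "".join(dna.split()).upper()
    start = dna.find("ATG")
    if start < 0:
        return ""
    residues = []
    for i in range(start, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


if __name__ == "__main__":
    print(span_length(START_BP, END_BP))          # gene span in bp
    print(cds_length(PROTEIN_LENGTH_AA))          # coding region in bp
    print(translate_from_first_atg(GENE_5PRIME))  # N-terminal residues
```

Under these assumptions the inclusive span equals the listed gene length of 1032 bp, while a 207 aa protein needs only 624 bp of coding sequence including the stop codon, so the listed gene sequence extends beyond the coding region. The translated excerpt can be compared with the start of the "Protein sequence" field.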