Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_31442 |
Symbol | |
ID | 4838549 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009044 |
Strand | - |
Start bp | 816041 |
End bp | 817294 |
Gene Length | 1254 bp |
Protein Length | 417 aa |
Translation table | 12 |
GC content | 38% |
IMG OID | 640389864 |
Product | predicted protein |
Protein accession | XP_001384456 |
Protein GI | 150865301 |
COG category | [C] Energy production and conversion |
COG ID | [COG1252] NADH dehydrogenase, FAD-containing subunit |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.971248 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCCCCA TCTCAGAAAT CACTGCCTTA ACTTTGAAGA CGAATATCCT TATCGTTGGA GGAGCTTATG CAGGATTAGC AGCCATTAAC CAATTTCGGA AGAGACTTGA TTCTAATTTC AGTGATAGCA ATGATAAAAA GATTTCTTTA ACTTTGGTTG AACCCAGGGC AGGGTTTTTG AACATTCTTG GAATACCAAA ATGTATTGTT AATCCTGAAT TTGCTCGGAA CCAGTACATT ACCTTTGAGA GGTTTCCATA TCTCCAATTT GACAAAGTAT ACTCAATCGA CAAAAAGATG CAACAGCAAT TGCAAGAGGA AAGAACTAAA CCACAGCCTT TTGAATTGGA CTTCATCCAC GGTAAAATTA ATTATTTGGA CGAGAAGTCT GCCACTTTTA CCTTGACAGG GGGTAAAGAA AAGTCGAAAA TCGACTTTGA CTATGTAATT TTGGCCAGTG GAAGGCTGCG TCAATGGCCC TCTACTCCGA ATGCCTTCAA TATTGAATAC TTCATGAAAG AAATGAATGA TACCCATAAG AAGATTTCTG AAAGCAACAC AATTTCCATT ATCGGGGCTG GTGCAGTTGG AATTGAGTTG GCTGGAGAAA TCAAGGCCGA ATTCCCTGAG AAGAGCGTCA ATTTAATCCA TCCTCATCCG TCTTTCCCAC CTGAACCTCT TTCAGAAGAA TTTCAGGACA AAGTTAAAAA AGGTTTAGAG GATGCTGGCG TGAATTTACT TTTGAACTCG AGAATTGACC GAGAGTTCGG AAATGGAAAT TTACAAACTA CAGATGGTGA ATTTATAGAA TCTGATTTGA ATTATTGGTG TACATCACAT AAGAATAATA TCGATTTTCT TTCAGAAGAA ATTTGTTCAT TCTTGACAGC CAAAAAGGAT CTCGCCGTTA ATGAATATTT ACAAGTAGCA GATACCGACA TTGTTTTACC TAATGTATTT GCTACTGGAG ATTTGGTAGA CTTGGATGTC ATCAAATCTG CTGGCTGGGC TCTCCATATG GGACCAATAG CTGCTGATAA CATTATTAAT TTGATAATGG GTGAAGAACC AAATTCAAAA TTGCCCGATG TTTCACTTTG GGAAAAGAAC ATTGCGCTAG CTGTTGGAAA TGGTGAAATT ATCTCCGGGA ATGGCAATAC AGTTGAGATC AACAATTCCG ATTATGTCGA AATCTACAAG GATTATGGAT TGAACAGATG CTTAGAAATG CTTGAAGCGG AACTTAGAGA ATAA
|
Protein sequence | MTPISEITAL TLKTNILIVG GAYAGLAAIN QFRKRLDSNF SDSNDKKISL TLVEPRAGFL NILGIPKCIV NPEFARNQYI TFERFPYLQF DKVYSIDKKM QQQLQEERTK PQPFELDFIH GKINYLDEKS ATFTLTGGKE KSKIDFDYVI LASGRSRQWP STPNAFNIEY FMKEMNDTHK KISESNTISI IGAGAVGIEL AGEIKAEFPE KSVNLIHPHP SFPPEPLSEE FQDKVKKGLE DAGVNLLLNS RIDREFGNGN LQTTDGEFIE SDLNYWCTSH KNNIDFLSEE ICSFLTAKKD LAVNEYLQVA DTDIVLPNVF ATGDLVDLDV IKSAGWALHM GPIAADNIIN LIMGEEPNSK LPDVSLWEKN IALAVGNGEI ISGNGNTVEI NNSDYVEIYK DYGLNRCLEM LEAELRE
|
| |