Gene Information |
Locus tag | PICST_84956 |
Symbol | NAF2.2 |
ID | 4840179 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009046 |
Strand | + |
Start bp | 1106780 |
End bp | 1109900 |
Gene Length | 3121 bp |
Protein Length | 821 aa |
Translation table | 12 |
GC content | 44% |
IMG OID | 640391494 |
Product | Zn_clus Fungal Zn(2)-Cys(6) binuclear cluster domain |
Protein accession | XP_001385566 |
Protein GI | 150866088 |
COG category | |
COG ID | |
TIGRFAM ID | |
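
The coordinate fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, which is an inference from the record (it is the convention under which the stated gene length of 3121 bp comes out exactly), not something the record states. The function names are hypothetical. It also compares the gene span with the number of bases needed to encode the 821 aa protein plus one stop codon; the remainder is sequence inside the gene span that is not translated, which is consistent with the gene being marked as spliced.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def coding_bases(protein_length_aa: int) -> int:
    """Bases needed to encode the protein plus a single stop codon."""
    return (protein_length_aa + 1) * 3


# Values copied from the record above.
START_BP = 1106780
END_BP = 1109900
PROTEIN_LENGTH_AA = 821

span = gene_length(START_BP, END_BP)
coding = coding_bases(PROTEIN_LENGTH_AA)
untranslated = span - coding

print(f"gene span:            {span} bp")
print(f"coding (incl. stop):  {coding} bp")
print(f"not translated:       {untranslated} bp")
```

The difference between the span and the coding bases does not say where the untranslated sequence lies (introns versus flanking sequence); the record gives no exon coordinates, so that breakdown is left open here.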
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | CAACCTGGTC TATCTGGCCG GCTCTATTCC TGTATTCATG CTCTTTATTG CTCTAGTCTA AACCATTTTG TCCTTTTCAG ATTTCGTGAT AATTTGAATG AGGTGTAAAA ATAATTTATC TTGCTTCTGA CATCTCCAAT TTGCATTTCT TAGCAATAGT CAAGTCTTCG TCTCACAAAT TTCCCGCCTA CAGTTTAGTG CCTGCGTTGA CAGACTTGGT CCGAAAAAAT TATTACTTCA TAAATTCCGG TTTAAATAGT CAAAAGAGCC GCTGCCCGCT TTCTTTACAT TCTAATACTA GGGGCACAGA GCTCAATTCT CTAATTCTTT TCCACAAATA GCGATGGTCT CTCCGATACT GTCGCAGTCA AGTACGACGC CCCTCGCTCA AGAGGCTTTG TCCAATAGTC CCGATAGCAT CAATAGCGAA GATAGCAACG AAAACCCAGT TCAGCGGAAC GAGCTAGAAA ACGCTTCAGC GAACTCTACT TCCAGCATAA CTCAAAGTAG CCTCGCAGCT GCTGCGGCCG AGAAACATCT CATCCGTAGA CGAAAACACA AAAATTCCAA ACTCGGCTGT CCCAACTGTA AAAAGAGAAG AGTCAAGTGT ACCGAAAACC TTCCGGCATG CTCCAACTGT ATCAAACATA AAGTCAAGTG TGGTTACTTG GACTACACGG AAGAGCAGCT CAATGAACTC CGACAAGCCA AACTAGCCCA GGACTTTGAT GAACTAACTA CTTCGGTGAG CGGCCATGAA CGCGATGCCG ATGGAGCAAG CTCGTCGTAT TCTTCGACTA CGTCCTCCGC CCACGGCTCA ATGTCCGGCG CTGCTTCACA GAACGTCAAG AAATCTAAAC CGAAGACGAA ATCACTTTCG GCTCCTAATA AGGCAACAAC GGTCGTAGTT CCAAAGAAGG CTGCGTCGTT CGCAACGCCT TCCATTCCTC ACTCAGGCAC CGGGTCGGGC GCAGGGTCCA TAAATGCTCC AGAGTCCGTT GTGACAGGCA CTGGATTTGA AGGCAACGAG TTCGAGTACT CGGCCGATTC TCTCAATAGC ATGTTTAATT TCCAGCAGCA GTCCATCACC CAGAACTTCG ACAATTTGTT GAACAACTCC ATTAACGACG AGGTGCCCAT TATCTATCCC GTCTACTCCA TCAATAATAA TAATATTAAC GGCATTAACT TTGGCAGTAA CGACGACTCT AATAACAATA ACGTTAACAA CTTCAACAAT GTCGGTGTCA ACATTAATAA CATGAACAAC ATAAATAACA ACATTAATGG TAATAATAGT TCGTTCCTCA GAGATGGGTA TGCGGACAAC ATGATGATGG TAGATCCTAC GGCCGTTGAC TTCTCTAATT CGGCCTCTTC GTTCACTATG TCAGAGAACT CGATCGACTT CTCAAACCCA GCAGCCTTGA ATCCCGACAG CTATGTAGCA TTTTCCAACG TATCGACTAT ACCTCCATCA CGAGTAACTC CAGCTCCGTA CGCCTCAGCT TTGCCAAATG CAATCAGGTC GAATACTACA TTCACTGTTA TTAATGGAGA GCAAATCGAC TACCAAGAGA AGCTCTTGGA GGTCGTTGGG ATCTTGGGTC CGTCTATCGA TAACGGAACT TGCCCATTGC CGCAGATCCG CCACTTGTAC TACGTGTGGT TGAACTCGTT TATATATAGA TCGTATACAT CCGAGATGAT GTTCAGTTGC TTGATCAATT TAACGACAAA CTACCTCATC ACAAACTGTT TCATCAACTC CGACTCGTAC AAGAAACATT TGCCTCCTCA 
AGGCTCTTCT ACGACTTCGT CTGAATTGCC GTGGGATGAA TCCCACCAAT CGCAGTTTGG CGACAAGCTA TCGGTGCTAG TAGACAAGAC GAAAGCCAAG AACGTAGCCA TAGTCAAGTC TATCAAGCAC TACGCTAGAG TTATCAAGGA CATGCGTTAC TTCTTGAACA AGAATGAGGA CCCAGATTTG TGCTCAAGTG TCTCGTACAT TCTTAGTTTG ATGTCAATCT ACGATCCAGA GGCTACACTT AATAGTTCCA ACTGTTTCAG AGACGGACTT TTCAGCATTT TGTCGTACAA CATCAACTTG ACGTTGAAAC GTAAAGGCGA CATAGGTATA ATCATTACAA CGCATCTCAA GTTGATGAAG AACATCGCCA GATCGGTGTA CTTGCCGGGA TATGATCCAG CTCTTTTAAT TGAATTCCAG TCTGTCTTGA ACAACTTTAG TGAGTTGATC AGACCAGTCA TCAACAGAGT CAAAAATTAC GTACTCAGCA ATAACCTAGC TCCTGTTGAG AAGTTGCGAT TTGTTGAGGA GAAACTTGTT GATTTGATCG ACTTCACCGA CGACTGTATC AACAAGTATA TTCCTGCTAT ATACGACAAT TTTTCAGACA TCGACAAACA GCAGGAGTTG TTGTTTGACA TGATCTACAG GTGGGTCAGA TTCTTTCCTT CGCGGCTAAC AGTGATCACG CCTGCCTCCG ATCCGTTGGA AAAGGTGCTC TATTTGTTCT ACAAGGTGTT GAAGAAGTCA CTCTATGCCA TTTTTCCCCA AGTCAAGTTC TTCTTCTTGC GTGACTTCGA CAGTCCGCTC ATGTTAGATG TCTTTGTAGT GATCAAGGAT GTAGACATAT TCTTCGAATA CTTGGAACAC CCAAAAACGA ACGTGTTGCC TTGGGAGTTG TATGGCCAAA TTTTGCCTGA GTTGAAGAAC ATGTCGTCGT ACTTGATCAG ATTGGTCACG TTTTTGCAGA TTCGTGTTGG TTTGTTGTAC AGGTATGTGG TGTATGAACA AGTAGCAAAG GAAAAGTTCC CTATCAAAGA TTCCCGCGCC TGGAGAGATT CAATCACCGA TATTGAGGGA ACGAGACAAG AGTTCAATAA GGTCATTGGA CTCAAGGAAG TGCCGATTAA GTCGTTTCTT AAGACCTACA TCAAGGTAGA GAACTATCCA AGGCTTCTCC AGAATGGCGA AGATCCATCC ACTCAGGGCC ACGAATGTAT TGAAGCAGAA GTAGATTTCC TGACGCTTCA GCAGAGTGGT CTTTTGAGAG ACGATTTCAA CATCATGGCA GCTATGATGA AAGGTAGTTA G
|
Protein sequence | MVSPISSQSS TTPLAQEALS NSPDSINSED SNENPVQRNE LENASANSTS SITQSSLAAA AAEKHLIRRR KHKNSKLGCP NCKKRRVKCT ENLPACSNCI KHKVKCGYLD YTEEQLNELR QAKLAQDFDE LTTSNVKKSK PKTKSLSAPN KATTVVVPKK AASFATPSIP HSGTGSGAGS INAPESVVTG TGFEGNEFDN DDSNNNNVNN FNNVGVNINN MNNINNNING NNSSFLRDGY ADNMMMVDPT AVDFSNSASS FTMSENSIDF SNPAALNPDS YVAFSNVSTI PPSRVTPAPY ASALPNAIRS NTTFTVINGE QIDYQEKLLE VVGILGPSID NGTCPLPQIR HLYYVWLNSF IYRSYTSEMM FSCLINLTTN YLITNCFINS DSYKKHLPPQ GSSTTSSELP WDESHQSQFG DKLSHYARVI KDMRYFLNKN EDPDLCSSVS YILSLMSIYD PEATLNSSNC FRDGLFSILS YNINLTLKRK GDIGIIITTH LKLMKNIARS VYLPGYDPAL LIEFQSVLNN FSELIRPVIN RVKNYVLSNN LAPVEKLRFV EEKLVDLIDF TDDCINKYIP AIYDNFSDID KQQELLFDMI YRWVRFFPSR LTVITPASDP LEKVLYLFYK VLKKSLYAIF PQVKFFFLRD FDSPLMLDVF VVIKDVDIFF EYLEHPKTNV LPWELYGQIL PELKNMSSYL IRLVTFLQIR VGLLYRYVVY EQVAKEKFPI KDSRAWRDSI TDIEGTRQEF NKVIGLKEVP IKSFLKTYIK VENYPRLLQN GEDPSTQGHE CIEAEVDFST LQQSGLLRDD FNIMAAMMKG S
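
The record lists translation table 12 (the alternative yeast nuclear code), in which the codon CTG is read as serine rather than leucine. A minimal sketch of the effect, using the first ten codons that start at the ATG in the gene sequence above: the helper names `build_table` and `translate` are hypothetical, and only the CTG reassignment is modelled (differences in start-codon usage between the tables are ignored). Because the gene is spliced and no exon coordinates are given, the sketch translates only this short leading stretch, not the whole gene sequence.

```python
from itertools import product

BASES = "TCAG"
# Amino acids of the standard genetic code, in NCBI order (codons TTT, TTC, TTA, TTG, TCT, ...).
STANDARD_AAS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def build_table(aas, overrides=None):
    """Map each of the 64 codons to an amino acid, then apply any reassignments."""
    table = {"".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), aas)}
    table.update(overrides or {})
    return table


STANDARD = build_table(STANDARD_AAS)
# Table 12 differs from the standard code in its sense codons only at CTG (Leu -> Ser).
TABLE_12 = build_table(STANDARD_AAS, {"CTG": "S"})


def translate(dna, table):
    """Translate complete codons of a DNA string; spaces are ignored."""
    dna = dna.replace(" ", "").upper()
    usable = len(dna) - len(dna) % 3
    return "".join(table[dna[i:i + 3]] for i in range(0, usable, 3))


# First 30 nucleotides of the coding sequence, copied from the gene sequence above.
cds_start = "ATGGTCTCTC CGATACTGTC GCAGTCAAGT"

print(translate(cds_start, STANDARD))  # MVSPILSQSS
print(translate(cds_start, TABLE_12))  # MVSPISSQSS
```

Only the table 12 result matches the first ten residues of the protein sequence listed above (MVSPISSQSS); the sixth codon, CTG, is where the two codes diverge.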