Gene PICST_84956 details


Gene Information

Locus tag: PICST_84956
Symbol: NAF2.2
ID: 4840179
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009046
Strand:
Start bp: 1106780
End bp: 1109900
Gene Length: 3121 bp
Protein Length: 821 aa
Translation table: 12
GC content: 44%
IMG OID: 640391494
Product: Zn_clus Fungal Zn(2)-Cys(6) binuclear cluster domain
Protein accession: XP_001385566
Protein GI: 150866088
COG category:
COG ID:
TIGRFAM ID:
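
The Gene Length and GC content fields can be derived from the coordinates and the gene sequence. The Start bp and End bp values above are consistent with 1-based, inclusive coordinates. The following is a minimal Python sketch of that arithmetic; the function names are illustrative and are not part of any database tool.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def gc_percent(sequence: str) -> float:
    """GC percentage of a nucleotide string; whitespace is ignored."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


print(gene_length(1106780, 1109900))  # 3121, matching the Gene Length field
```

Passing the gene sequence from the Sequence section to `gc_percent` (the 10-base grouping spaces and line breaks are stripped by the function) gives a value that can be compared with the GC content field.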


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
CAACCTGGTC TATCTGGCCG GCTCTATTCC TGTATTCATG CTCTTTATTG CTCTAGTCTA 
AACCATTTTG TCCTTTTCAG ATTTCGTGAT AATTTGAATG AGGTGTAAAA ATAATTTATC
TTGCTTCTGA CATCTCCAAT TTGCATTTCT TAGCAATAGT CAAGTCTTCG TCTCACAAAT
TTCCCGCCTA CAGTTTAGTG CCTGCGTTGA CAGACTTGGT CCGAAAAAAT TATTACTTCA
TAAATTCCGG TTTAAATAGT CAAAAGAGCC GCTGCCCGCT TTCTTTACAT TCTAATACTA
GGGGCACAGA GCTCAATTCT CTAATTCTTT TCCACAAATA GCGATGGTCT CTCCGATACT
GTCGCAGTCA AGTACGACGC CCCTCGCTCA AGAGGCTTTG TCCAATAGTC CCGATAGCAT
CAATAGCGAA GATAGCAACG AAAACCCAGT TCAGCGGAAC GAGCTAGAAA ACGCTTCAGC
GAACTCTACT TCCAGCATAA CTCAAAGTAG CCTCGCAGCT GCTGCGGCCG AGAAACATCT
CATCCGTAGA CGAAAACACA AAAATTCCAA ACTCGGCTGT CCCAACTGTA AAAAGAGAAG
AGTCAAGTGT ACCGAAAACC TTCCGGCATG CTCCAACTGT ATCAAACATA AAGTCAAGTG
TGGTTACTTG GACTACACGG AAGAGCAGCT CAATGAACTC CGACAAGCCA AACTAGCCCA
GGACTTTGAT GAACTAACTA CTTCGGTGAG CGGCCATGAA CGCGATGCCG ATGGAGCAAG
CTCGTCGTAT TCTTCGACTA CGTCCTCCGC CCACGGCTCA ATGTCCGGCG CTGCTTCACA
GAACGTCAAG AAATCTAAAC CGAAGACGAA ATCACTTTCG GCTCCTAATA AGGCAACAAC
GGTCGTAGTT CCAAAGAAGG CTGCGTCGTT CGCAACGCCT TCCATTCCTC ACTCAGGCAC
CGGGTCGGGC GCAGGGTCCA TAAATGCTCC AGAGTCCGTT GTGACAGGCA CTGGATTTGA
AGGCAACGAG TTCGAGTACT CGGCCGATTC TCTCAATAGC ATGTTTAATT TCCAGCAGCA
GTCCATCACC CAGAACTTCG ACAATTTGTT GAACAACTCC ATTAACGACG AGGTGCCCAT
TATCTATCCC GTCTACTCCA TCAATAATAA TAATATTAAC GGCATTAACT TTGGCAGTAA
CGACGACTCT AATAACAATA ACGTTAACAA CTTCAACAAT GTCGGTGTCA ACATTAATAA
CATGAACAAC ATAAATAACA ACATTAATGG TAATAATAGT TCGTTCCTCA GAGATGGGTA
TGCGGACAAC ATGATGATGG TAGATCCTAC GGCCGTTGAC TTCTCTAATT CGGCCTCTTC
GTTCACTATG TCAGAGAACT CGATCGACTT CTCAAACCCA GCAGCCTTGA ATCCCGACAG
CTATGTAGCA TTTTCCAACG TATCGACTAT ACCTCCATCA CGAGTAACTC CAGCTCCGTA
CGCCTCAGCT TTGCCAAATG CAATCAGGTC GAATACTACA TTCACTGTTA TTAATGGAGA
GCAAATCGAC TACCAAGAGA AGCTCTTGGA GGTCGTTGGG ATCTTGGGTC CGTCTATCGA
TAACGGAACT TGCCCATTGC CGCAGATCCG CCACTTGTAC TACGTGTGGT TGAACTCGTT
TATATATAGA TCGTATACAT CCGAGATGAT GTTCAGTTGC TTGATCAATT TAACGACAAA
CTACCTCATC ACAAACTGTT TCATCAACTC CGACTCGTAC AAGAAACATT TGCCTCCTCA
AGGCTCTTCT ACGACTTCGT CTGAATTGCC GTGGGATGAA TCCCACCAAT CGCAGTTTGG
CGACAAGCTA TCGGTGCTAG TAGACAAGAC GAAAGCCAAG AACGTAGCCA TAGTCAAGTC
TATCAAGCAC TACGCTAGAG TTATCAAGGA CATGCGTTAC TTCTTGAACA AGAATGAGGA
CCCAGATTTG TGCTCAAGTG TCTCGTACAT TCTTAGTTTG ATGTCAATCT ACGATCCAGA
GGCTACACTT AATAGTTCCA ACTGTTTCAG AGACGGACTT TTCAGCATTT TGTCGTACAA
CATCAACTTG ACGTTGAAAC GTAAAGGCGA CATAGGTATA ATCATTACAA CGCATCTCAA
GTTGATGAAG AACATCGCCA GATCGGTGTA CTTGCCGGGA TATGATCCAG CTCTTTTAAT
TGAATTCCAG TCTGTCTTGA ACAACTTTAG TGAGTTGATC AGACCAGTCA TCAACAGAGT
CAAAAATTAC GTACTCAGCA ATAACCTAGC TCCTGTTGAG AAGTTGCGAT TTGTTGAGGA
GAAACTTGTT GATTTGATCG ACTTCACCGA CGACTGTATC AACAAGTATA TTCCTGCTAT
ATACGACAAT TTTTCAGACA TCGACAAACA GCAGGAGTTG TTGTTTGACA TGATCTACAG
GTGGGTCAGA TTCTTTCCTT CGCGGCTAAC AGTGATCACG CCTGCCTCCG ATCCGTTGGA
AAAGGTGCTC TATTTGTTCT ACAAGGTGTT GAAGAAGTCA CTCTATGCCA TTTTTCCCCA
AGTCAAGTTC TTCTTCTTGC GTGACTTCGA CAGTCCGCTC ATGTTAGATG TCTTTGTAGT
GATCAAGGAT GTAGACATAT TCTTCGAATA CTTGGAACAC CCAAAAACGA ACGTGTTGCC
TTGGGAGTTG TATGGCCAAA TTTTGCCTGA GTTGAAGAAC ATGTCGTCGT ACTTGATCAG
ATTGGTCACG TTTTTGCAGA TTCGTGTTGG TTTGTTGTAC AGGTATGTGG TGTATGAACA
AGTAGCAAAG GAAAAGTTCC CTATCAAAGA TTCCCGCGCC TGGAGAGATT CAATCACCGA
TATTGAGGGA ACGAGACAAG AGTTCAATAA GGTCATTGGA CTCAAGGAAG TGCCGATTAA
GTCGTTTCTT AAGACCTACA TCAAGGTAGA GAACTATCCA AGGCTTCTCC AGAATGGCGA
AGATCCATCC ACTCAGGGCC ACGAATGTAT TGAAGCAGAA GTAGATTTCC TGACGCTTCA
GCAGAGTGGT CTTTTGAGAG ACGATTTCAA CATCATGGCA GCTATGATGA AAGGTAGTTA
G
 
Protein sequence
MVSPISSQSS TTPLAQEALS NSPDSINSED SNENPVQRNE LENASANSTS SITQSSLAAA 
AAEKHLIRRR KHKNSKLGCP NCKKRRVKCT ENLPACSNCI KHKVKCGYLD YTEEQLNELR
QAKLAQDFDE LTTSNVKKSK PKTKSLSAPN KATTVVVPKK AASFATPSIP HSGTGSGAGS
INAPESVVTG TGFEGNEFDN DDSNNNNVNN FNNVGVNINN MNNINNNING NNSSFLRDGY
ADNMMMVDPT AVDFSNSASS FTMSENSIDF SNPAALNPDS YVAFSNVSTI PPSRVTPAPY
ASALPNAIRS NTTFTVINGE QIDYQEKLLE VVGILGPSID NGTCPLPQIR HLYYVWLNSF
IYRSYTSEMM FSCLINLTTN YLITNCFINS DSYKKHLPPQ GSSTTSSELP WDESHQSQFG
DKLSHYARVI KDMRYFLNKN EDPDLCSSVS YILSLMSIYD PEATLNSSNC FRDGLFSILS
YNINLTLKRK GDIGIIITTH LKLMKNIARS VYLPGYDPAL LIEFQSVLNN FSELIRPVIN
RVKNYVLSNN LAPVEKLRFV EEKLVDLIDF TDDCINKYIP AIYDNFSDID KQQELLFDMI
YRWVRFFPSR LTVITPASDP LEKVLYLFYK VLKKSLYAIF PQVKFFFLRD FDSPLMLDVF
VVIKDVDIFF EYLEHPKTNV LPWELYGQIL PELKNMSSYL IRLVTFLQIR VGLLYRYVVY
EQVAKEKFPI KDSRAWRDSI TDIEGTRQEF NKVIGLKEVP IKSFLKTYIK VENYPRLLQN
GEDPSTQGHE CIEAEVDFST LQQSGLLRDD FNIMAAMMKG S
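
Translation table 12 (the alternative yeast nuclear code) reads the codon CTG as serine rather than leucine. The first ten codons of the coding region in the gene sequence above, ATG GTC TCT CCG ATA CTG TCG CAG TCA AGT, match the start of the listed protein (MVSPISSQSS) only when CTG is read as S. The sketch below illustrates this with a deliberately partial codon map covering just these ten codons; all names are hypothetical. Because the gene is spliced, translating the full gene sequence this way would not be expected to reproduce the full protein.

```python
# Partial standard-code map: only the codons used in this example.
STANDARD_SUBSET = {
    "ATG": "M", "GTC": "V", "TCT": "S", "CCG": "P", "ATA": "I",
    "CTG": "L", "TCG": "S", "CAG": "Q", "TCA": "S", "AGT": "S",
}
# Within this subset, table 12 changes only CTG (Ser instead of Leu).
TABLE12_SUBSET = {**STANDARD_SUBSET, "CTG": "S"}


def translate(dna: str, codon_map: dict) -> str:
    """Translate a DNA string codon by codon; whitespace is ignored."""
    bases = "".join(dna.split()).upper()
    if len(bases) % 3:
        raise ValueError("length is not a multiple of three")
    return "".join(codon_map[bases[i:i + 3]] for i in range(0, len(bases), 3))


# First ten codons of the coding region, copied from the gene sequence.
first_codons = "ATGGTCTCTC CGATACTGTC GCAGTCAAGT"
print(translate(first_codons, TABLE12_SUBSET))   # MVSPISSQSS
print(translate(first_codons, STANDARD_SUBSET))  # MVSPILSQSS
```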