Gene PICST_36809 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_36809 
SymbolMTR4 
ID4840256 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009046 
Strand
Start bp1394577 
End bp1397783 
Gene Length3207 bp 
Protein Length1068 aa 
Translation table12 
GC content45% 
IMG OID640391571 
ProductDead-box family helicase required for mRNA export from nucleus 
Protein accessionXP_001385966 
Protein GI150866387 
COG category[L] Replication, recombination and repair 
COG ID[COG4581] Superfamily II RNA helicase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.787056 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACGCTG ACGATCTCTT CGACGTGTTC GACGAAGCGC CTGTGACTGC TCCTCCAGTC 
ATTGAGACAG CTGGGGAGTC GAAAAAAGAG TCTAAGAAAA GAACAGCTGA AGACGTGAAG
AATGAGAGCG AAGTGAAAAA TTCCAGTAGT CAATACAATG ATGAGAACAA CAATGAGGCC
AGCAGTAATA ATAAAAAAGC TAACACGAGG TCAAAGCCAG ATATTAAACC TGTTGTCTTC
GATTCTCTTG AAATTGAGGC CTCCCGTGAA GTCGCAGCGC TGGACGGGCT TATGGCTAGT
GCTGAGTCTG TTTCAGACAA GAAGCCCGAA GGGTTGAAGT TGAGACATCA GGTTAGGCAC
CAGGTTGCCA TTCCGCCTGA TTATCCGTAT GTGCCCATCG GTGAGCACAA GAGACAAAAA
GAGGCGAGAA CTTATCCATT TATTTTAGAT CCTTTCCAGG ACACAGCCAT CTCGTGTATA
GACAGAGATG AGTCTGTACT TGTTTCTGCG CACACCTCAG CTGGTAAGAC AGTTGTAGCC
GAGTATGCTA TTGCACAGTC GTTGCGTGAC AAGCAGCGTG TCATTTATAC TTCGCCCATT
AAGGCTTTAA GTAACCAGAA GTATAGAGAG TTATTGGCAG AATTTGGCGA CGTAGGCTTG
ATGACCGGAG ATGTCACTAT CAACCCCGAT GCTGGGTGTT TGGTGATGAC GACGGAGATT
TTGAGGAGTA TGTTGTACCG TGGTTCGGAA ATCATGCGAG AAGTAGCTTG GGTCATCTTT
GATGAAGTCC ACTATATGAG AGACAAGTCG CGTGGTGTCG TCTGGGAAGA AACTATTATC
TTGTTACCAG ACAAGGTGCA TCATGTTTTT CTTTCTGCTA CCATTCCCAA TGCCATGGAA
TTCGCAGAAT GGATTGTTAA GATCCATGCG CAGCCTTGTC ACGTTGTGTA CACAGACTTC
AGGCCAACTC CATTGCAGCA CTACTTGTTT CCAGCTGCTG GAGATGGGAT CCATTTGGTT
GTAGACGAAA AGGGAACTTT CAGGGAAGAA AATTTCCAGA AAGCCATGGC TTCTATCAGC
GATGCTGGTG GCGATGATCC TGCCTCGGGT GATAAATCTA AAGGTAAGAA GGGTCAAACG
TACAAGGGAG GCAATAAGGA CGGTAAGTCG GACATCTACA AGATCGTCAA GATGATCTAC
ATGAAAAGGT ACAACCCTGT GATTGTGTTT TCGTTTTCGA AAAGAGACTG TGAGTCTCTT
GCATTGAAGA TGTCCAAGTT AGATTTCAAC AACGACGACG AAAGAGATGC CTTGACCAAG
ATCTTCAATA ACGCCATCAA CTTGTTGCCG GAAGCAGACA AAGAGTTGCC CCAGATCAAG
AACATCTTAC CCTTGTTGAA GCGAGGTATC GGTATCCACC ACTCGGGATT GTTGCCTATA
TTGAAGGAGA TTATCGAGAT CTTGTTTCAG GAGGGTCTTT TGAAGGTGTT GTTTGCTACA
GAGACCTTTT CAATTGGGTT GAATATGCCT GCTAAAACAG TAGTATTTAC CTCTGTTCGT
AAATGGGATG GTGTAGGCTT CAGATGGGTC TCGGGAGGTG AGTATATCCA GATGTCTGGT
AGAGCTGGTC GTCGTGGGTT GGATGATCGT GGTATAGTCA TCATGATGAT TGACGAAAAA
ATGGAGCCTC AAGTAGCCAA AGGTATGGTA AAGGGACAGG CTGATAGACT TGACTCGGCC
TTCCATTTGG GCTACAACAT GATCTTGAAT TTAATGAGAG TCGAAGGCAT TTCTCCCGAG
TTCATGTTGG AAAACTCATT CTACCAGTTC CAGAACGCTG CCTCTGTCCC TGTGATGGAA
AAGACGTTGC AGGACTTGAC TTTGAAGTAC AACACTATCG AAGTCGATGA TGAAGCTACA
GTGAAAGAGT ACTACGATCT CAAGAAGCAG TTGGACAAAT ACCAGGAAGA CGTCAGAAAA
GTAATTACTC ATCCTGGCTA CATCTTGCCT TTCTTGCAAG AAGGAAGAGT TATCAAAGTC
AAAATAGGTG ATCAGGACTA CGGCTGGGGA ATGGTGACTT CTTTCTCCAA ACGTAACAAT
AAGAGAAACC AGTCGTTCAC AGACTACGAA ACCTACATTG TCAATGTATT TGTGTACACG
ATGTTTGTGG ATTCGCCCGT TAACTTGATC AAGCCCTTGA ATCCCATGTT ACCTGAAGGG
ATTAGACCTG CTAAGGCCGG TGAGAAGTCC AGAGTGGAGT ACATTCCTAT CACTCTCGAT
TCAATTGAAA AGATCAGTAG TGTGCGGTTG AGAGTACCAG AAGACTTGAA GAGTTCAGCA
TCTAAGAAGA CGTTGTTGAA GACGATGAAG GATTTGCCCA AGAGATTGCC CGATGGAATT
CCATTGATGG ATCCTGTGGA AAACATGAAG ATCACTGACC AGGACTTCCA GATGCTCTTG
AAGAAGATCG ATGTTTTAGA CTCCAAGCTC ATCAGCAACC CCTTGTACAA CTCAGCAAGA
TTGAAGGATT TGTACGAAAA CTACAGCGAA AAGGAACAAA TACAGGAAAA AATCAAAAAC
TTGAAGGAGA AGGTTTTGGA AGCACAAGCA GTTATCCAAT TGGATGACTT GAGACACAGA
AAGAGAGTGT TGAGAAGGTT GGACTTTGTT ACACAGAACG ATATCATCGA GTTGAAGGGT
AGAGTTGCCT GTGAAATCAG TTCAGGTGAT GAGTTGCTTT TGACGGAATT GATTTTCAAC
GGTACCTTCA ACGACTTGAC ATGTGAACAG TGTGCTGCAT TGCTTTCGTG TTTTGTTTTC
CAAGAAAGAG CCAAGGAGAC TCCACGTTTG AAGCCAGAAT TGGCTGAGCC ATTGAAGTCT
ATGCAAGACA TGGCCAGCAA GATAGCTAAG GTGACGAAGG AAAGCAAGAT TGAAATAATA
GAAAAGGACT ACGTCGAGTC GTTCAGGCCA GAGTTGATGG AAGTTACATA TGCTTGGTGT
AAAGGTGCAT CGTTCACTCA GATCTGTAAG ATGACCGATG TGTACGAAGG GTCGTTGATC
AGAACTTTCA AGCGTTTGGA AGAATTGATC AGACAGTTGG TGCAAGCAGC CAAGACCATT
GGAAATACCG ACTTGGAGGA GAAGATGGAG AAGACCATCG AGTTGGTGCA CAGGGACATT
GTCTCTGCTG GATCTTTGTA TCTTTAG
 
Protein sequence
MDADDLFDVF DEAPVTAPPV IETAGESKKE SKKRTAEDVK NESEVKNSSS QYNDENNNEA 
SSNNKKANTR SKPDIKPVVF DSLEIEASRE VAASDGLMAS AESVSDKKPE GLKLRHQVRH
QVAIPPDYPY VPIGEHKRQK EARTYPFILD PFQDTAISCI DRDESVLVSA HTSAGKTVVA
EYAIAQSLRD KQRVIYTSPI KALSNQKYRE LLAEFGDVGL MTGDVTINPD AGCLVMTTEI
LRSMLYRGSE IMREVAWVIF DEVHYMRDKS RGVVWEETII LLPDKVHHVF LSATIPNAME
FAEWIVKIHA QPCHVVYTDF RPTPLQHYLF PAAGDGIHLV VDEKGTFREE NFQKAMASIS
DAGGDDPASG DKSKGKKGQT YKGGNKDGKS DIYKIVKMIY MKRYNPVIVF SFSKRDCESL
ALKMSKLDFN NDDERDALTK IFNNAINLLP EADKELPQIK NILPLLKRGI GIHHSGLLPI
LKEIIEILFQ EGLLKVLFAT ETFSIGLNMP AKTVVFTSVR KWDGVGFRWV SGGEYIQMSG
RAGRRGLDDR GIVIMMIDEK MEPQVAKGMV KGQADRLDSA FHLGYNMILN LMRVEGISPE
FMLENSFYQF QNAASVPVME KTLQDLTLKY NTIEVDDEAT VKEYYDLKKQ LDKYQEDVRK
VITHPGYILP FLQEGRVIKV KIGDQDYGWG MVTSFSKRNN KRNQSFTDYE TYIVNVFVYT
MFVDSPVNLI KPLNPMLPEG IRPAKAGEKS RVEYIPITLD SIEKISSVRL RVPEDLKSSA
SKKTLLKTMK DLPKRLPDGI PLMDPVENMK ITDQDFQMLL KKIDVLDSKL ISNPLYNSAR
LKDLYENYSE KEQIQEKIKN LKEKVLEAQA VIQLDDLRHR KRVLRRLDFV TQNDIIELKG
RVACEISSGD ELLLTELIFN GTFNDLTCEQ CAALLSCFVF QERAKETPRL KPELAEPLKS
MQDMASKIAK VTKESKIEII EKDYVESFRP ELMEVTYAWC KGASFTQICK MTDVYEGSLI
RTFKRLEELI RQLVQAAKTI GNTDLEEKME KTIELVHRDI VSAGSLYL