Gene PICST_36041 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_36041 
SymbolSNU114 
ID4838865 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009044 
Strand
Start bp1418084 
End bp1421074 
Gene Length2991 bp 
Protein Length978 aa 
Translation table12 
GC content41% 
IMG OID640390180 
ProductATP dependent RNA helicase and U5 mRNA splicing factor 
Protein accessionXP_001384231 
Protein GI150865136 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0480] Translation elongation factors (GTPases) 
TIGRFAM ID[TIGR00231] small GTP-binding protein domain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.556904 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACGACG ACATCTATGA CGAGTTTGGA AACTTGATAG GTGACGCTTT TGACTCGGAT 
GCGGAGTCAC TGGACGAATC TGCATTAGAA AATGAAGTAG AACCACAAGA TGAAGAAGTA
GATATTGAGT CCGATACAGA AGAAAAGGAA AATGGAATTG ACTTGAAAAT GAACGTTGAC
GAAACAGTAA CTGAACAGCA GAACAGCTTG GATTTGGTGA AGAGTACTGG AGCGGGCAAG
TTCGCCGAAG ATGTCAAACA AATCATAGTA GACCCCGCTG AACCACCTCA AGATGAGCCT
GTTATTCAGC CAAGAGTAGA AAAGAAGTTG AAAGTTGATT TCACAGACAA CATCAAGAGT
GACCTGAAGG AAAACGGAGA AGCTAGCATT ATGGCTGGCT TGCCAGAAGT GATTTATTCC
AGAGAATATA TGATACAAAC CATGACACTG TTACCAGAGA GGATACGAAA CATAGCGTTA
GTGGGCAATT TACACTCAGG AAAGACCACT TTCGTGGACT CATTGGTTCT ACATACCCAT
TCGCCTTCTA TTGGATTAAA GAAGTCGCTC AAGAATTTCA AACCTTTACG GTTTATGGAT
AATCACAAGC TCGAGATAGA CAGAGGTACT ACAATCAAAA CTAGCCCCAT CACCTTGATG
TTACAAGATT TGAAAAATAG ATCGGCTATA TTCAACATAC TCGATACTCC TGGCCATGCG
GACTTTGAAG ATGAAACTAT TGCTGCCATT GCGGCTGTAG ACGGAATAAT TCTTGTTGTA
GATGTAGTTG AAGGAATCAC TGCAAGAGAC AGAAGTCTTG TTGACCATGC TGTCAAGGAG
AATGTTCCCA TAGTTTTGAT GTTGAACAAG ATCGATAGGT TGATTCTTGA GTTGAAACTT
CCTGTAAGAG ACTGCTACCA GAAGCTCAAC TACATAGTGG AGGATGTCAA CCAACGACTT
AGTCAGAACG AGTTTATTGC TAACTACACT CACTCTACAA CGGTATCTCC TGTGGAGAAC
AACGTTATTT TTGCATCTTC AACGTTTGAA TTCACTTTTT CGCTAATCAG CTTTGCTGAC
CTCTATCTCC GTAAATCTGG AATAACAGGT GTCGACATAG AAGAATTCAG CAAGCGGCTT
TGGGGTGATT ACTTCTACGA TAAAAAGACT AATAAGTTTT CAACTAATTC ACAGGACGGT
AAGCTTTCGC GATCGTTTGT TTCTTTCATT CTCGAACCCA TCTACAAAAT CATAACTTAC
ACATTGGTGT CAGAGCCAGG GGACACCAGA TTACCTTCAC TTTTATGGGA CAACTTTGGT
GTGAAGCTAA ACAAACAGCA GTACAAACAA GATCCTCAGA TATTATTGAA GGATGTGTTT
AAAGCCATTT TTGACGATAA CAAGGGATTC GTACACTCCG TCAATTCTAG CATAAGCAAT
CCTCGGATTT CACAAATAAG AGGAATCAAC TCCCAGAACT TGCCTGATGA TTCCGTACTT
GCTCGAGTAG TCAAACTAGT GGAATCTTCA GACGCTTCCC AATTCTTGTC GATAGTCAGA
GTATTCAAGG GAGAATTGAT AGTAGGATCC AAGATAAAGG TCTTGGGAGA AAACTATGCA
GAGGATAATG AAGACTACAA GATACAGACA GTTGAAGAGC TCTATTTATC TGGCGGGCGA
TATAAAGTTC CCATAGATGT TGCTGGCGAA GGTGCAATTG TAATTGTGGG TGGTATTGAT
TCCATTGTCA ATAAGGGTGC TACTATCTTA GCAGCTAATA AACTGTTAGA GAATTGTGAA
ATATTTTCCC AGCCTAACTA CGGCAGCAAG TCAGTGTTTA AAGTAGCTGT GGAACCAGCA
AATCCTTCTG AATTACCCAA AATGTTAGAA GGGTTGAGAA AAATCAACAA ATCGTACTTG
GCTGCAGTTA TCAATGTAGA AGAAAGTGGC GAGCATGTTA TTCTTGCACC AGGAGAGCTA
TATTTGGATT GTGTCTTACA TGATTTGCGA CTCTTTTTCA CTGACAATTT GGAAATAAAA
GTCAGTGACC CCATGACAAA GTTCAGCGAA ACCGTTGTAG AAGGCTCAAT TACAAAAATA
ACCACCAGCA CTCCTTCCGG AAACAATCTG ATTTCGATCA TAGCTGAGCC TTTGAATGAT
TCAAAATTGA GCTATGCGAT TGAATCGGGC TCAATTGACC TCAGTCAGCC AGCTAAAATA
ACATCTAAGA TATTGAGAAA AGACTTCGGT TGGGATGCAT TGGCCGCAAG ATCAGTTTGG
TGTTTTGGTC CTGAAGGCTT ACAATCGCCT AGTCTCTTAC TCGATGACAC ACTAGAAGAA
GAGACCGATA AAAAATTGTT ATATTCAGTG AAAGATTCAA TTTGCCAAGG ATTTAAATGG
AGTATAAGCG AGGGGCCACT ATGTAACGAG CCTATCAGAA ATACTAAGTT CAAAATCTTG
GATGCTGTTA TTAGCGGCTC GGAAATTCAT AGAAGTGGAA CTCAGATTAT ACCAATGACC
AGAAAAGCAT GTTATGCAGG GTTTTTAACT GCAACATCTC GTTTGATGGA ACCAATCTAC
TCAGTGACTG TAGTGTGTAC GCATAGTGCC AAAGCATTAG TGTCAAAGCT CTTAGATGGT
AGAAGAGGTA ATATCATCAA AGACTGGCCA GTTCCAGGTA CTCCGCTCTT TGAGTTGGAG
GGGCATGTTC CCGTTATCGA GTCTGTAGGT CTTGAGACAG ATATCCGAAT CCGTGCTCAA
GGTCAAGCTA TGTGTTATCT TACATTTAGC AATTGGCAAG TTGTGCCAGG AGATCCACTC
GATCCTGACT GTTTCTTACC ATCTTTGAAA CCAGTACCTG CAGAGTCACT TGCTAGAGAC
TTCGTAATGA AAACGAGAAG AAGAAAAGGT ATGACAGGCG AGCCAAGCTT ACAAAAGTAC
ATCGATACAA ACTTGTATAC TAGATTAAGA GAAAAGGGAA TTGTTCGTTA G
 
Protein sequence
MDDDIYDEFG NLIGDAFDSD AESSDESALE NEVEPQDEEV DIESDTEEKE NGIDLKMNVD 
ETFAEDVKQI IVDPAEPPQD EPVIQPRVEK KLKVDFTDNI KSDSKENGEA SIMAGLPEVI
YSREYMIQTM TSLPERIRNI ALVGNLHSGK TTFVDSLVLH THSPSIGLKK SLKNFKPLRF
MDNHKLEIDR GTTIKTSPIT LMLQDLKNRS AIFNILDTPG HADFEDETIA AIAAVDGIIL
VVDVVEGITA RDRSLVDHAV KENVPIVLML NKIDRLILEL KLPVRDCYQK LNYIVEDVNQ
RLSQNEFIAN YTHSTTVSPV ENNVIFASST FEFTFSLISF ADLYLRKSGI TGVDIEEFSK
RLWGDYFYDK KTNKFSTNSQ DGKLSRSFVS FILEPIYKII TYTLVSEPGD TRLPSLLWDN
FGVKLNKQQY KQDPQILLKD VFKAIFDDNK GFVHSVNSSI SNPRISQIRG INSQNLPDDS
VLARVVKLVE SSDASQFLSI VRVFKGELIV GSKIKVLGEN YAEDNEDYKI QTVEELYLSG
GRYKVPIDVA GEGAIVIVGG IDSIVNKGAT ILAANKSLEN CEIFSQPNYG SKSVFKVAVE
PANPSELPKM LEGLRKINKS YLAAVINVEE SGEHVILAPG ELYLDCVLHD LRLFFTDNLE
IKVSDPMTKF SETVVEGSIT KITTSTPSGN NSISIIAEPL NDSKLSYAIE SGSIDLSQPA
KITSKILRKD FGWDALAARS VWCFGPEGLQ SPSLLLDDTL EEETDKKLLY SVKDSICQGF
KWSISEGPLC NEPIRNTKFK ILDAVISGSE IHRSGTQIIP MTRKACYAGF LTATSRLMEP
IYSVTVVCTH SAKALVSKLL DGRRGNIIKD WPVPGTPLFE LEGHVPVIES VGLETDIRIR
AQGQAMCYLT FSNWQVVPGD PLDPDCFLPS LKPVPAESLA RDFVMKTRRR KGMTGEPSLQ
KYIDTNLYTR LREKGIVR