Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_36041 |
Symbol | SNU114 |
ID | 4838865 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009044 |
Strand | + |
Start bp | 1418084 |
End bp | 1421074 |
Gene Length | 2991 bp |
Protein Length | 978 aa |
Translation table | 12 |
GC content | 41% |
IMG OID | 640390180 |
Product | ATP dependent RNA helicase and U5 mRNA splicing factor |
Protein accession | XP_001384231 |
Protein GI | 150865136 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0480] Translation elongation factors (GTPases) |
TIGRFAM ID | [TIGR00231] small GTP-binding protein domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.556904 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGACGACG ACATCTATGA CGAGTTTGGA AACTTGATAG GTGACGCTTT TGACTCGGAT GCGGAGTCAC TGGACGAATC TGCATTAGAA AATGAAGTAG AACCACAAGA TGAAGAAGTA GATATTGAGT CCGATACAGA AGAAAAGGAA AATGGAATTG ACTTGAAAAT GAACGTTGAC GAAACAGTAA CTGAACAGCA GAACAGCTTG GATTTGGTGA AGAGTACTGG AGCGGGCAAG TTCGCCGAAG ATGTCAAACA AATCATAGTA GACCCCGCTG AACCACCTCA AGATGAGCCT GTTATTCAGC CAAGAGTAGA AAAGAAGTTG AAAGTTGATT TCACAGACAA CATCAAGAGT GACCTGAAGG AAAACGGAGA AGCTAGCATT ATGGCTGGCT TGCCAGAAGT GATTTATTCC AGAGAATATA TGATACAAAC CATGACACTG TTACCAGAGA GGATACGAAA CATAGCGTTA GTGGGCAATT TACACTCAGG AAAGACCACT TTCGTGGACT CATTGGTTCT ACATACCCAT TCGCCTTCTA TTGGATTAAA GAAGTCGCTC AAGAATTTCA AACCTTTACG GTTTATGGAT AATCACAAGC TCGAGATAGA CAGAGGTACT ACAATCAAAA CTAGCCCCAT CACCTTGATG TTACAAGATT TGAAAAATAG ATCGGCTATA TTCAACATAC TCGATACTCC TGGCCATGCG GACTTTGAAG ATGAAACTAT TGCTGCCATT GCGGCTGTAG ACGGAATAAT TCTTGTTGTA GATGTAGTTG AAGGAATCAC TGCAAGAGAC AGAAGTCTTG TTGACCATGC TGTCAAGGAG AATGTTCCCA TAGTTTTGAT GTTGAACAAG ATCGATAGGT TGATTCTTGA GTTGAAACTT CCTGTAAGAG ACTGCTACCA GAAGCTCAAC TACATAGTGG AGGATGTCAA CCAACGACTT AGTCAGAACG AGTTTATTGC TAACTACACT CACTCTACAA CGGTATCTCC TGTGGAGAAC AACGTTATTT TTGCATCTTC AACGTTTGAA TTCACTTTTT CGCTAATCAG CTTTGCTGAC CTCTATCTCC GTAAATCTGG AATAACAGGT GTCGACATAG AAGAATTCAG CAAGCGGCTT TGGGGTGATT ACTTCTACGA TAAAAAGACT AATAAGTTTT CAACTAATTC ACAGGACGGT AAGCTTTCGC GATCGTTTGT TTCTTTCATT CTCGAACCCA TCTACAAAAT CATAACTTAC ACATTGGTGT CAGAGCCAGG GGACACCAGA TTACCTTCAC TTTTATGGGA CAACTTTGGT GTGAAGCTAA ACAAACAGCA GTACAAACAA GATCCTCAGA TATTATTGAA GGATGTGTTT AAAGCCATTT TTGACGATAA CAAGGGATTC GTACACTCCG TCAATTCTAG CATAAGCAAT CCTCGGATTT CACAAATAAG AGGAATCAAC TCCCAGAACT TGCCTGATGA TTCCGTACTT GCTCGAGTAG TCAAACTAGT GGAATCTTCA GACGCTTCCC AATTCTTGTC GATAGTCAGA GTATTCAAGG GAGAATTGAT AGTAGGATCC AAGATAAAGG TCTTGGGAGA AAACTATGCA GAGGATAATG AAGACTACAA GATACAGACA GTTGAAGAGC TCTATTTATC TGGCGGGCGA TATAAAGTTC CCATAGATGT TGCTGGCGAA GGTGCAATTG TAATTGTGGG TGGTATTGAT TCCATTGTCA ATAAGGGTGC TACTATCTTA GCAGCTAATA AACTGTTAGA GAATTGTGAA ATATTTTCCC AGCCTAACTA CGGCAGCAAG TCAGTGTTTA AAGTAGCTGT GGAACCAGCA AATCCTTCTG AATTACCCAA AATGTTAGAA GGGTTGAGAA AAATCAACAA ATCGTACTTG GCTGCAGTTA TCAATGTAGA AGAAAGTGGC GAGCATGTTA TTCTTGCACC AGGAGAGCTA TATTTGGATT GTGTCTTACA TGATTTGCGA CTCTTTTTCA CTGACAATTT GGAAATAAAA GTCAGTGACC CCATGACAAA GTTCAGCGAA ACCGTTGTAG AAGGCTCAAT TACAAAAATA ACCACCAGCA CTCCTTCCGG AAACAATCTG ATTTCGATCA TAGCTGAGCC TTTGAATGAT TCAAAATTGA GCTATGCGAT TGAATCGGGC TCAATTGACC TCAGTCAGCC AGCTAAAATA ACATCTAAGA TATTGAGAAA AGACTTCGGT TGGGATGCAT TGGCCGCAAG ATCAGTTTGG TGTTTTGGTC CTGAAGGCTT ACAATCGCCT AGTCTCTTAC TCGATGACAC ACTAGAAGAA GAGACCGATA AAAAATTGTT ATATTCAGTG AAAGATTCAA TTTGCCAAGG ATTTAAATGG AGTATAAGCG AGGGGCCACT ATGTAACGAG CCTATCAGAA ATACTAAGTT CAAAATCTTG GATGCTGTTA TTAGCGGCTC GGAAATTCAT AGAAGTGGAA CTCAGATTAT ACCAATGACC AGAAAAGCAT GTTATGCAGG GTTTTTAACT GCAACATCTC GTTTGATGGA ACCAATCTAC TCAGTGACTG TAGTGTGTAC GCATAGTGCC AAAGCATTAG TGTCAAAGCT CTTAGATGGT AGAAGAGGTA ATATCATCAA AGACTGGCCA GTTCCAGGTA CTCCGCTCTT TGAGTTGGAG GGGCATGTTC CCGTTATCGA GTCTGTAGGT CTTGAGACAG ATATCCGAAT CCGTGCTCAA GGTCAAGCTA TGTGTTATCT TACATTTAGC AATTGGCAAG TTGTGCCAGG AGATCCACTC GATCCTGACT GTTTCTTACC ATCTTTGAAA CCAGTACCTG CAGAGTCACT TGCTAGAGAC TTCGTAATGA AAACGAGAAG AAGAAAAGGT ATGACAGGCG AGCCAAGCTT ACAAAAGTAC ATCGATACAA ACTTGTATAC TAGATTAAGA GAAAAGGGAA TTGTTCGTTA G
|
Protein sequence | MDDDIYDEFG NLIGDAFDSD AESSDESALE NEVEPQDEEV DIESDTEEKE NGIDLKMNVD ETFAEDVKQI IVDPAEPPQD EPVIQPRVEK KLKVDFTDNI KSDSKENGEA SIMAGLPEVI YSREYMIQTM TSLPERIRNI ALVGNLHSGK TTFVDSLVLH THSPSIGLKK SLKNFKPLRF MDNHKLEIDR GTTIKTSPIT LMLQDLKNRS AIFNILDTPG HADFEDETIA AIAAVDGIIL VVDVVEGITA RDRSLVDHAV KENVPIVLML NKIDRLILEL KLPVRDCYQK LNYIVEDVNQ RLSQNEFIAN YTHSTTVSPV ENNVIFASST FEFTFSLISF ADLYLRKSGI TGVDIEEFSK RLWGDYFYDK KTNKFSTNSQ DGKLSRSFVS FILEPIYKII TYTLVSEPGD TRLPSLLWDN FGVKLNKQQY KQDPQILLKD VFKAIFDDNK GFVHSVNSSI SNPRISQIRG INSQNLPDDS VLARVVKLVE SSDASQFLSI VRVFKGELIV GSKIKVLGEN YAEDNEDYKI QTVEELYLSG GRYKVPIDVA GEGAIVIVGG IDSIVNKGAT ILAANKSLEN CEIFSQPNYG SKSVFKVAVE PANPSELPKM LEGLRKINKS YLAAVINVEE SGEHVILAPG ELYLDCVLHD LRLFFTDNLE IKVSDPMTKF SETVVEGSIT KITTSTPSGN NSISIIAEPL NDSKLSYAIE SGSIDLSQPA KITSKILRKD FGWDALAARS VWCFGPEGLQ SPSLLLDDTL EEETDKKLLY SVKDSICQGF KWSISEGPLC NEPIRNTKFK ILDAVISGSE IHRSGTQIIP MTRKACYAGF LTATSRLMEP IYSVTVVCTH SAKALVSKLL DGRRGNIIKD WPVPGTPLFE LEGHVPVIES VGLETDIRIR AQGQAMCYLT FSNWQVVPGD PLDPDCFLPS LKPVPAESLA RDFVMKTRRR KGMTGEPSLQ KYIDTNLYTR LREKGIVR
|
| |