Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_81358 |
Symbol | |
ID | 4837241 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009042 |
Strand | - |
Start bp | 2223577 |
End bp | 2228239 |
Gene Length | 4663 bp |
Protein Length | 1407 aa |
Translation table | 12 |
GC content | 39% |
IMG OID | 640388556 |
Product | predicted protein |
Protein accession | XP_001383188 |
Protein GI | 150864398 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1643] HrpA-like helicases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.558576 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCAAAA AGACCAAAAC CAAGCAGCAG GAAGAAACGC CTCCTCCAGG CAATGCGAAG GGTAAGAAAA ACTCCAAATC TAAGGTAGAA GAACCACAGG ATGAAGTCAA GAAACCACTT AGAGGTTTGG CTATAGGGGA AAACTTTGGA TGGACGGGAA AACTTCCAGT AACCTTATTG AATGAATATT GCCAAAAACA AAAATGGGGC AAGCCCAGCT TCGATATGGT TAAGAAGGGG CAGGGATTTG TCTGTTTCTT TCATCTTAAT TCGGTTAATC CGAAGACAAA AGAGCCGATC TCCATAAAAT TCATACCCAA GTACGAACCA AAACTGACAA CAAATGAAGC TAGGCATATG TCGGCTACGT ATGTTTTGTA CCGTATTAAT TTCATCAAAA ACATGAAAAT GCTTCTTCCG AACATTTTCC GAGACTATTG GGTCGAACTC GAAAAGGAAC GACTCGAAGT CTTGAAGAAA GACAAGTATA AGCATGACTT GGTATACAAT GTCAATCCCT TCCAAGTATA TATAGAGCAG CAAGAGTTGG TGCAGAAAAA GACTAAACAG GAACTTCTCC GGAAGGAAAA TGAAAGTAAA ATAAAGAAAC CTTCTATCAG CATAGGCACT ACTGTTGTTT CAGCTAAAGC AAGTACCGCC ACCTCGAGTA ATCATAGAAA TCAAGGAAAT ACAGGTACAA AGGTCAACAA GATACCACAA ACTATCAATG TGCCTACATT TCCAAGAAAA GTATGGGGCA ATGCTCCATT CATTGACTTT CCACCGGAGA TTCGTAGTTC CTTGGAACAG TCAATCCGTA AACATATCAA TTGGATCATT GATAACGAAC ATCCAATGTC TCAAAATAAA TCCAAGGCCG AGGAAAATCA GAAGTCATTG CTAGCATTAG GTTTTAGACA AATTCACATC GAAGAAGCTT TCAAGTACAC TTCTTCGTTT GTAGATTCCT TGGAATGGCT CTTATTCCAT ATTCCTGAAG ATGACCTTCC ACCGCAATTC ACTAAGACTG AAAAGGATTC TAGTGTCATG CTTAGAGTTT CCAAAGATAT CAAGCTTGAG TATATGTTGA AACGGTTGTC AGAATCAGGT TTCGATAAAG ATGACGTCTT GACTTCCTTA GAGGAATGCG AGTTTAACGA AGTGCAAGCT GGCGTCAAAC TCACACGTAC ATTTTTAGAA TTAGAGTCAG ATCAAGAATT TGAAGCTACA GATTTGGAAG ACTCTCAGAG CTTATGGGAA CAAGAGCTAG AAGGAATAAA AATGATGGAG TCCAACAAAG TTACCTTTAA TAATTCTGAG AAAACTATTG TAGATATTCA ACTCAGACCA CATAAGATCG CTGAAGACTT ACTTAATGTT CGTCTACTCA AAAGCTTGCA ATACCCTTGT GAGCTCCCTG GTATTCAAAT CGTTGTGAAC AACCCTTCCT TCAAATTAGC GAACTACATC AAGTTGTCAA TTCTCTCTAG TTTGGTACAA TACATCACCG AAAATGAAAT TCTCGGTGAG TGCATGATTT TCAACATTAT CGAATGGCTC GAGGAGAACA TTTCTCGAAT TATTGATAAC CCAGGTCCTT TAATTCTGGA AGGAGTAATT AATAGAGCAA CCCAAAGTAA CAAGAAAAGG ACCATTTCTA CTACTACATC TCAAAACAAA CAGTCGAGAA TTAAAGTTAA ACTGCCAAAG GAAATAGAAG CAGATAAACT TGCCTACAAG GCTAAGTTGA CGTCAAAGGA AATGCAGAAG TCTTTGAAAC AGAGAAGTTC CTTACCAGCT TGGAAAAAGA AAGAACAATT GGTGGACGTT ATTAACTCCA ACAAGGTTAC CTTGGTGACA GGGGAGACTG GTTCGGGTAA ATCCACTCAA ATCGTGCAAT TTATTCTCGA TGATATGAAC TCCAGGGGCA ATTTTTCCGG CAAGATTATG TGTACTCAAC CAAGACGTAT TTCTACCCTT GGTCTTGCTG ATAGAATATC GGAAGAACGT TTGGATAAAG TTGGCGGAGA AACAGGGTAC ATTATTAGAG GTGAAAATAA AACCAGTAAA GATACCCGAA TATCATTTGT GACTACAGGT GTTCTTTTGA GAATGTTGCA ATCTTTCCTT GCATCTTCCA GTTCACATCA GACTTCTATT TTTGATGAGT TAGAATACAT CTTCATTGAT GAAGTTCACG AACGTTCCGT GGATAGTGAT TTTTTACTCA TCATATTAAA AAAGACCATG AATCGTTTTC CTAACCTTAA GATTGTCCTA ATGTCTGCAA CCATAAGTGT CGAGATCTTC AAAAACTTTT TCAATACACC TTTAAATCAT ATTCATATTG AAGGTAGAAC ATTTCCAATT GAAGACTACT ACTTAGATCA AATTATTGAT GATATCGACT ACACCGTGGA AACTGCAAAT GGAATTGTGA AACCAAGGGC GGATTCTCAT TACTTTGAGA AGGGAAACAT TAATTTCGAC TTAGTTGCTA GGTTGTGTTT GCACATCGAT GATAAACTTG ATTCCGAGGG AAATGATGGC TCCATCTTAA TATTCTTACC AGGTATCATG GAGATTAATC AATGTGTCTC CATTATTGAG AGAGCCTTCT CGAAGAGGGA CAAACCATCT TGGACGCTCC CATTACACTC AGCATTGTCT TCTATGGAAC AGAAACGAGT CTTTAAGGTA CCAGCGAAGG GTACAAGAAA GATTGTGGTG TCTACAAACG TTGCAGAAAC TTCGATTACT ATTCCCGACT GTGTGGTGGT CGTTGATGGA GGTAGATCGA AGACGATGTT CTATGATCCC GAAAAGAATA CAACTAGATT AATTGAAAAC TGGTGTTCCA AGGCCGAAAT TGGACAGCGG AGAGGTAGAT CAGGTCGTGT AACCAATGGT AACTGCTACC ATCTTTATAC CAAAGAAATT GAAACCAAAA TGAGACAACA GGCCGTTCCT GAAATCAAAC GGACAAGACT TGAAAATTTG TACTTAGTTG TGAAATCCAT GGGCATAAGG AGTGTTGATG AATTCTTGAA CAGTGGTATC GACGCTCCGG ATCAATCGTC GTTGAAAACT TCGAAGAAAT TCTTAAAAGA AATTGGTGCT TTGGATGCTG ATACTGAAGA ATTATCGCAC CTCGGTAAGT ATTTATCATA TCTTCCAACG GATCTCCAGA GTGGTAAATT GCTCATTCTT GGTTGTATAT TTGGATGTTT AGATATATGC TTGACATTAG CATCTATCAG CTCTACTGGT AATCCTTTTT TCAATCTTGC AGACAAAAGA GCGGAGATCA AGCAAAAGAG AAGAGAGTTT TCCCAGAACC AAGGAGACTT TGTCGCCATA GCTAATGCAT ATGCAGAATA CGATAAAATG AAACAAAATG GCGAGAACAC CAAGAAATTC ATTTCTCTGA ACTACTTGTC ATTTATTACA TTGAATGATA TCTCGTCTAC AAGGGTACAA TACATTTCTT TACTTAAGGA TCTTGGATTT GTTCCACATG GATACTCACA CAGGAATCGT AATGTCGATT TTAACTTTCT TAATAGAAAT AACGAAAATT TCGGAGTCAT AAGAGCTATT ATCACGGCTT CTTTCTATCC TCAGATTGCT AGAGTCCAGC TTCCCGACCC CAAATATGTG AAGTCGGCAG TGGGATCTGT GGCAGTAGAT CCGGAAGCTT CATTGACCAA GTTCTGGGTC CGTAATGAAG AATATGTCGA TCTTGTGGAG AGCAACAAAA ACTTGGGAGA TGTATTACCA GCCAATAGGG CCAGAGTTCA CAACAGTTCT GTTATTTTTG ATGACAATTC TGTTGACGGA TCACTCAGCG AAAAGATATT AGAACAAGCT ACGGATGAGG AAGGAAATCT CGATTTCGTA AAGGCAAGAG AATTGTACGA TTTGACTCCG CAGGCACCTA AGGGAGGTAA TCCAATTCTT AAATCACCAT TTGTGGTGTA CGGAAGATCA CACCAGACTT TTGCATTTTT CTTGAGTGAC ATCACTCCTA CCAGTACAAT TGCTGCCTTA TTATTTGGAG GTAGCATAGG CTACAATTTG AGCGACCAAC TTTCGAATGG ACCTACTGGG ATTGTGTTGG ACAATTGGTT GCCAATTCGA ACGTGGTACA AGAATGGTGT TCTAATCAAA CGACTCAGAA AACTAGTGGA TGGAATGATC GAAGACAGGT TATCCAGTCC CCATTACGTG AACAGTCAGT CAAAAGATGC CAATGACGAC ATTCTTGCAG TTGTTGAGCA ATTGTTGGCA CTTTAGAAAT GGCTGCAAAA TCGATTAGAG AAAACGATAC AAAAACTCCA ACAGAAAAGT ATCAAACCGG AAATACCCAA AACGGAAACC ATGTCACGAG CGCAAATTAC AGTGCGTATA AAATAGAAGC TTAAGTATTG CTTTTTTGTG CGCAGAAGAT CTCTTGATAT GTCAGAAATG TCGCGCGACA TTTCGATGGC TGACAAATTG CGAAATCCAG CCACAGTTAT TCCATCTTGA TTTCAAACTC CAGATTCTTA GTAATAGATA CAACTGGATC ATTCTTCATA TATTTATAGA GTCAGAACCA CTAGATTGGT GAAATAAGCA TTTTTTGTAC ATACATTTAA TCTACAGCAT TTGCATTTAA TAT
|
Protein sequence | MAKKTKTKQQ EETPPPGNAK GKKNSKSKVE EPQDEVKKPL RGLAIGENFG WTGKLPVTLL NEYCQKQKWG KPSFDMVKKG QGFVCFFHLN SVNPKTKEPI SIKFIPKYEP KSTTNEARHM SATYVLYRIN FIKNMKMLLP NIFRDYWVEL EKERLEVLKK DKYKHDLVYN VNPFQVYIEQ QELVQKKTKQ ELLRKENESK IKKPSISIGT TVVSAKASTA TSSTKVNKIP QTINVPTFPR KVWGNAPFID FPPEIRSSLE QSIRKHINWI IDNEHPMSQN KSKAEENQKS LLALGFRQIH IEEAFKYTSS FVDSLEWLLF HIPEDDLPPQ FTKTEKDSSV MLRVSKDIKL EYMLKRLSES GFDKDDVLTS LEECEFNEVQ AGVKLTRTFL ELESDQEFEA TDLEDSQSLW EQELEGIKMM ESNKVTFNNS EKTIVDIQLR PHKIAEDLLN VRLLKSLQYP CELPGIQIVV NNPSFKLANY IKLSILSSLV QYITENEILG ECMIFNIIEW LEENISRIID NPGPLISEGV INRATQSNKK RTISTTTSQN KQSRIKVKSP KEIEADKLAY KAKLTSKEMQ KSLKQRSSLP AWKKKEQLVD VINSNKVTLV TGETGSGKST QIVQFILDDM NSRGNFSGKI MCTQPRRIST LGLADRISEE RLDKVGGETG YIIRGENKTS KDTRISFVTT GVLLRMLQSF LASSSSHQTS IFDELEYIFI DEVHERSVDS DFLLIILKKT MNRFPNLKIV LMSATISVEI FKNFFNTPLN HIHIEGRTFP IEDYYLDQII DDIDYTVETA NGIVKPRADS HYFEKGNINF DLVARLCLHI DDKLDSEGND GSILIFLPGI MEINQCVSII ERAFSKRDKP SWTLPLHSAL SSMEQKRVFK VPAKGTRKIV VSTNVAETSI TIPDCVVVVD GGRSKTMFYD PEKNTTRLIE NWCSKAEIGQ RRGRSGRVTN GNCYHLYTKE IETKMRQQAV PEIKRTRLEN LYLVVKSMGI RSVDEFLNSG IDAPDQSSLK TSKKFLKEIG ALDADTEELS HLGKYLSYLP TDLQSGKLLI LGCIFGCLDI CLTLASISST GNPFFNLADK RAEIKQKRRE FSQNQGDFVA IANAYAEYDK MKQNGENTKK FISSNYLSFI TLNDISSTRV QYISLLKDLG FVPHGYSHRN RNVDFNFLNR NNENFGVIRA IITASFYPQI ARVQLPDPKY VKSAVGSVAV DPEASLTKFW SNKNLGDVLP ANRARVHNSS VIFDDNSVDG SLSEKILEQA TDEEGNLDFV KARELYDLTP QAPKGGNPIL KSPFVVYGRS HQTFAFFLSD ITPTSTIAAL LFGGSIGYNL SDQLSNGPTG IVLDNWLPIR TWYKNGVLIK RLRKLVDGMI EDRLSSPHYS KDANDDILAV VEQLLAL
|
| |