Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_31007 |
Symbol | |
ID | 4837982 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009043 |
Strand | + |
Start bp | 1499283 |
End bp | 1500326 |
Gene Length | 1044 bp |
Protein Length | 347 aa |
Translation table | 12 |
GC content | 45% |
IMG OID | 640389297 |
Product | predicted protein |
Protein accession | XP_001383577 |
Protein GI | 150864655 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0164] Ribonuclease HII |
TIGRFAM ID | [TIGR00729] ribonuclease H, mammalian HI/archaeal HII subfamily |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.718365 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.389801 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTCGAC CACTGTCCGT GGAAGCTCCT GAGCTGGAAA GTGCAAGAAA AAGGAGGAAA TTGGAAACTG GCACGGCTTC AGCAGAAGAA GCTGGTGATT CAAGACCATA TCCTTTGTCA GTAACAAGCA TAGAGAACCA TTTTGAATTC AAGTCATCAA CATATCATTC GGCAATCCCC GTAGAAGTTC TCGAGAATCC AGACGAACCA GTTGTCTTGG GAGTAGATGA AGCTGGCAGA GGTCCAGTTT TGGGTCCAAT GGTGTATGGC ATTGCATTTG CATTGGAGAA GTATCTGACA AGATTGCAGA AGGAATATGG GTTTGCCGAT TCCAAGACTT TAAAGGAAGA GAAAAGAGAT GAACTATTCT ACAGCATAGA GGATGAAGCG AATGAGCTTA ACAGAAATGT TGGCTGGGCT ACTACTACGA TGACAGCTAG AGACATTTCT TCAGGGATGT TACGTTCAGT TTTGGGAATA GGCAACTACA ACTTAAACGA ACAAGCCCAC GACACTACCA TTCAGCTTAT CAAGGAAGTT ATTGCCAAAG GAGTTAATGT GAAGAAAATC TACGTAGACA CAGTAGGTCC CCCTGTGACG TACCAGGCCA AATTGCAGAA GATATTTCCA GAAACGGAAG TTACGGTTGC GAAAAAGGCA GACAGTATAT ATCCCATAGT AAGTACTGCT TCCGTGATGG CCAAGGTGAC AAGAGACGCC AATATCAGGT GGTATAACCA CAATTTGGAT GTGTTGAAGG GCCACAAATT GGGTTCAGGC TATCCCAGTG ACCCCAATAC CAGCAAGTGG CTCAATGGTA ATGTCGACAA GGTTTTTGGC TGGTGCTACG GGTTTATTCG ATTCTCATGG CAGACAGCCA AGGACTCGTT GGTGAAACAC GACGGGGTAG AGGTGATTTA CGAAGATGAA TGTGTAAAGC AGGACAATGG ATATGGCAAT GTCAGCGAGT ATTTCAGCCA TAAGGACGAG CCTGTGAGAG GGAGCATCGA TAAGTTGTAT TATAGTAGCG GAGTGAAACT TTGA
|
Protein sequence | MSRPSSVEAP ESESARKRRK LETGTASAEE AGDSRPYPLS VTSIENHFEF KSSTYHSAIP VEVLENPDEP VVLGVDEAGR GPVLGPMVYG IAFALEKYST RLQKEYGFAD SKTLKEEKRD ELFYSIEDEA NELNRNVGWA TTTMTARDIS SGMLRSVLGI GNYNLNEQAH DTTIQLIKEV IAKGVNVKKI YVDTVGPPVT YQAKLQKIFP ETEVTVAKKA DSIYPIVSTA SVMAKVTRDA NIRWYNHNLD VLKGHKLGSG YPSDPNTSKW LNGNVDKVFG WCYGFIRFSW QTAKDSLVKH DGVEVIYEDE CVKQDNGYGN VSEYFSHKDE PVRGSIDKLY YSSGVKL
|
| |