Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_52708 |
Symbol | |
ID | 4851429 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009068 |
Strand | - |
Start bp | 1797482 |
End bp | 1799425 |
Gene Length | 1944 bp |
Protein Length | 647 aa |
Translation table | |
GC content | 44% |
IMG OID | 640393137 |
Product | predicted protein |
Protein accession | XP_001387979 |
Protein GI | 126274557 |
COG category | [R] General function prediction only |
COG ID | [COG0661] Predicted unusual protein kinase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCTTTGA GGTCCTTGGT GTCCCATTCG GTGAAAACCA GCAATTTTTC AGGTGGCAAG CTGATCGTGA AAGTATTGTT AGTAGGAGTT GGTTCATGTT TGACAGCCAA TTTCTACTAC AGAGGGATCA GCAACGATGT CCAGGCAATT GAAACCTCAG TCCATGAGTT GTTACCGAAT CCTGGAGCTG AAACCTACGA GTTGGGACTC TACCAGTCCT CGCAGAAAGA AGAACATGAT CGATTGGAAC AGCACAGGTC ACAACGATTG AACCATGACA ATGTCGCTGT TAGAATAATT TATTATGTTA AATATACTAT AGAGGATTAT GTCATCGATC CGCTTGTGTC GGTAGCTCGC TTTGTTGAAT TGTCGTTGAT ATTTCTTCCT GTGCTTGTTG CTTCTCCTCT CTGTTGGTTT GGAAGACGAG ATCTCCGGAC CGGTAAAACG GTTCGCTCGG GTGCGCAGAT CTGGTTTGCG TATTTGAGAT GGTCTGCGGA AGTGGCTGGT GCTTCCTTTG TGAAGTTGGG CCAGTGGGCT GCTTCCCGTA CAGACATATT TCCGAAGGAG ATGTGCGATG AGCTTGGGCA GTTGCACTCC GATGGAACAC CGCATTCCTT GGAGCTGACA AAGAGGATCA TCAGTGGCAG TTTTGACAAT TTGCCGTTTG AAGAGATCTT CGATGCGTTC TTTGAAAAAC CGTTGGGAGT TGGTGCAATC GCCCAGGTAT ATATTGGAAA GTTGTCTGAC AAAGCTTTGT CCAAAGTCAG AAAGACCACC GACTGGGAAA GCCACCTCCA GCAAGAAGAA GATCTTTTGC AGGATGAGCC ATTTTTGGAT AGGATCCTAG TCACAGAACA CAACGATCCA TTGACTTCTA ACCAGTATGT TGCCATCAAG GTGTTGCACC CAAATGTAGA AACCAAGATC AATCGAGACT TGAAGATTAT GAAATTCTTT GCGACCGTCA TCAACGTCAT TCCGACGATG GAGTGGCTCT CTCTTCCGGA TGAAGTAGAG CAGTTCTCGA TGTTGATGCG TTTGCAATTG GATTTGCGTA TAGAAGGGTT GAACTTGGCC AAGTTCAGAC ATAACTTCCG TCACCGTTTG GATATACACT TCCCCAAACC GTACTTGGGA TTCACTACCA GGGATGTTTT GGTGGAAGAG TTCATTCATG CGATTCCGAT GAGCAAAATG CTCTCGTTGC TGGACAACTT CGGCAAGAAC CTATCGAAAG AAATCAGTGA CAAGGGCCTA GATGCGTTCT TGAAAATGTT AATTCTTGAC AATTTTGTTC ATGCTGATTT GCACCCTGGA AATATGATGG TGCGGTTCTA TAAGAATGAA CTTTTCAAGC ACGAGAGGGA GTACAAGATC GTTAAGTCGA GTAACGAAAC AGAAACGAAC AGAATAACTA ACGAATTAAT GAAGTTGGGA GACGACTCAG ATGCGTGGTG TGCCAAATTG AGCGAGTTGT ATGAAGAGGG ATACCATGCA GAGATATGTT TCCTCGATGT AGGGTTGATC ACAGAGTTGA ATCACACTGA TAGAGTAAAT TTTATAGACC TTTTTAAAGC CCTATCTGAG TTTGATGGGT ATAAGGCTGG AGAGTTGATG GTAGAAAGAT CGCGAACTCC CGAAACGGCC ATCAACAAGG AGATTTTTGC CATCAAGGTG GAGAAGTTGG TAGACAGAAT GAAGGAAAGA ACATTTACCT TGGGTAATAT CAGTATTGGA GACTTGTTGG ACCAGGTATT GGGAATGGTT CGTAACCATC ATGTCAGAAT GGAGGGTGAC TTTGTAACGG TTATAGTGGC TATTTTATTA CTAGAAGGAC TTGGAAGACA ATTGGATCCA GAATTGGATT TGTTTGCAAG GTTCGTATAC GCGCTTTTGA TTGGATTTCC CACCGAGAAT GAATTTTCCA TGTTTTTCAT CTAA
|
Protein sequence | MPLRSLVSHS VKTSNFSGGK LIVKVLLVGV GSCLTANFYY RGISNDVQAI ETSVHELLPN PGAETYELGL YQSSQKEEHD RLEQHRSQRL NHDNVAVRII YYVKYTIEDY VIDPLVSVAR FVELSLIFLP VLVASPLCWF GRRDLRTGKT VRSGAQIWFA YLRWSAEVAG ASFVKLGQWA ASRTDIFPKE MCDELGQLHS DGTPHSLELT KRIISGSFDN LPFEEIFDAF FEKPLGVGAI AQVYIGKLSD KALSKVRKTT DWESHLQQEE DLLQDEPFLD RILVTEHNDP LTSNQYVAIK VLHPNVETKI NRDLKIMKFF ATVINVIPTM EWLSLPDEVE QFSMLMRLQL DLRIEGLNLA KFRHNFRHRL DIHFPKPYLG FTTRDVLVEE FIHAIPMSKM LSLLDNFGKN LSKEISDKGL DAFLKMLILD NFVHADLHPG NMMVRFYKNE LFKHEREYKI VKSSNETETN RITNELMKLG DDSDAWCAKL SELYEEGYHA EICFLDVGLI TELNHTDRVN FIDLFKALSE FDGYKAGELM VERSRTPETA INKEIFAIKV EKLVDRMKER TFTLGNISIG DLLDQVLGMV RNHHVRMEGD FVTVIVAILL LEGLGRQLDP ELDLFARFVY ALLIGFPTEN EFSMFFI
|
| |