Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_85313 |
Symbol | |
ID | 4840791 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009047 |
Strand | + |
Start bp | 743165 |
End bp | 745187 |
Gene Length | 2023 bp |
Protein Length | 658 aa |
Translation table | 12 |
GC content | 43% |
IMG OID | 640392106 |
Product | predicted protein |
Protein accession | XP_001386155 |
Protein GI | 150866519 |
COG category | [R] General function prediction only |
COG ID | [COG5210] GTPase-activating protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.653403 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | CCGCCGCTGA CCCAGTCTTC CGTCATGGCA ACCGACCCAG ACATCGTCAT TCTGATGCCT GCTCTCAGTT ACCCCAAAGC AAGAGCCAAA GAAAATGGCA AGCACATACA GAATGACATA CATAATGAAG ACAACGATAA TGAACAAGAC AACAACAGAG ACAACAACAA CAATAACAAC ACTAATGATA ACAGCATAGA TGATATCAAT AACAGCAATG ACGATTTCAA CAACAGCAAT AACGATAACA GTATTGATGA CGATGACGAG TCTACTACTC CCTCATCACC TCCTCGTCGT CGTTTGTCGT ATTTGGACCC TTCGACACCT TCCCGCCATC TCCGGACTCC GTCGCTGTAT TCACACAATT ACTCACTACT TGATATCTAC GACTCTCCTT CATCGGAGCG TTCATCCATG GCTCCGTTAG CTACAGGGGG TCTTATCGAT ACGTCCATTA TGAAAGGGTT GGATGACCTA GAGTTCAAAG AACTTTCTAA CGACACAACA ATTTTCGCCA GGCACACCAT AGATTCCTTC CCTGAACTTG ACATTCTGGA CCAGGAAAGC GAATCTTACG TTCAGCCCAC TGATCAGGAC ACTATTGATC GTATACTAGC CTCACCTTAT GACCGCTACG GATTTAAAAA GACATCTACA CATCACAACA TCTCTTTGGA GGATTACAAC AAATGGTTTT CTGAATATGC TCAGGATGCT ATCAGACGTA AGAAGAAGTG GAATTTGCTT ATGAAAAGCA ACGGTCTTCA ATTAGACTCT GCACAAGCTA TACCTACGAG ATTTCCGCCA AAATCTGACA AAGTCAAAAA GATGATCCGC CAGGGCATCC CAGCAGAATG GAGAGGCTCG GCGTGGTTTT TCTATGCGGG AGGATATGAT AAGCTAAACA AGCACGTAGG TGTATACTCC AAGATTGTTC GTGACACAAA AGATATCCAG AATAAAGATA CAGAGGTTAT AGAGAGAGAT TTAAACAGAA CGTTTCCGGA CAACATATAC TTCAACAGCC ATATTGGTAC TGATGCCAAC ACTTCAGTGA CTACGTTGGG AACAGAAAAT CTGAGATCGT CAGAAACAGC TATGGTAAAA ACGCTCCGTA GAGTACTTGT AGCATTTGCC CATTACCAAC CACAGATTGG CTACTGTCAA TCGCTTAATT TTCTTGCGGG TTTGCTTCTC TTGTTCATGG AAGAAGAGAG AGCCTTCTGG ATGTTGGTAA TCTTGACTGA GAGAATTATT CCCAAGGTTC ATTCTGCTAA CCTTGAAGGG GTACACACAG ACCAAGGTGT GCTTATGCTC TGTGTAAAAG AATACATTCC ACAACTCTGG GCCATATTGG GTAAGAACTT TGAAGGAGAA TCACTCTCTG AGGACAAAAT ACTTACCAGA TTGCCTCCTG TTACGCTTGT GACATCTTCA TGGTTCATGT CTGTGTTTGT GGGCAACTTG CCTATAGAGA CTACCTTGCG AGTCTGGGAT ATCTTGTGGT ACGAAGGCTC CAAGACCATT TTCAGAATCT CACTAACTAT ATGCAAAATG TGTCTTGAGG ACCCTGAATT CCAGAATTCC AGAAGCTCTA AAGGTAGTGG TGAAATGGAC CAAATCGAAC TTTTCCAGTT CATGCAGAAC TACCCCAAGA CGATACTAGA ACCCAACGTA CTAATAGACA ACTGTTTTAA AAAGATCGGC GGGTATGGAT TTGGGTCACT TTCACAAGAT GAAATCAACA AATGCAGAGA ATTTGTTTCT AAACAGAGAG CCAAACTAAA CTACAAGAAA TCCAATATCA CTGCTGAAAT GACCGAGGAA GAGCGCCAGG CTTTAATTTC CAGTTCTGAC AGCACTTTTG GCGCGGAGGA CCAGTCTATA CACGATGTGT ACGGCTTCCA TCGTTCGATC ATGAGTGGAG TAGTGTGGAA TAAGAGCATA AGCAACAAAA TGAAGCGTAG GTTTGTCAGG CGTACGAGCA GCAGATCGTA AAATAGATAC AAACTTGAAG CAC
|
Protein sequence | MATDPDIVIS MPALSYPKAR AKENGKHIQN DIHNEDNDNE QDNNRDNNNN NNTNDNSIDD INNSNDDFNN SNNDNSIDDD DESTTPSSPP RRRLSYLDPS TPSRHLRTPS SYSHNYSLLD IYDSPSSERS SMAPLATGGL IDTSIMKGLD DLEFKELSND TTIFARHTID SFPELDISDQ ESESYVQPTD QDTIDRILAS PYDRYGFKKT STHHNISLED YNKWFSEYAQ DAIRRKKKWN LLMKSNGLQL DSAQAIPTRF PPKSDKVKKM IRQGIPAEWR GSAWFFYAGG YDKLNKHVGV YSKIVRDTKD IQNKDTEVIE RDLNRTFPDN IYFNSHIGTD ANTSVTTLGT ENSRSSETAM VKTLRRVLVA FAHYQPQIGY CQSLNFLAGL LLLFMEEERA FWMLVILTER IIPKVHSANL EGVHTDQGVL MLCVKEYIPQ LWAILGKNFE GESLSEDKIL TRLPPVTLVT SSWFMSVFVG NLPIETTLRV WDILWYEGSK TIFRISLTIC KMCLEDPEFQ NSRSSKGSGE MDQIELFQFM QNYPKTILEP NVLIDNCFKK IGGYGFGSLS QDEINKCREF VSKQRAKLNY KKSNITAEMT EEERQALISS SDSTFGAEDQ SIHDVYGFHR SIMSGVVWNK SISNKMKRRF VRRTSSRS
|
| |