Gene Information |
Locus tag | PICST_33593 |
Symbol | |
ID | 4840756 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009047 |
Strand | - |
Start bp | 1055288 |
End bp | 1058566 |
Gene Length | 3279 bp |
Protein Length | 1092 aa |
Translation table | 12 |
GC content | 40% |
IMG OID | 640392071 |
Product | hypothetical protein |
Protein accession | XP_001386394 |
Protein GI | 150866713 |
COG category | [R] General function prediction only |
COG ID | [COG5271] AAA ATPase containing von Willebrand factor type A (vWA) domain |
TIGRFAM ID | |
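The length fields above follow from the coordinates by simple arithmetic. The sketch below shows one way to check them, assuming the coordinates are 1-based and inclusive, the gene is unspliced (as the record states), and the CDS includes its stop codon. The function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Expected protein length for an unspliced CDS that ends in a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length_bp = gene_length(1055288, 1058566)
print(length_bp, protein_length(length_bp))  # prints: 3279 1092
```

Both results agree with the Gene Length (3279 bp) and Protein Length (1092 aa) fields.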
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.271146 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGGCCAACA CCGTATTGGA CTACTTTCTC AAGTTGCACA TTTTCAATCT GGAGGTGAAT ATTGCCGATC TCGTGGATAT GGCCGTTCAG TTCCAGCGGA TGTTTGAGCT ATCGTTATTG CCAGAAGACA AGGCAAGACT ATACGACGTT ATCTCGGGAA AATGCGCTTC TTTCAACAGC TACAGTCTGG AAGATGCTAT TCTCACTCCA CAAGGGGAAT ACAGGCGATT GAATGAGTTC TTCAAGAACT ACTCACCAAA CGATACGTAT TACTTCGACG AGATACACAT CAATGTCGGA TCGAAGAACT TCACGGTCTC TGTCTCGGGA AATCTTTCTG CCTCTGGCTT CTTAGATCCG CTTGTAGTCT CCAATTTACC TCTGAAAAGC TTCAATTACT ACCCTACAGG GTTGTTGAAC TCGACACATT TACATGCGTA CTTGGCCCGT ATCAACCTCC AAATGAGACT TCGTGGCCAG AAGATCCTTT TGGTACTCCA CAACAACGTA GTCGTAGACA GTCGCCAGTT CTCCAATATA GAATTGTTGT ACATCAGCCC AGAAGTAGAC AAGTACTTCC AGAATAATCC AGACAATTGC ATTAAGTTTC CCGACGAATA TGGACTAAAG TCGTGGATTA GATCCCACAT CATCGACCGG TTGTCGAAGT TGAAGACTTT CGATAATCCA CTGCTATTGA TCAACTCCAT ATTGTACTCA TATAGCACTT GCATCTCCAA ACTCTTACAC GATCCTCTAT ATTTCCAGTT CAACCAGTTG AACTTGATGT CTTCGAACTT TATGGGAAAC AACATGCCTG CCCCTTACTA TGTAAATGAA AACATCAAGA TAACGGATGG TAACCTCATA GTCATGGATG AAGCCCCTGA AATAGGCCAA TACTCGGCCA GAGAAGACAA TTTTAAAAAG CTCAACAATT ACAATTACAA CGTAAAAGAT ATCTACTTCA TTCTTTCAAG TCTTTTACTG TATCAAAATT CTGATATCGG CCATGATTAC GGTTTTGATG CTAAAGAAAT AATGGACTTC CTCGTGCAAA AAGTACATCC TTTCTTGATA CAGAGAGAGT TCCCCTTTGT TGGCCATTTC CATCGCTTTT TGGATGAATT CATTAACGAG TATGCTAGAA TAGGTGCTTC AGACGATACA GATGAAACTG ATAATGCATT TAGCGTGTCG GAGAAATTCC ATCTTGGTCC ATTGCAAGAA GCAGCCAGAA TTGAGAAAAT TTTGGTATCT GCTTTTGATC AGGATGGTGA CAACTTAGCC TCGATGCTGT TTCCTCACAT AGATTCTATT GAAGTGGCCA ATTCTGAAGC ATCCTTCTTG AAGGCTTCTG CTCGTAACCT TTCCAATGGA AATAGTTTGC CTTTACCAAC TTTAGACGAG TATGATATGT TCACTCCAAA GAAAAGAAGA AGCCTGACAA ATTCAATTCC TACCATCAAG AAGTCAAAGA TCATTGAAAT AATGAAAAAA GACAAGTTGA GTTTTGTGAA CTCCGCTCAG GGCTCTGTTT TTTCAGACGA CTCTGATTAT TCAGAAGATT CATTTTCTAG CAAGATAAAG CAATTCAAAG ACAAGGTGGA TTCAAATGGA CTGGTACTTA GTCAAAATGG AAACGGTAAA GTCGTTGAAG AGAACGATGC AAATGCTTAT GAGAACGAAA AATCAGGAGA TGAAGATGAA GAGTCTCTGG AAGAAGAGAA CGATTCTGCG AAAGAAGAAG AACGTAAAGA TGATGATCGT GAAGAGTATT ACATGCACCG AGAACTAATC 
AACCCTGTAG GCGGTGATGA GAATGGGGAC ATTGAAGAAG AAGTATATGG TCAAGATGAT GACATTGAAA TGGAAAAGGT GCAACAAGAA GAAACAATCA GTATACAACC AGAAGCGAAA GCATCCTCTA TTCAGATGGA AGTTGAGCCA AATAGCACTG AAGAATCGTC TAACTCAGAT AGCAACAGTT CTTCCGATAG CAATGAAGTT GCAGACTCAA ATAATGGAGA GAAAACATAT ATTAAGGAAG CCCATATGAA TGAACCTACT TCTAGGGAAG ACAATCCTAG CGATAGTTCC GAATCTCAAT CTTCAGATTC CAGTGAATCA GATGCAGAAG ACAGCGCAGA AGAAGAAGAA ACACACCCAG AGTCAAATTC TAAAGTTCCA TTATCTTCTT CAACAATATC TCCATCTCAA CTTCCAATTC CAGTTCCGAA TCCAGTTCCG GCTCCAGTCA AGACGAGCTC AATTTCTGCA ACTCCTTCAA CGCAGATTAA GTTACCAAGG AGGTCATTGG GAACTTTATC TTCGTTGGTT CCGCCTAAAT CTACAGTTGA ATCCAAAGAT ACGCCTAAGA CAAATTCTGA TAGGTCCAAA TTCAAGGGAA AGCTTGATTT CAGTTCAGAT GACTCTTTTT CTTCTGTAGA CAGCGATAGT TCAGATGAGT CTAATGATGA TAGAAACGTA TCACAGCTAG CACAAGTTAA AGTTACGTCA ACCAAAGCTG TACCAAAGAC AGATTCTACA AAATCAACAA CTATTACAAA TACAATACGT TCAAAACCTT CTTCTATTCC AGCAAATTCA ACAGCCCTTA CGAAACATAT ATCTACGAAG GCTGTACTTT CTAAACCAGT TCTGCCGTTG TTAAAAAAGG TATTAGATGC GGGACTGAGT CGTGATCCAA AATTGGCACC ATCGGCAATT CTTGAACATG TTTCCAAGCC TAAAGAAAAG AGAATAAACC CAAGTCAATT AAAGAGTATT GAAAGAAGAC AATATGATTC TTCGTCAGAG GACGAAATGT CGGATCCTAT AAACGAAAGC TCTGACGAAT CGGAAGACGA ACTCTTTGGT ACTCTGCCGA ATGTTTCTAC TCTGACTCAA CGTACGCAAA CACAAAATCT GACTCAAATT CAGATTCAAA CATCGAATTC TACTACAAAA TCTCCTGTTA GGAAACCAGG CAGTCCATTA CAATTTAGCT TTGATAGAAA TTCTCCTGGT CAAAGTGCAT TATCACCTCG TGAAAAGTTA CAATTCTCGT CACAGTCGAC TCCAGCATCG GCTGGAAAAG GTAGACCTAC AATTCTATCA ATGAAGTTGA AGGAACTCGA AGCACGCCAA GCAAGTCCAT CGAAGAAAGA AATACCTAAA AAGGGCCAAA GGCAAACCTC TCAAGCAAAG GAAAGGCAAT TATCGGCATC TGATTCTGAT AGCAGCTCTG AAGACTCAGA CGATTCGATC GGATTCTAA
Protein sequence | MANTVLDYFL KLHIFNSEVN IADLVDMAVQ FQRMFELSLL PEDKARLYDV ISGKCASFNS YSSEDAILTP QGEYRRLNEF FKNYSPNDTY YFDEIHINVG SKNFTVSVSG NLSASGFLDP LVVSNLPSKS FNYYPTGLLN STHLHAYLAR INLQMRLRGQ KILLVLHNNV VVDSRQFSNI ELLYISPEVD KYFQNNPDNC IKFPDEYGLK SWIRSHIIDR LSKLKTFDNP SLLINSILYS YSTCISKLLH DPLYFQFNQL NLMSSNFMGN NMPAPYYVNE NIKITDGNLI VMDEAPEIGQ YSAREDNFKK LNNYNYNVKD IYFILSSLLS YQNSDIGHDY GFDAKEIMDF LVQKVHPFLI QREFPFVGHF HRFLDEFINE YARIGASDDT DETDNAFSVS EKFHLGPLQE AARIEKILVS AFDQDGDNLA SMSFPHIDSI EVANSEASFL KASARNLSNG NSLPLPTLDE YDMFTPKKRR SSTNSIPTIK KSKIIEIMKK DKLSFVNSAQ GSVFSDDSDY SEDSFSSKIK QFKDKVDSNG SVLSQNGNGK VVEENDANAY ENEKSGDEDE ESSEEENDSA KEEERKDDDR EEYYMHRELI NPVGGDENGD IEEEVYGQDD DIEMEKVQQE ETISIQPEAK ASSIQMEVEP NSTEESSNSD SNSSSDSNEV ADSNNGEKTY IKEAHMNEPT SREDNPSDSS ESQSSDSSES DAEDSAEEEE THPESNSKVP LSSSTISPSQ LPIPVPNPVP APVKTSSISA TPSTQIKLPR RSLGTLSSLV PPKSTVESKD TPKTNSDRSK FKGKLDFSSD DSFSSVDSDS SDESNDDRNV SQLAQVKVTS TKAVPKTDST KSTTITNTIR SKPSSIPANS TALTKHISTK AVLSKPVSPL LKKVLDAGSS RDPKLAPSAI LEHVSKPKEK RINPSQLKSI ERRQYDSSSE DEMSDPINES SDESEDELFG TSPNVSTSTQ RTQTQNSTQI QIQTSNSTTK SPVRKPGSPL QFSFDRNSPG QSALSPREKL QFSSQSTPAS AGKGRPTILS MKLKELEARQ ASPSKKEIPK KGQRQTSQAK ERQLSASDSD SSSEDSDDSI GF
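The record lists translation table 12 (the NCBI "Alternative Yeast Nuclear Code"), which differs from the standard code in reading CTG as serine rather than leucine. This is why the protein begins MANTVLDYFL KLHIFNSEVN even though the eleventh to twentieth codons include CTG. A minimal sketch of that translation follows. The helper names are hypothetical, and the codon table is built by hand here rather than taken from a library.

```python
BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
STANDARD_AAS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def make_codon_table(ctg_as_serine: bool) -> dict:
    """Build a codon -> amino acid map; optionally apply the table 12 change."""
    codons = [a + b + c for a in BASES for b in BASES for c in BASES]
    table = dict(zip(codons, STANDARD_AAS))
    if ctg_as_serine:
        table["CTG"] = "S"  # the only sense-codon difference in table 12
    return table


def translate(dna: str, ctg_as_serine: bool = True) -> str:
    """Translate a CDS, ignoring whitespace and stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    table = make_codon_table(ctg_as_serine)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = table[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 60 nt of the gene sequence above, in the record's 10-character blocks.
head = "ATGGCCAACA CCGTATTGGA CTACTTTCTC AAGTTGCACA TTTTCAATCT GGAGGTGAAT"
print(translate(head))                       # prints: MANTVLDYFLKLHIFNSEVN
print(translate(head, ctg_as_serine=False))  # prints: MANTVLDYFLKLHIFNLEVN
```

The first output matches the start of the listed protein sequence. The second shows the leucine that the standard code would place at residue 17.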