Gene PICST_33593 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_33593 
Symbol 
ID4840756 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009047 
Strand
Start bp1055288 
End bp1058566 
Gene Length3279 bp 
Protein Length1092 aa 
Translation table12 
GC content40% 
IMG OID640392071 
Producthypothetical protein 
Protein accessionXP_001386394 
Protein GI150866713 
COG category[R] General function prediction only 
COG ID[COG5271] AAA ATPase containing von Willebrand factor type A (vWA) domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.271146 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCAACA CCGTATTGGA CTACTTTCTC AAGTTGCACA TTTTCAATCT GGAGGTGAAT 
ATTGCCGATC TCGTGGATAT GGCCGTTCAG TTCCAGCGGA TGTTTGAGCT ATCGTTATTG
CCAGAAGACA AGGCAAGACT ATACGACGTT ATCTCGGGAA AATGCGCTTC TTTCAACAGC
TACAGTCTGG AAGATGCTAT TCTCACTCCA CAAGGGGAAT ACAGGCGATT GAATGAGTTC
TTCAAGAACT ACTCACCAAA CGATACGTAT TACTTCGACG AGATACACAT CAATGTCGGA
TCGAAGAACT TCACGGTCTC TGTCTCGGGA AATCTTTCTG CCTCTGGCTT CTTAGATCCG
CTTGTAGTCT CCAATTTACC TCTGAAAAGC TTCAATTACT ACCCTACAGG GTTGTTGAAC
TCGACACATT TACATGCGTA CTTGGCCCGT ATCAACCTCC AAATGAGACT TCGTGGCCAG
AAGATCCTTT TGGTACTCCA CAACAACGTA GTCGTAGACA GTCGCCAGTT CTCCAATATA
GAATTGTTGT ACATCAGCCC AGAAGTAGAC AAGTACTTCC AGAATAATCC AGACAATTGC
ATTAAGTTTC CCGACGAATA TGGACTAAAG TCGTGGATTA GATCCCACAT CATCGACCGG
TTGTCGAAGT TGAAGACTTT CGATAATCCA CTGCTATTGA TCAACTCCAT ATTGTACTCA
TATAGCACTT GCATCTCCAA ACTCTTACAC GATCCTCTAT ATTTCCAGTT CAACCAGTTG
AACTTGATGT CTTCGAACTT TATGGGAAAC AACATGCCTG CCCCTTACTA TGTAAATGAA
AACATCAAGA TAACGGATGG TAACCTCATA GTCATGGATG AAGCCCCTGA AATAGGCCAA
TACTCGGCCA GAGAAGACAA TTTTAAAAAG CTCAACAATT ACAATTACAA CGTAAAAGAT
ATCTACTTCA TTCTTTCAAG TCTTTTACTG TATCAAAATT CTGATATCGG CCATGATTAC
GGTTTTGATG CTAAAGAAAT AATGGACTTC CTCGTGCAAA AAGTACATCC TTTCTTGATA
CAGAGAGAGT TCCCCTTTGT TGGCCATTTC CATCGCTTTT TGGATGAATT CATTAACGAG
TATGCTAGAA TAGGTGCTTC AGACGATACA GATGAAACTG ATAATGCATT TAGCGTGTCG
GAGAAATTCC ATCTTGGTCC ATTGCAAGAA GCAGCCAGAA TTGAGAAAAT TTTGGTATCT
GCTTTTGATC AGGATGGTGA CAACTTAGCC TCGATGCTGT TTCCTCACAT AGATTCTATT
GAAGTGGCCA ATTCTGAAGC ATCCTTCTTG AAGGCTTCTG CTCGTAACCT TTCCAATGGA
AATAGTTTGC CTTTACCAAC TTTAGACGAG TATGATATGT TCACTCCAAA GAAAAGAAGA
AGCCTGACAA ATTCAATTCC TACCATCAAG AAGTCAAAGA TCATTGAAAT AATGAAAAAA
GACAAGTTGA GTTTTGTGAA CTCCGCTCAG GGCTCTGTTT TTTCAGACGA CTCTGATTAT
TCAGAAGATT CATTTTCTAG CAAGATAAAG CAATTCAAAG ACAAGGTGGA TTCAAATGGA
CTGGTACTTA GTCAAAATGG AAACGGTAAA GTCGTTGAAG AGAACGATGC AAATGCTTAT
GAGAACGAAA AATCAGGAGA TGAAGATGAA GAGTCTCTGG AAGAAGAGAA CGATTCTGCG
AAAGAAGAAG AACGTAAAGA TGATGATCGT GAAGAGTATT ACATGCACCG AGAACTAATC
AACCCTGTAG GCGGTGATGA GAATGGGGAC ATTGAAGAAG AAGTATATGG TCAAGATGAT
GACATTGAAA TGGAAAAGGT GCAACAAGAA GAAACAATCA GTATACAACC AGAAGCGAAA
GCATCCTCTA TTCAGATGGA AGTTGAGCCA AATAGCACTG AAGAATCGTC TAACTCAGAT
AGCAACAGTT CTTCCGATAG CAATGAAGTT GCAGACTCAA ATAATGGAGA GAAAACATAT
ATTAAGGAAG CCCATATGAA TGAACCTACT TCTAGGGAAG ACAATCCTAG CGATAGTTCC
GAATCTCAAT CTTCAGATTC CAGTGAATCA GATGCAGAAG ACAGCGCAGA AGAAGAAGAA
ACACACCCAG AGTCAAATTC TAAAGTTCCA TTATCTTCTT CAACAATATC TCCATCTCAA
CTTCCAATTC CAGTTCCGAA TCCAGTTCCG GCTCCAGTCA AGACGAGCTC AATTTCTGCA
ACTCCTTCAA CGCAGATTAA GTTACCAAGG AGGTCATTGG GAACTTTATC TTCGTTGGTT
CCGCCTAAAT CTACAGTTGA ATCCAAAGAT ACGCCTAAGA CAAATTCTGA TAGGTCCAAA
TTCAAGGGAA AGCTTGATTT CAGTTCAGAT GACTCTTTTT CTTCTGTAGA CAGCGATAGT
TCAGATGAGT CTAATGATGA TAGAAACGTA TCACAGCTAG CACAAGTTAA AGTTACGTCA
ACCAAAGCTG TACCAAAGAC AGATTCTACA AAATCAACAA CTATTACAAA TACAATACGT
TCAAAACCTT CTTCTATTCC AGCAAATTCA ACAGCCCTTA CGAAACATAT ATCTACGAAG
GCTGTACTTT CTAAACCAGT TCTGCCGTTG TTAAAAAAGG TATTAGATGC GGGACTGAGT
CGTGATCCAA AATTGGCACC ATCGGCAATT CTTGAACATG TTTCCAAGCC TAAAGAAAAG
AGAATAAACC CAAGTCAATT AAAGAGTATT GAAAGAAGAC AATATGATTC TTCGTCAGAG
GACGAAATGT CGGATCCTAT AAACGAAAGC TCTGACGAAT CGGAAGACGA ACTCTTTGGT
ACTCTGCCGA ATGTTTCTAC TCTGACTCAA CGTACGCAAA CACAAAATCT GACTCAAATT
CAGATTCAAA CATCGAATTC TACTACAAAA TCTCCTGTTA GGAAACCAGG CAGTCCATTA
CAATTTAGCT TTGATAGAAA TTCTCCTGGT CAAAGTGCAT TATCACCTCG TGAAAAGTTA
CAATTCTCGT CACAGTCGAC TCCAGCATCG GCTGGAAAAG GTAGACCTAC AATTCTATCA
ATGAAGTTGA AGGAACTCGA AGCACGCCAA GCAAGTCCAT CGAAGAAAGA AATACCTAAA
AAGGGCCAAA GGCAAACCTC TCAAGCAAAG GAAAGGCAAT TATCGGCATC TGATTCTGAT
AGCAGCTCTG AAGACTCAGA CGATTCGATC GGATTCTAA
 
Protein sequence
MANTVLDYFL KLHIFNSEVN IADLVDMAVQ FQRMFELSLL PEDKARLYDV ISGKCASFNS 
YSSEDAILTP QGEYRRLNEF FKNYSPNDTY YFDEIHINVG SKNFTVSVSG NLSASGFLDP
LVVSNLPSKS FNYYPTGLLN STHLHAYLAR INLQMRLRGQ KILLVLHNNV VVDSRQFSNI
ELLYISPEVD KYFQNNPDNC IKFPDEYGLK SWIRSHIIDR LSKLKTFDNP SLLINSILYS
YSTCISKLLH DPLYFQFNQL NLMSSNFMGN NMPAPYYVNE NIKITDGNLI VMDEAPEIGQ
YSAREDNFKK LNNYNYNVKD IYFILSSLLS YQNSDIGHDY GFDAKEIMDF LVQKVHPFLI
QREFPFVGHF HRFLDEFINE YARIGASDDT DETDNAFSVS EKFHLGPLQE AARIEKILVS
AFDQDGDNLA SMSFPHIDSI EVANSEASFL KASARNLSNG NSLPLPTLDE YDMFTPKKRR
SSTNSIPTIK KSKIIEIMKK DKLSFVNSAQ GSVFSDDSDY SEDSFSSKIK QFKDKVDSNG
SVLSQNGNGK VVEENDANAY ENEKSGDEDE ESSEEENDSA KEEERKDDDR EEYYMHRELI
NPVGGDENGD IEEEVYGQDD DIEMEKVQQE ETISIQPEAK ASSIQMEVEP NSTEESSNSD
SNSSSDSNEV ADSNNGEKTY IKEAHMNEPT SREDNPSDSS ESQSSDSSES DAEDSAEEEE
THPESNSKVP LSSSTISPSQ LPIPVPNPVP APVKTSSISA TPSTQIKLPR RSLGTLSSLV
PPKSTVESKD TPKTNSDRSK FKGKLDFSSD DSFSSVDSDS SDESNDDRNV SQLAQVKVTS
TKAVPKTDST KSTTITNTIR SKPSSIPANS TALTKHISTK AVLSKPVSPL LKKVLDAGSS
RDPKLAPSAI LEHVSKPKEK RINPSQLKSI ERRQYDSSSE DEMSDPINES SDESEDELFG
TSPNVSTSTQ RTQTQNSTQI QIQTSNSTTK SPVRKPGSPL QFSFDRNSPG QSALSPREKL
QFSSQSTPAS AGKGRPTILS MKLKELEARQ ASPSKKEIPK KGQRQTSQAK ERQLSASDSD
SSSEDSDDSI GF