Gene PICST_33002 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_33002 
Symbol 
ID4839949 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009046 
Strand
Start bp1187668 
End bp1190769 
Gene Length3102 bp 
Protein Length1033 aa 
Translation table12 
GC content40% 
IMG OID640391264 
Productpredicted protein 
Protein accessionXP_001385580 
Protein GI150866097 
COG category[V] Defense mechanisms 
COG ID[COG1131] ABC-type multidrug transport system, ATPase component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAACCCT GGGGAACACT CGCTCTAGCT ACTACGCTAG TTTTGCCATG GTTTCCTGTG 
GTTCTGGGAG CAGTACGAGT TGACTCTGAA GCTGAACTAT ATAACTCCTT TGATAGGATA
AAGCGGGGAG CTGTTATAAA CGATGTTTTT GAAATCAAGC AGGCTCTAAT TGACCAACAA
AAAGAGAAGA ACGGTGATAA GGACGAATGT CCACCCTGTT TCAACTGTAA TTTGCCTAAT
TTCGAGTGTG TTCAGTTTTC TGAATGTAAC ATATTCACTG GTAATTGCGG CTGTAGAGAT
GGTTTTGGTG GGGTTGACTG CAGTGAGCCA TTGTGTGGGG CTCTTTCAGA CGGGAACAAC
AATAGACCCG TAAGAAAGAA GGCTACTTGC GAATGCAAAG AGGGCTGGAA GGGTATAAAT
TGTAACATGT GTACTGATGA TTCTGTTTGT GATGCTTTCA TGCCCGATGG TTTGAAGGGG
ACGTGCTATA AGACTGGTAT TATCGTTAAC AAGTTCCACC AGATGTGTGA TGTAACGAAT
CCTAAGATCA TTCTGATTTT GGGCGGAAAG AAGCCTCAAG CCACTTTCAG CTGTAACAAG
ACAGCAGAAA ACTGTAACTT CCAGTTCTGG ATTGACGAGC GGGAGTCCTT CTACTGTGAC
TTGAACAAAT GTGTTTTAGA CTATGACCTA GTAGCAAACA CTACGAAGTA CAACTGTGAA
GAAGTCGCCT GTAAATGTTT GCCTGACAGA ATGTTGTGTG GAGAAGCTGG ATCTATTGAT
ATTTCTGACT TCTTGACAGA AACCATTCGT GGACCAGGTG ACTTCACCTG TGATGTTGCT
GACAGGAAGT GCCGTTTCTC AGAGCCCAGT ATGAACGACT TGATCCAAAG TGTCTTTGGT
GATCCGTACA TCACGTTGAA GTGCAAATCT GGTGAGTGTG TTCATAAAAG TGAGATTCCG
GGCTATGAAG TTCCTGATAG AAATAAGCTT ACTTTGAACA ACATCTTGCT ATTAGCAGGA
GTTGTTTTGG TTACAGCATT ATTGGTAGCA ACTACTATAC ATAACATCCG TCAATCTCCT
TTGTTTAAAG CTGGAATTGG TTCCTTCGAG CCATTAGATG GAGACTCGAG TGCTTTGAAC
AATAACTTTA CTCCTACCAA TATTGCGTTT GAAGGTATCA GCTACAGAGT TAGGAGTGGA
CAGCAGGTTT TGAATAATGT CTCTGGTAAA ATTGAGCCAG GTGAATGTTT GGCCATTATG
GGAGGCTCTG GAGCAGGTAA AACGACTCTC TTAGATATCT TGGCTGGTAA AAACAAGGGC
GGTGAAGTTT ATGGAAGCAT CTATGTCAAC GGCAACATAT TGAACCCAGA CGACTACAAG
AAAATTGTTG GTTTTGTAGA CCAAGAAGAC CATTTGATTC CTACCTTGAC AGTTTATGAA
ACTGTTTTGA ACAGTGCCTT GCTTAGACTT CCAAGGAACA TGACCTTGAG ACAAAAGGAA
TCTAGAGTTA TTGAAGTGTT GAACGAGTTA AGAATTTTAA GTATCAAAGA TAGAGTCATT
GGCTCCAACT TCAAAAGAGG TATTTCGGGA GGTGAAAAGA GAAGAGTTTC TATTGCTTGT
GAAATGGTTA CCTCTCCTTC TATCTTGTTC TTGGACGAGC CCACTTCTGG TCTTGATTCC
TATAATGCCA GAAATGTTGT AGAGTGTTTG GTGAAGTTAT CTAGAGACTT CAACCGCACT
ATTGTATTCA CTATTCATCA ACCAAGAAGT AATATCGTTT CTTTGTTTGA CAAGTTAATT
TTGTTGTCTG AAGGTGATTT GATTTACTCG GGGGATATGA TCAAGTGCAA CGACTTCTTT
GCTAAATATG GTTACCAGTG CCCTTTGGGT TACAATATTG CTGACTACTT GATTGACATC
ACCATCGACC ATAAAAAGAT TGTTAGAGTA CCATCCGAAG ATGAGATTGC AGAAGAAGGA
AGCTCAGAAG GACACGAAGA TATTCACCAA GCTTTTGTTG AAGATACTGC GGGAGAGGTC
GACACTACCA GAGAATGGGA ACACTTTGCT GTTCACAGAG ATGAGTACAA CTACGCACCC
TTAACTAAAA AAGGATCGAA GGATCAAAGC AAGTATATTC AGATCAAGAA TAAGCTTCCT
CAAATCTTTG CTGATTCAGT ATTGGCAATC GAATTACAGA CTGAAATCGA TGAGGCAAAG
AATAACCCTG TTCCTCTTGA CTTAAAGAAT CATATGATGA AGAAGGCTAG CTTCCTTAAT
CAGATCCTTA TCTTGTCTTC CAGAACATTC AAAAACTTGT ACAGAAATCC TAGATTGTTG
TTGACTAACT ATGTCTTGTC TTTGGTGGTT GGTGCATTCT GTGGATATTT GTACTACAGC
GTAGCCAATG ATATTAGTGG ATTCCAGAAT AGATTGGGTC TCTTCTTCTT CGTATTGGCT
TTCTTTGGTT TTTCGGCATT GACTGGTTTA CATTCATTTT CTTCAGAAAG AATCATTTTC
ATCCGTGAAA GAGCAAATAA TTATTACCAT CCATTTGCGT ATTACATCAG TAAGATTGTA
TGTGATATTC TTCCTTTGAG AGTTCTTCCT CCCATCTTGT TGATTAGTAT TGCTTATCCA
TTAGTTGGTT TGACAATGGA ACATAACGGA TTCTTGAAGG CTATGGTCGT GCTAATCTTG
TTCAACGTAG CTGTTGCTGT AGAGATGTTA ATTGTTGGTA TCTTGATCAA AGAGCCAGGT
ACTTCGACTA TGATTGGTGT GTTGATCTTA TTGTTGTCGT TGTTGTTCGC TGGTTTGTTC
ATCAACAGCG AGGATTTGAA GGTTCAAATC AAATGGCTTG AATGGATTTC GCTTTTCCAT
TATGCCTACG AAGCTTTGTC GATCAACGAA GTAAAGGACT TGATATTAAA AGAAAAGAAG
TACGGTTTAT CTATTGAAGT TCCAGGTGCT GTAATCTTGA GTACTTTTGG ATTTGATGTT
GGTGCCTTCT GGAAGGATGT TGCGTTCTTG GGTGGGTTAT CTGGAGCCTT CTTAGTATTG
GGATATCTTT TCTTACACAA TTTTACCATA GAGAAAAGGT AA
 
Protein sequence
MKPWGTLALA TTLVLPWFPV VSGAVRVDSE AELYNSFDRI KRGAVINDVF EIKQALIDQQ 
KEKNGDKDEC PPCFNCNLPN FECVQFSECN IFTGNCGCRD GFGGVDCSEP LCGALSDGNN
NRPVRKKATC ECKEGWKGIN CNMCTDDSVC DAFMPDGLKG TCYKTGIIVN KFHQMCDVTN
PKIISILGGK KPQATFSCNK TAENCNFQFW IDERESFYCD LNKCVLDYDL VANTTKYNCE
EVACKCLPDR MLCGEAGSID ISDFLTETIR GPGDFTCDVA DRKCRFSEPS MNDLIQSVFG
DPYITLKCKS GECVHKSEIP GYEVPDRNKL TLNNILLLAG VVLVTALLVA TTIHNIRQSP
LFKAGIGSFE PLDGDSSALN NNFTPTNIAF EGISYRVRSG QQVLNNVSGK IEPGECLAIM
GGSGAGKTTL LDILAGKNKG GEVYGSIYVN GNILNPDDYK KIVGFVDQED HLIPTLTVYE
TVLNSALLRL PRNMTLRQKE SRVIEVLNEL RILSIKDRVI GSNFKRGISG GEKRRVSIAC
EMVTSPSILF LDEPTSGLDS YNARNVVECL VKLSRDFNRT IVFTIHQPRS NIVSLFDKLI
LLSEGDLIYS GDMIKCNDFF AKYGYQCPLG YNIADYLIDI TIDHKKIVRV PSEDEIAEEG
SSEGHEDIHQ AFVEDTAGEV DTTREWEHFA VHRDEYNYAP LTKKGSKDQS KYIQIKNKLP
QIFADSVLAI ELQTEIDEAK NNPVPLDLKN HMMKKASFLN QILILSSRTF KNLYRNPRLL
LTNYVLSLVV GAFCGYLYYS VANDISGFQN RLGLFFFVLA FFGFSALTGL HSFSSERIIF
IRERANNYYH PFAYYISKIV CDILPLRVLP PILLISIAYP LVGLTMEHNG FLKAMVVLIL
FNVAVAVEML IVGILIKEPG TSTMIGVLIL LLSLLFAGLF INSEDLKVQI KWLEWISLFH
YAYEALSINE VKDLILKEKK YGLSIEVPGA VILSTFGFDV GAFWKDVAFL GGLSGAFLVL
GYLFLHNFTI EKR