Gene PICST_33076 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_33076 
Symbol 
ID4840250 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009046 
Strand
Start bp1375821 
End bp1377767 
Gene Length1947 bp 
Protein Length648 aa 
Translation table12 
GC content38% 
IMG OID640391565 
Productpredicted protein 
Protein accessionXP_001385960 
Protein GI150866382 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.759857 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATATGT CGTCATTTGA TGTAGGAACT GGGTCAGTTC AAAAACCAAG GAAGACCCAG 
CGAGCTCCCA AAAGTTGTTA CCAGTGCTCC AAAAAACGTG TGAAGTGCAA CAAACAAATA
CCTTGCCAAA ACTGCATCAA GCGGGGACAA GAATGCTTTC AGGAGGCCGT AATTGTCAAA
GGAGTCATTT TAAACGACAC CAAGTTCGAC CTAACTGAAA AATTGAAAAC AGAGAACGAA
TTCTTGCATG AGAAAATCCA ACGCTTGGAA GCCAAGCTAT CCCGACAGGA TGTAAAACTG
ATGCAATCAA TGGGATCCGT TGTTAATAGA GATTATGTAG ACAAGCTTGG TACCGGGGCT
CGATTGGTTT CTCGTGATCT TTTACCAGGC TCAGATGTTA TCGACACAGA TACAGAGAGA
CTAACCAGTG CAAAATTAGA AAAACTATCG AGGTTCGTCA CCCGAGATGT GTCAAGGAAA
TTGGTAGAGT TCAATTTGGA GAATCTTTAT TTGGTTCATT CGGCAGTACA TCCAAATTCC
TTTCTAAAGG AGCATGAATT ATATTGGAAC GACAATTCAA GACCTAAGCA TTTGAACTAC
GAGGTTAACC TCTCACAAAA CCAGTATTTA TGGATGGCTA TTTGGTATGC GATGATAAGC
GGCGCACTAT ACACTTTGGA CACTGATTTA GAGCTGTATT TGGGCTTAAC TTCGGAGGGA
TATTTTGAGA TGGCTAAAAT TTCTTCTCTT GTATCTCTAG AGTGTCTACA TAGAGGGCAG
TTTTTGAGAA TTCCTAATAT TCGTTCAATC CAGGCATTTT GCGTATTAGC TTCATGCTTC
CATGGTTTTT CCGGAATACA CTTACAGAAC TCTTTACTAT CTTGCATGAT ATACATTGGC
CAATCCTTGA ACTTACACAG GCTTAGCTTG CTGCAAGCAG AAAGTTTAGT AGACTATGAA
GTATCTTGCA GATTGTGGTG GATACTTGTT GTTATAGATT TCCTCGAGGA TGTCCATAGG
CAAACAATTC TTTCAGATAA TTTCCAAACA CCAATACCAA GAAATATCAG TGAAGATGAT
CTCAATTCTG GAGATTTAAA CGTTACAGAG ACCGACGAAT TTACATGTAT TACTTACAAT
CAGATGATCA TGAAATTATC AAGAATAAAA AAGAGTTTGT ATTATGAGGA TAACGCAGAA
ACTAGCAAGT TCACCTTCAA CCAATTGAAC TTAGCAGATT TGGAATTGTT AAAGTTGCAA
AGCACGATTT CAAACCAAAT TTTGAAACTC AAAGATCCAA AGCGATCAAC TAGATTTGCC
ATATTTTTGA CGGAAGTGAA ATTAGCTCAT GAAAGATTAC TTGTCAATAG AATGGTTATT
AGCCATGTAA GCAAGGAAAA ATGGTTATCG GAATATAGAT ACAAATGTGT ATCTTTCGCT
ATAACGGTGA TCAGCAAATT TAACGACAAG AGCCTACCTT TTTATTTCAA AAAGTACTGG
ATGACTAGTG AACATTCGAT AAATGCAATA GTGTTCTTAA TCTTGGACCT AGTATTGCAC
CAACTGCCCA GAGGTGAGCG TTCCTACAGA CTAAAATTGA TCAACGAATG TATTAATATA
TTGGTGTTGC TAAAGAGAAC TCATACGACT GTTTCTCGGG GATTGAGAAT AGTGGAAGCT
TTGCTCATAA TGTTACAACA AAGCGGCCGT TCAAAATTTA CTAATATGTC AGAGACTGCA
GAGATCAGCA ACACCATTAG CGCCTTAAAA TCGACACCTC GTATTTACGG TAGTGTTGAA
GTTAAGGGCC CATTAAAACC AGAGAATGAA CAATTGATAT TCAAAAATTA TGATGAAAGT
TCAGAGACTA TTTTAAATGA CTTGTTGCAA GACAATAATT GGCAGCAGTT TCTTGAATGG
ATTAATTCAA ATAGTATGAA GCAATGA
 
Protein sequence
MNMSSFDVGT GSVQKPRKTQ RAPKSCYQCS KKRVKCNKQI PCQNCIKRGQ ECFQEAVIVK 
GVILNDTKFD LTEKLKTENE FLHEKIQRLE AKLSRQDVKS MQSMGSVVNR DYVDKLGTGA
RLVSRDLLPG SDVIDTDTER LTSAKLEKLS RFVTRDVSRK LVEFNLENLY LVHSAVHPNS
FLKEHELYWN DNSRPKHLNY EVNLSQNQYL WMAIWYAMIS GALYTLDTDL ESYLGLTSEG
YFEMAKISSL VSLECLHRGQ FLRIPNIRSI QAFCVLASCF HGFSGIHLQN SLLSCMIYIG
QSLNLHRLSL SQAESLVDYE VSCRLWWILV VIDFLEDVHR QTILSDNFQT PIPRNISEDD
LNSGDLNVTE TDEFTCITYN QMIMKLSRIK KSLYYEDNAE TSKFTFNQLN LADLELLKLQ
STISNQILKL KDPKRSTRFA IFLTEVKLAH ERLLVNRMVI SHVSKEKWLS EYRYKCVSFA
ITVISKFNDK SLPFYFKKYW MTSEHSINAI VFLILDLVLH QSPRGERSYR LKLINECINI
LVLLKRTHTT VSRGLRIVEA LLIMLQQSGR SKFTNMSETA EISNTISALK STPRIYGSVE
VKGPLKPENE QLIFKNYDES SETILNDLLQ DNNWQQFLEW INSNSMKQ