Gene PICST_32574 details

Gene Information

Locus tag: PICST_32574
Symbol:
ID: 4839882
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009046
Strand:
Start bp: 135370
End bp: 138315
Gene Length: 2946 bp
Protein Length: 981 aa
Translation table: 12
GC content: 39%
IMG OID: 640391197
Product: predicted protein
Protein accession: XP_001385355
Protein GI: 150865937
COG category:
COG ID:
TIGRFAM ID:
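
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes that the start and end positions are 1-based and inclusive and that the CDS is complete and ends in a single stop codon.

```python
def gene_length_bp(start_bp, end_bp):
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp):
    """Residues encoded by a complete CDS that ends in one stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon is the stop codon and contributes no residue.
    return cds_length_bp // 3 - 1


print(gene_length_bp(135370, 138315))  # 2946
print(protein_length_aa(2946))         # 981
```

Both results match the Gene Length and Protein Length fields of this record.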


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAGGGT TGAATCAATT CATCGTAGAT ATCCGCAATT CCAAAGATAT CGAAGAAGAA 
AAGAAGCGAA TCAATTTGGA GCTTAATAAC ATTCAGTCCA AATTCAACTC CAACATCAAC
AGCTACCAGA AGAAAAAGTA TGTCTGCAAG TTGATCTACA TCTACTTGAG TGGCTATGCA
GATTCAGTCG ATTTTGGTTT AAAGGAGTCT TTCCAGCTTG TGTGCTCCTC CAGCCATTCC
GAAAAGCAGT TGGGCTATTT GGCACTTCTG GTTCTCATCA ATAACGACAA ATCCACGCAA
TCCACGCGTG ACTTCTTGGA CTCTTTGTTG GACCAAGTGC ATCTGTATTT GATCAAGGAT
CTTCAATCGT CCAACGAAGA CACGAACTGT CTTGCTGTTC AATTCATTGC TTCCAACTTC
AACTTACCAG AATCCACGAC TGTCAGAGTT AACGAAGCTG ACGAGTCGGC TCCAAAATGG
TTGGAGTTGA TAGATATCGT CTACTCGTTT GTCACGTCTC CTATCCACAA GTCTGTTATC
AAAAAGAAAG CAGCTATTGC ATTGTACTCC TTGTTGAAGT TATATCCCCA GGTTTTGATC
TCTAACAACA ACTGGATTCC TAGATTGCTA TCTCTTGCTG ACGACAAGGA TTACGGCGTA
TCCATTGCCA GCATCCCCTT GATACAATTC GTAGTCAAGC TGAAGCCTCA GTTTGTGAAA
GCGATTATTC CGGCTATCTC TTTGAAATTG TACAATATCA TCATAGAGAA TAAGTGTCCA
GAAGAATACT ACTATTACAA GTCCCCAGCT CCTTGGTTGG TAGTCAAGCT CTTACAGTTG
ATAGAATACT TCTTTTTCTT GAGTGACACA AACGACTACG CTGTTTTGTC AATAGCGGAT
TTAGACGAAC AAACTCTTAA CAACTTGAGA CTGGTGGTAG CCCAATCGAT TCAAAATGCA
TCTCAGCCTA TAAAGGGTTT GCCCAATAGA AACTCGCAAA GTTCAACCTT GTTTCAAGCA
GTGTCGTTGG CAGTTTTCTT GGACGCCTCT TCAGATGCCA TCAATGGAGC TATCAATGCT
CTTATGATGT TGCTTACTTC AAACGAGACA AATACAAGAT ACCTTGCTCT AGACGCTCTC
ATCAAACTCA CAGCAAGACT GACTTCAAAC AATTTGTCAG CTTCACCCTC CATCGACGAA
AAGTACACCA AGATATTCAA ATTATTGTAT GACAGAGACA TTTCTGTTAG AAGAAAGTCG
TTGGACTTAC TCTACACCAT TACTAATGCT TCGAGTTACA GTATGGTTAT AACTAAATTG
CTAGACTACT TTCCTTTATG TGACTTCACC TTAAAACCCG AATTGGCTAT TAAAATCGCC
GTTTTGGCCG AAAAATTTGC TACAGACTCC ACCTGGTATG TCACCACCAT GTTGAAATTA
TTATCCATTA GTGGAGGCGT CAATTCCAAT GGAACGAATT ACATTGGGAA TGAAGTATGG
GAGAGGATTG TTCAGATTAT CGTCAACAAC GAAGACTTAC AAAAGAAAAC ATCAAAATTG
ATTATTAACT TATTGAAAAA ACCATTTTCT TCTACTGACA ACACTCCAAT TGCTCTTTCT
GAAAATCTTA TTAAAGTAGC CGCTTTTGTA CTTGGGGAAT ATGGAGACCA AGTTACTTAT
ATGTCTGAAC TTAGTACCAA ATTACAGTTT CATTTGCTTT ATGATGCCTA CTTTAAGGTG
TCTTTGACTA CAAGGGCCAT GTTGTTAACA ACTTTCTTAA AGTTCTTCGT TAAGTATCCA
GATGAAGATT TCATTCCTGA AATTATGGAT TTATTTGAGA TTGAAGAACT TTCATTGGAT
TTAGAAATAC AGACCAGAGC TCATGAATAT CTTACATTGG CTACGCATAA GTATAGTCAG
CAACTTTTCA AAGAAGTTTT GAAACCAATG CCTATTTTTG TTAAGAAAGA AAGTCATTTG
ATGGACAGAA TAGGCAGTGT TAGTCACATT GTAGGTGTTA ACAGATCCAA GTCACTAGTT
TTAGCTAAAA ACATTAGCAG TAACAAGTCA AAAGCAGCTA GAGGAATTGA TTCTAGTCCT
ATTCTTGACG AGAACTCTGA TGGATCGAAT CCATTTGAAG AAGAATCGAA GCCAGTTGTT
CTTTCTCCCA ACTGGTATTC AGGCTACCAC AGAATGTTGC ATTACGATGC AGGTATCTTT
TATGAAGATC AGCTCATCAA GATCACTTAT AGAGTTATCA AGGAAGGCTG TGCCTTGACA
TTAAAACTTA CGATCATCAA CAATTCTGCC AAAACTGCAG GTACAGATAT TACAGGGTTA
ACAGTATTGA ATCTAGAGAG TTTAACTGAT GACCATGACC CAAATTACGT TCTCAACTTA
AAGCAACTCC CTGAATCCAC ATTTCACGAT AAAGCCAACA TGGAGATCTC AGTCAAAATA
AGAAACGTAG TGGAAAACCA CGAGAGTCCA ATCTTATCGA TCACATTCAT GTGTGGTGGA
TCATTTAACA CCCTAAATTT GAAGTTCCCT GTATTATTGT TGAAGACATT AACTTCAACG
GCCTTAAACG GGTTGGATGA ATTCAACAGA CGTTGGGCTC AAATCGGAGA GTTATTGGGC
CCTCAAGGAG AGTCTTCACA AGCTGTCAAT CTTACTCACA GGTACAACTC TTCAAATATA
GTTAGACTTT TGTCCAGATT AGGCTTTGCA GTCGTACATG CAACACTGGA TGAAACCGAT
AACACTATTC TAGTGATGGG CGCAGGTATC TTGCATACGC AGAAGACTAA CTACGGGGTT
TTGGCTACAT TGAAAAGTAC AGATCAAGTG GGAAAAGAGT TTGAGGTTGC AATCAGATGT
TCAGGCGGGG GAGTTGCCGA GGTTGTGGCT ATTACGATGA AGGAGATTTT AGAAGGGAAG
TTCTGA
 
Protein sequence
MKGLNQFIVD IRNSKDIEEE KKRINLELNN IQSKFNSNIN SYQKKKYVCK LIYIYLSGYA 
DSVDFGLKES FQLVCSSSHS EKQLGYLALS VLINNDKSTQ STRDFLDSLL DQVHSYLIKD
LQSSNEDTNC LAVQFIASNF NLPESTTVRV NEADESAPKW LELIDIVYSF VTSPIHKSVI
KKKAAIALYS LLKLYPQVLI SNNNWIPRLL SLADDKDYGV SIASIPLIQF VVKSKPQFVK
AIIPAISLKL YNIIIENKCP EEYYYYKSPA PWLVVKLLQL IEYFFFLSDT NDYAVLSIAD
LDEQTLNNLR SVVAQSIQNA SQPIKGLPNR NSQSSTLFQA VSLAVFLDAS SDAINGAINA
LMMLLTSNET NTRYLALDAL IKLTARSTSN NLSASPSIDE KYTKIFKLLY DRDISVRRKS
LDLLYTITNA SSYSMVITKL LDYFPLCDFT LKPELAIKIA VLAEKFATDS TWYVTTMLKL
LSISGGVNSN GTNYIGNEVW ERIVQIIVNN EDLQKKTSKL IINLLKKPFS STDNTPIALS
ENLIKVAAFV LGEYGDQVTY MSELSTKLQF HLLYDAYFKV SLTTRAMLLT TFLKFFVKYP
DEDFIPEIMD LFEIEELSLD LEIQTRAHEY LTLATHKYSQ QLFKEVLKPM PIFVKKESHL
MDRIGSVSHI VGVNRSKSLV LAKNISSNKS KAARGIDSSP ILDENSDGSN PFEEESKPVV
LSPNWYSGYH RMLHYDAGIF YEDQLIKITY RVIKEGCALT LKLTIINNSA KTAGTDITGL
TVLNLESLTD DHDPNYVLNL KQLPESTFHD KANMEISVKI RNVVENHESP ILSITFMCGG
SFNTLNLKFP VLLLKTLTST ALNGLDEFNR RWAQIGELLG PQGESSQAVN LTHRYNSSNI
VRLLSRLGFA VVHATSDETD NTILVMGAGI LHTQKTNYGV LATLKSTDQV GKEFEVAIRC
SGGGVAEVVA ITMKEILEGK F
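
The record lists translation table 12, the NCBI alternative yeast nuclear code, in which the codon CTG is read as serine rather than leucine. This is why some CTG codons in the gene sequence correspond to S in the protein sequence. The following is a minimal sketch of such a translation, not the database's own pipeline: the names `build_table`, `translate`, `TABLE_1` and `TABLE_12` are hypothetical, and the input is assumed to be in frame and to contain only A, C, G and T.

```python
BASES = "TCAG"  # NCBI codon ordering: first, second, third base each cycle T, C, A, G

# Amino acids for the 64 codons, one row per first base (T, C, A, G).
STANDARD_AAS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
# Table 12 differs from the standard code at one codon: CTG -> S.
YEAST_ALT_AAS = (
    "FFLLSSSSYY**CC*W"
    "LLLSPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)


def build_table(aas):
    """Map each codon to its amino acid, using the T, C, A, G ordering."""
    assert len(aas) == 64
    codons = [a + b + c for a in BASES for b in BASES for c in BASES]
    return dict(zip(codons, aas))


TABLE_1 = build_table(STANDARD_AAS)
TABLE_12 = build_table(YEAST_ALT_AAS)


def translate(dna, table):
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = table[dna[i:i + 3]]  # raises KeyError on non-ACGT input
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# Three consecutive in-frame codons taken from the gene sequence: GCA CTT CTG
print(translate("GCACTTCTG", TABLE_12))  # ALS
print(translate("GCACTTCTG", TABLE_1))   # ALL
```

Under table 12 the final CTG gives S, matching the protein sequence of this record, whereas the standard code would give L at that position.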