Gene PICST_56847 details

Gene Information

Locus tag: PICST_56847
Symbol:
ID: 4837921
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009043
Strand:
Start bp: 1619810
End bp: 1621210
Gene Length: 1401 bp
Protein Length: 451 aa
Translation table: 12
GC content: 46%
IMG OID: 640389236
Product: predicted protein
Protein accession: XP_001383933
Protein GI: 150864921
COG category:
COG ID:
TIGRFAM ID:
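
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive (the record does not state its convention), and the helper names `span_length` and `coding_bases` are hypothetical.

```python
# Values copied from the Gene Information fields above.
START_BP = 1619810
END_BP = 1621210
GENE_LENGTH_BP = 1401
PROTEIN_LENGTH_AA = 451


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_bases(protein_length: int) -> int:
    """Bases needed to encode the protein plus one stop codon."""
    return 3 * (protein_length + 1)


span = span_length(START_BP, END_BP)
non_coding = GENE_LENGTH_BP - coding_bases(PROTEIN_LENGTH_AA)
print(span, non_coding)
```

Under these assumptions the span equals the listed gene length of 1401 bp, and 45 bp are left over after accounting for 451 codons plus a stop codon, which is consistent with the gene being marked as spliced.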


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCGTTC CTTCTTTTAT GCCCCATGGC TCAAACCCAC AGACAGCCTC TTTCACCTGT 
AATACCTGTG GAATCAAGTT TGTCACGGCA GAGTTGCAAA GACAGCATAT GAAGACCGAC
TGGCACCGGT ACAATCTAAA GAGACGTGTG GCGGAGTTGC CATCGATCAC CTCAGACGTT
TTTGCCGAAA AAATATTGAA CCAGCAAACT TCTCAGGAAC CTGCTGAGGA GGACGAATAT
GGATTCTATG TAGCCCGTAG AAGGACCAAA GCCACCGGAA ATGGCAGGCA GATCACCAAG
AAGTTGATCA AACAACAGCA AAGACAATTA CACGAAGCCA GAGGAAGACC AGAACAGTCT
GAAGTTGTTT CTGGATCTTC CTTGAGAGCA GCTAGTCCTG CGACTTCCAT AGCTTCGGAG
TTTTCGCAAT TTTCACTTGG TGATTCCGAC CAACTTCATG AAGTGGCTTC TACCACAGAA
ACAGGCTCTG AGTTAAACTA CTCGGAGTCG GACTTCACTG ACTTGGAAGG TGACTTACTA
AGCGAAGAAG ACGAAGTGGA AGACCATGAT GCAGATGTAG AGTCGGAGTC AGAGTCGTTG
CAAGAAATCG AATCTATACC GATCACTCAT TGCTTCTATT GTGGGGACAA CAACCATGAG
GTGGAGAACA ATATTAGACA TATGTACAGT AGGCACGGGT TGTATATACC CGAAAGATCT
TACTTAGTGG ATTTGGAAGG GTTGCTCCAC TTTTTGAGCG AAGTAGTTTC TATAGACCAC
GAGTGTTTGG TGTGTGGCTT TGAGGGTAAA AACTTGGAGA GTATCAGACA GCATATCTAC
GCCAAGGGTC ATTGCAAGAT TCCGTATGAG AGCAAGGAGG AAAAACAGGC GATGGCGGAG
TTCTACGACT TCTATACGGA GGAGGAAAAG CCGAAGAGAG CTAGCACTTC GAAATCAGTT
GCATTTAAAG AAGTTGATGA TCAAATACTT GTGGATGTTC ATGAAGATGA ACAAAGAGAG
GAGGATGACG ATGAAAATGT TGATGATGAC GATAGGATGG CAATCGACAA CGGAATCAAC
GACAACTACT CCTTGGTTCA CGTAGACAGA AGCGGGGTAG AGTTGACATT GCCTACTGGC
TCGAGAATTG GCCACAGGTC CATGGCAAGA TATTATCGCC AGAACATCGC CTTGCCCACT
GAGCCCAGTG AATCATCCAA GACGGTGGCT CTTGTGGATC GTAGATTTGC ATCGGGATTG
TCTGCATATC AAGTCTCGAA GGAGGAGAAG GAGATCAGAA AGATCGAGCT GCAGGTCAGA
AACAACTACG AGAGAAAGAC CAAGAACAGG AGAGTTAACT TCCAAAAGCA TTTCCGTGAC
GAGCTCTTGG GACCCATGTA G
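
A GC content figure like the one reported for this gene can be recomputed from the sequence text. The following is a minimal sketch, not the method used by the database: the function name `gc_content` is hypothetical, and the demo input is only the first line of the gene sequence, so its value need not match the figure reported for the whole gene.

```python
def gc_content(seq: str) -> float:
    """Return the percentage of G and C bases, ignoring whitespace."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# First line of the gene sequence, used only as a short demo input.
first_line = "ATGAGCGTTC CTTCTTTTAT GCCCCATGGC TCAAACCCAC AGACAGCCTC TTTCACCTGT"
print(f"{gc_content(first_line):.1f}%")
```

Passing the full 1401 bp sequence, with its spaces and line breaks left in, works the same way because whitespace is stripped before counting.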
 
Protein sequence
MSVPSFMPHG SNPQTASFTC NTCGIKFVTA ELQRQHMKTD WHRYNLKRRV AELPSITSDV 
FAEKILNQQT SQEPAEEDEY GFYVARRRTK ATGNGRQITK KLIKQQQRQL HEARGRPEQS
EVVSGSSLRA ASPATSIASE FSQFSLGDSD QLHEVASTTE TGSELNYSES DFTDLEGDLL
SEEDEVEDHD ADSLQEIESI PITHCFYCGD NNHEVENNIR HMYSRHGLYI PERSYLVDLE
GLLHFLSEVV SIDHECLVCG FEGKNLESIR QHIYAKGHCK IPYESKEEKQ AMAEFYDFYT
EEEKPKRAST SKSVAFKEVD DQILVDDDDE NVDDDDRMAI DNGINDNYSL VHVDRSGVEL
TLPTGSRIGH RSMARYYRQN IALPTEPSES SKTVALVDRR FASGLSAYQV SKEEKEIRKI
ESQVRNNYER KTKNRRVNFQ KHFRDELLGP M
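
The record lists translation table 12, the alternative yeast nuclear code, in which CTG encodes serine rather than leucine. The sketch below shows how a stretch of the gene sequence could be translated under that table. It is an illustration under stated assumptions, not the pipeline that produced the protein above: the names `CODON_TABLE` and `translate` are hypothetical, and splicing is not handled.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for NCBI translation table 12, codons ordered TTT, TTC, TTA, ...
# It differs from the standard code only at CTG (Ser instead of Leu).
TABLE_12_AAS = "FFLLSSSSYY**CC*WLLLSPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), TABLE_12_AAS)
}


def translate(seq: str) -> str:
    """Translate in frame 1 until the first stop codon, ignoring whitespace."""
    bases = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(bases) - 2, 3):
        aa = CODON_TABLE[bases[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence above.
first_line = "ATGAGCGTTC CTTCTTTTAT GCCCCATGGC TCAAACCCAC AGACAGCCTC TTTCACCTGT"
print(translate(first_line))  # MSVPSFMPHGSNPQTASFTC
```

The output matches the first 20 residues of the protein sequence. Translating the full gene sequence this way is not expected to reproduce the listed protein exactly, since the record marks the gene as spliced and the 451 aa protein is shorter than an unspliced 1401 bp reading frame would give.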