Gene PICST_31308 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_31308 
Symbol 
ID4838685 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009044 
Strand
Start bp473726 
End bp475354 
Gene Length1629 bp 
Protein Length542 aa 
Translation table12 
GC content39% 
IMG OID640390000 
Productpredicted protein 
Protein accessionXP_001384386 
Protein GI150865247 
COG category[R] General function prediction only 
COG ID[COG5271] AAA ATPase containing von Willebrand factor type A (vWA) domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0207717 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCTTAA CGGCTGGATA TGAAGATGGA ACGTTCCATT ATACCCTCAC CAAGAAATCT 
ATATCTGGAG TGTTTCGTAC TCGTGCTGCT TCAAACGAGA ATATTGAGGA AGAGGAGTAT
GCGAGTGTCA TACAAGAACT ATTTCCACTT GACAATGAGG ATGTACTTGA GCAGGAGAGG
AAATATCCGA ATCTCCAGCT TTCAGTAAAG ATTTCTGCTA AAGCAGAATA CTACGACGCT
GAAACCAAAG AATTGATAGA GGAAGATAAT GATCCCAATC AGAATCCGGT AGAATTGTCA
ATAAAGACAG ACTCAAGGGT CCCTGTGACT GTTGCCATAA TTGAACTTAC ACCAGTAAAC
TTAGAGGAAG AACCACATCT CAAGGACGAA GGCAATCTCT TCAATTGGAT GGACTTACTA
TTAACGCAGA ACCAGAGTAT GGTATCGCAG ATTAAAAGCT TGCAGAAGAC AATAGAGACT
TTAAAAGAAG AAAACTTCCA AATGCACAAC GGCTACCAGA TCGCTAAGAA TGAGTACAAG
TTTATCATAG ATGATCTAGA GCAGAAGTTC TACCAAGTAT TGAATGCTAA GAAAGATAAA
ATATGGGAGT TGACCAAGAA AACCGTACCG ACAAATCGAG TCGAACATTT AAACCAAGAA
TTTCTCAAGG ATTCGTCGAA GCTAGTAGCA ATTGATAAAG ATATGATTCC GGATAAACTC
GACGATAAGT ACTTGAAGCA GAGGAAGAGA AAGACAGAAA CCAAAACTGA GGGTCTGCCA
AAGAAAAGAG CCAGACGAGG AAGGAAAAAG GCAGAATCTG AGGAACAAGA GGTCGAAGAG
AACAAAAAAG ATATTACAGT AAAAGAAGAA GAAGAAGAAG ATTTAGACGT AGTGGCACCA
AGAAGAAGAT CATTACGTAG AAGAAGTGAT GTAATAGTGA AAAAGGAACC AGATAGTCAA
GCACCATTAA TCAATAGTTT CTCGGACCGA GAGATATTGA ACAGCGTTCT CGAAAAAAGC
GAGGATGATG AAGAAGATGA AGAAGCAAGT GAACACGAAG ATGGCGATGA GTATGATGAT
GACAATGATT TGTACGAATC AAACCATAGT GACGATGATT ATGTGGAACA TGATGAAGAT
GAAGAAAAGA TTGAAAATGA AAACACACAG CCGGATTTTG GCAAATCTGT AGATACAGTT
GATTTAGAGG CTGGGAAAGA TAAAGATTCT GAGAAGTTAG TTCTGGATAG TACAGATATT
GTTTCAGACA GTCTTGAAGG GCAGATACAG AGATCTAAGG TGGTTTTGGA CTTTGAATCT
TCCGAGAGAA TCCAATTGAC CGGTGTAAAG GAAGAGCGAA AGGAAGAAAA TGAAGAGTCT
AATGAAGCTA ACGCTAGTCA AGAAGCCTTG GAGACAGATT ACGGTTCTTC AGATGATCAG
GAACAAAATA ATCATGAAGA AGACGAACTT GAAACTAGAA ACATTAGAAA CAATGGAGGT
GACATTAGTG AAGAAGAAGG CAAGATTGGA GACAATGCAA ATAAAGCAGA TACCAACGCC
GATGCAGATA AGCAGGTGGA TGCGAATCTG AAAGATGCGG AGACGGATTA TTCCTCCGAT
GAAGATTAA
 
Protein sequence
MSLTAGYEDG TFHYTLTKKS ISGVFRTRAA SNENIEEEEY ASVIQELFPL DNEDVLEQER 
KYPNLQLSVK ISAKAEYYDA ETKELIEEDN DPNQNPVELS IKTDSRVPVT VAIIELTPVN
LEEEPHLKDE GNLFNWMDLL LTQNQSMVSQ IKSLQKTIET LKEENFQMHN GYQIAKNEYK
FIIDDLEQKF YQVLNAKKDK IWELTKKTVP TNRVEHLNQE FLKDSSKLVA IDKDMIPDKL
DDKYLKQRKR KTETKTEGSP KKRARRGRKK AESEEQEVEE NKKDITVKEE EEEDLDVVAP
RRRSLRRRSD VIVKKEPDSQ APLINSFSDR EILNSVLEKS EDDEEDEEAS EHEDGDEYDD
DNDLYESNHS DDDYVEHDED EEKIENENTQ PDFGKSVDTV DLEAGKDKDS EKLVSDSTDI
VSDSLEGQIQ RSKVVLDFES SERIQLTGVK EERKEENEES NEANASQEAL ETDYGSSDDQ
EQNNHEEDEL ETRNIRNNGG DISEEEGKIG DNANKADTNA DADKQVDANS KDAETDYSSD
ED