Gene PICST_79414 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_79414 
SymbolHYR6.5 
ID4840765 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009047 
Strand
Start bp1094535 
End bp1096497 
Gene Length1963 bp 
Protein Length412 aa 
Translation table12 
GC content39% 
IMG OID640392080 
Productvon Willebrand factor (VWF2) 
Protein accessionXP_001386216 
Protein GI150866572 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATTGTCTATT TGAAGTTAGA AATTCTTATC TCCACATTTA TGGTGTATGC CTTCTTTAAT 
ACATCTAGTT TGCGTGTATG GCTTCATTAG TTTAGTAGTC TCACCAACAC TCACAGTGCT
GCCGTTCAAA TACATTGAAA GCTAGTTCAT AGATAAACGA AAGGGCTATA GCATTCAGGA
ATAACGTGAG TATCGTAATA TCACGAGCAT TGATTGTGTT AACATACTCC AGACTATTAA
TAGTCAACAA GCATCTGTTT ATGCGCAATT GACGATATTC TTTTATAATC AGAGACTCGA
AGATTATAAT TAGCGACTTC ATTTTATATC ATGAGAATGT AGTCAAATGT CGACATGTAC
TCACTCGTCA GTCATACTTT TGAGGAAGTG GGCGCCATTG AGCATCGACG CGGCCTTTGT
AGACAAAATG GATCCCCGTG TACACAAATA TTGAAGATTT TTAACAAGGG ACTTGTTAGA
ATGCCCTAAA AATGATAAAC TTACGCCCTA ATGCAAACCT AAAATTTATC TTACCTAAGA
AAAGACATGT TTACTGCGTC GTCGTCTTGT GTATATACCT GCATATGTCG TGTCTTTAAT
TGCTGCAAGA TTTCCAGATT CGACTAGCAA TGTTGGTAGA TAGTGACGCT TGACTATAAA
AGCAGGCTCA ACACCACTGC AAATAAGTTC TACATCCAGT AGCTAAACTT CAATTATCAC
AGCTATAATG CTTCTCTTCA GATGTTTAGC AAGGGCTTTT CTCTTTGCAT CTGTTGCATT
AGCGCTTACA ATCAGCCATC CTCAGGTAAC TAGAGGTAGC ATAAACCTCT CATTTGGTGA
TTTAACTATT AAGTCTGGCT CTTTTTGGTC TATATTTGAT AATAACATAT CTATTTTTAA
AGGAGATATT CGGGTGAAGA AAAATGCTGG TCTTTTTATC ACATCTACTA ACAAATTGAT
CGGTTTGAAA GTAGAACTTT ATGCTGGTAA GGGTTCGATA AAGAACGAAG GTTTGATTGT
CTTTAACTCT CTCGTATCTT TGACTCCTTC CTTCTACAAG CTTATCGGTA AGAGTTTTAC
CAATAAAGGT GAAATATTTT TGATCTCTAG TGGTTGTGCT TTACCTACAG CTGCCCTTTT
GGCTCCAAAA TGGAAGAACA CTGGATCGAT AACCTTCTAT CAGGCCAAGA GGAACAACGG
TATTGTAAGT TTGGGCTCCC CTGGATCAAA GATTGAAAAC AAGGGTCAGA TTTGTTTATT
TAATCAACTT TACAAGCAAA CAACACGCAT TGTTGGCAGT GGCTGTATTA CTGCAGACCA
AGACTCCAGT GTCTTCTTTT CAAACTGTTT GATGGATATC GATAGAAAAC AAACGGTGTA
CTTGGCCGAT TCGAAATCCT CTGTAAGAGC TGTTGCCATT GCTAAACCCA GAACTTTCAG
AATTGCAGGA TTTGGAAATG GTAACAAGAT TGGATTGGAC TTACCACTTT TCAAATTACT
TCTGTCTCCA TTTTCCTATA ATTCTAGAAC TGGTATCTTG TCTCTCAGAA TTAAGGGCAA
TTGGGGTCAA GATTTTGATA TTGGATTAGG TTATGATTCA AGATTATTCA AAGTTACGAC
CGATACTAGT CTCGGATTGT TGAGTGTTCC GTGGGGAGCA GTTAAGTACG AAGGACCAGT
ACCTAATAAG CAGATTCCAA GCAACTGCAG GCCATGCAAA CCTTTCCCAC CAGCTCCAAC
TACTACTACT ACAACCAAAG CATGCTCTCC AAAATCTACC ACCTCCTTAA CTACTTCCAA
AACATCCTCT CCAAAACCTA CCACCACCTC AACAACATCC AATGCACAAA CTACAATTAC
GACTACTTGG ACTGGCACCA CCACAAGAAC TATTACTGAG ACCGATACGC CAGGTGGTAC
AGACACTGTT ATCGTTGAAG AGCCAACTAC CTCCAACACT CAA
 
Protein sequence
MLLFRCLARA FLFASVALAL TISHPQVTRG SINLSFGDLT IKSGSFWSIF DNNISIFKGD 
IRVKKNAGLF ITSTNKLIGL KVELYAGKGS IKNEGLIVFN SLVSLTPSFY KLIGKSFTNK
GEIFLISSGC ALPTAALLAP KWKNTGSITF YQAKRNNGIV SLGSPGSKIE NKGQICLFNQ
LYKQTTRIVG SGCITADQDS SVFFSNCLMD IDRKQTVYLA DSKSSVRAVA IAKPRTFRIA
GFGNGNKIGL DLPLFKLLSS PFSYNSRTGI LSLRIKGNWG QDFDIGLGYD SRLFKVTTDT
SLGLLSVPWG AVKYEGPVPN KQIPSNCRPC KPFPPAPTTT TTTKACSPKS TTSLTTSKTS
SPKPTTTSTT SNAQTTITTT WTGTTTRTIT ETDTPGGTDT VIVEEPTTSN TQ