Gene PICST_65920 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_65920 
Symbol 
ID4840286 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009046 
Strand
Start bp681450 
End bp683240 
Gene Length1791 bp 
Protein Length576 aa 
Translation table12 
GC content45% 
IMG OID640391601 
Productpredicted protein 
Protein accessionXP_001385482 
Protein GI126137918 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0024] Methionine aminopeptidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCAAGA TGCAACTTGC TGTCCACCAA GAGGACGCTG ACATTTTGTT GAAGGAAAAG 
AACGTTCTCA ACGATCCTGT TTTGGACAAA TACAGAGTTT CTGGCCAGAT TGCTCAGACT
GGGTTACAAT ATATTGCTTC TTTAATCAAT GATTCGTACC ATTTAGGCAA ATACCCTCAA
CCCTTGACCG TCCAGGAGTT GTGTATTTTG GGAGACTCGT TTTTGACCAA GTTGTTGTCC
CGTGTGTATA ATAATGTCAT CAGAGAGAAG GGAATTGCCC AGCCTACCTC CATCGAAGTC
AATGAGTTGG TGGCTGGTTT TGCTCCAGAA GTTGATGATG AAGGTGCCTA CACTTTCGTA
GCTGGAGATG TTGTTACCAT CTCGTTGGGT GTCCAGATCG ACGGTTATAC TGCCAATGTG
GCGCATACCG TGGTCATCTA CCCTGCTGGC GTAGAAGTGA ATAACGAAAT CAAGCCCACT
GGACCTTTGT TGGGAGGTAA GGCTGACGCC ATCTGTGCTA CCCATATTGC CACCGAGACC
ATTGTTGCTT TGTTGGGATT GGCTTTATCA CCAGAAAAGA TTCCTGCCCA ACTCAAGATC
AACGGTAGCG CCACTATCAC CGGAGGTCAC ATCCGTGCTC TTGTGGACTC GGTTGCTGAG
TCGTTCAACT GTGTGGTTTT GCCAGGATCC AAGGTCAGAA GAGTAAGAAG ATTCTTGTCG
GGACAAGCTG AGGGTATTGT TGCCGAACGT GATTTCAAGG GTGTCGTTTG GGACGAATCT
CACCAGGAAC AGAAATTGTT ACAGAAGAGT ACCATAAGTA ATAGCACAGA TTTGATCATC
CAAACAAACA ACAGCAACAC TAGCACATCA ACCAACACCT CCAGCGCCAT TCCAACAGAT
GATTTTGTTG TTTTGGCTGG TGAAGTGTAC CAAATCGACA TGAGATTGGC TTCTTTACAG
GAGTTCGAAG GCGAGGCTGG TTTGATCACC ACCGAGGAAA TCGACCATTT CACTGGCAAG
AACCACAAGA ATGAATTCAA CTGTAAGAGC ACTATCCACG TTCGTGATTT CGCAGTGACT
CACCAGTTGA AGTTAAAGAC TTCTCGTAGA TTGTTAGGTG AAGTCGACAA GAGATTCTCG
GTTTACCCAT TCAAGTTATC ATACACCTGC AAGCATTTCC CAGTCAAGTT AGAAAATGAC
AATGTCCAAG AACAATTGGC ACAGATTAAG TCCGAATTGA AGACCAACAA ATTGGGGTTG
TCCGAGTTGT CCAATAGACA TTTAATAAAA TCAAAGCCAG TGCAAGTCAC AAAGTTCATA
CCCTTGGACA AGATCTTGCT TTCAGCTAAC CCTACTGGTA AACACGCAAT TGACATGAGC
AAGCCTGTTT TGCCAGGTAT GGAAATCCCC TTGCCAAACT TGGGAGTCTC GTCTTTGAAG
TTGAAGGCTT TGTTGAAGCA CGCCAAGCCT ATTGCTAACG TCAGAGAATC TACTACTGTT
GTTCTCAACA ACGTCAAGAA CGAAGTTGTT CGTTTGACCG GCGGCTCTAA GACTACTACG
CCAAGCTGGG TCCACTCTCA ATACAAGTTG GGAGGTGCCT ACGTCCAGTC CATTGAACAA
ATTGTGCAAT TGAGCAAAGA CAAGAGATTT GGTATCAAGG TCAAAGAATG CCAGCCATAC
AACTTGAGCA AGACTGTTGG CCAGGCTGCC GAGACCATGG AGTTGGATTA GATAGCGAAG
TAGACGTTGT AAAATAGCAT TTTTGAATAG ACAAGAGATA AAAGTTACGT C
 
Protein sequence
MSKMQLAVHQ EDADILLKEK NVLNDPVLDK YRVSGQIAQT GLQYIASLIN DSYHLGKYPQ 
PLTVQELCIL GDSFLTKLLS RVYNNVIREK GIAQPTSIEV NELVAGFAPE VDDEGAYTFV
AGDVVTISLG VQIDGYTANV AHTVVIYPAG VEVNNEIKPT GPLLGGKADA ICATHIATET
IVALLGLALS PEKIPAQLKI NGSATITGGH IRALVDSVAE SFNCVVLPGS KVRRVRRFLS
GQAEGIVAER DFKGVVWDES HQEQKLLQKS TISNSTDLII QTNNSNTSTS TNTSSAIPTD
DFVVLAGEVY QIDMRLASLQ EFEGEAGLIT TEEIDHFTGK NHKNEFNCKS TIHVRDFAVT
HQLKLKTSRR LLGEVDKRFS VYPFKLSYTC KHFPVKLEND NVQEQLAQIK SELKTNKLGL
SELSNRHLIK SKPVQVTKFI PLDKILLSAN PTGKHAIDMS KPVLPGMEIP LPNLGVSSLK
LKALLKHAKP IANVRESTTV VLNNVKNEVV RLTGGSKTTT PSWVHSQYKL GGAYVQSIEQ
IVQLSKDKRF GIKVKECQPY NLSKTVGQAA ETMELD