Gene PICST_44719 details

Gene Information

Locus tag: PICST_44719
Symbol:
ID: 4838692
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009044
Strand:
Start bp: 1534621
End bp: 1537806
Gene Length: 3186 bp
Protein Length: 895 aa
Translation table: 12
GC content: 40%
IMG OID: 640390007
Product: predicted protein
Protein accession: XP_001384250
Protein GI: 150865150
COG category: [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG5096] Vesicle coat complex, various subunits
TIGRFAM ID:
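
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive; the record does not state this, although the convention reproduces the reported gene length. The function name `span_length` is illustrative only.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


# Values copied from the record above.
gene_length = span_length(1534621, 1537806)
coding_bases = 3 * 895  # bases needed to encode the 895 aa protein

print(gene_length)                 # 3186
print(gene_length - coding_bases)  # 501
```

The 501 bp left over after accounting for the protein fits the record's "Is gene spliced: Yes" flag, but the record gives no intron coordinates, so this is only a plausibility check.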


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00567496
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCGTTCC AATTGCAGAA CTCCGAGGTC TTGGCGCGGT TGAAGCCCTT CGGTATTCTG 
TTTGAGAAGT CGCTCTCCGA CTTGATCAAA GGGATTAGAC ATCAATCCAA GGAGTCTCCA
GAGTCTTTAC TGAACTTTCT AGATGTCGTG ATCCAGGAGT GCAAAACCGA GCTTTCAACG
ACGGATTTGG AGACAAAGGC TACGGCAGTG TTGAAGTTGG CATATTTGGA GATGTATGGC
TTTGATATGG CTTGGTGCAA CTTCCAGATC TTGGAAGTGA TGTCTTCAGG CAAGTTCCAG
CAGAAGAGAA TCGGATATTT GGCTGCGATC CAGCTGTTCA AAAACGAACA GGACTTGTTA
ATCCTTGCTA CCAATCAGTT CAAAAAGGAC TTGAACTCGC ATAATCACAC CGAGATAGGT
TTGGCACTTA GTGGCATTGC TACCATTGTT ACTCCCAATT TGGCGAGAGA CATCAACGAC
GACGTGTTGA TGAAATTGAG CCATTCGAAA CCGTATATTC GTAAAAAGGC TATCTTGGCC
ATGTACAAGA TCTTCTTACA ATATCCTGAA AGTTTGCGAG TTAATTTTAA TCGCGTTATC
GCCATGTTGG ACGACGCAGA CATTTCCGTG GTTAGTGCTA CTGTCAATGT AATCTGTGAA
ATTTCCAAGA AAAACCCGCA TATATTCATG ACAAGTTTGC CCAAATTCTT CTCCATCTTG
GAGGACACCA AGAATAACTG GTTAATCATC AGAATATTGA AGTTGTTCCA GAGTTTGTCG
CGTGTAGAAC CTCGTATGAA GAAGAAGATT CTTCCGACGA TCTTGGACTT GATCCTCAGA
ACCCAAGCAT CGTCCTTGAT CTACGAGTGT ATCAACTGTA TCGTTAACGG CAACATGTTG
AGTGCAGACT CTTCAAAGGA TAAGGAAACG GCAAAAATCT GCATTAAACA AATTATGGAG
TTCTTCAAGA CAAAGGACTC CAACCTAAAA TTCGTGGGCT TAATTGCATT AATTAGCATC
TTGAAGATAT TCCCCGTGTT TATGCACAAA GTTGATGGTG TTTCAACTAT CATAATGGAC
TGTCTCACGG ATCCAGATCT TATCATAAAG AGAAAAGCAT TGGAAATCTG CCATTACTTG
GTTCAAGAAG ATAATATAGC CGAAGTAGTA AAGGTCTTGT TGTTGCAGTT GATTCCAAGT
GATACGAACG CTATTCCAGA GGCTTTAAAG CAGGAAGTCA CTTTGAAAAT CTTGTCAATA
ACATCGAACG ACAAGTATGC GAATGTGCCC AACTTCAAAT GGTATGTGGC AGTATTGAAG
GATATCATCA ATTTGACTTT ACTTCCGCTT CCTTCTTCTT CCAATGCTAG CACGATCTCT
CCAGCAACAG CAAACGTCAT AGCTGCAGAA ATCGGTAAAG AATTCAAAGA GTTAGCCACC
AAGGTGCCTT CTATTAGACC CACAATTCTC AACAAAGTGA TTGTGGAAGC TGTTCAGGAT
GTAAGAATCT TGGACGTGTG TCCTTCATTG CTTAGGGACT TCTACTGGAT TATGGGAGAG
TATATAGACG AGTTGAGATC TCCATCCGAA GAAGAAAGTG ACGTTGAAGA CGAGGATGAT
ATTGAGGAAT CTTCTGTTTT GGACCTTGGC AAGAAGATCC AGATTTTCAA CGCGTTGGTA
AACCACGATA TAGACAAGGT ACTTGGTTTA TCTGTAAATA CCCATTTTCC AATTTCATCC
AAGTTGATTA CCTTATCTGA TTCTAATGTC CAAGTAGTGT TTATCCAGGC AATTGTTAAG
TTGTACAATG GCATTGTGAC CGATTATTTG GTGCACTATT CAGTTCAAGG GAAATTCAAG
CGGGAGCAAT TTAATCAATT GGCCCATTAT TTATACAAGT TGATCAACTT CCTTGGAAAC
TGGGAGAACC ATAGGAACTA TGAAGTTCAG GAGAGAGCTT TGTCGTGGTT GGAATTCTTG
AAGCTTTCAT TAGAAGCAAT GACACATGAA GATATTTCGG CTATCCAGAA ATTGGAAAAG
GACGAGGTTG AGTATTACAG AAACTTACCG AGATCTGAAG AAGGTGAAGA TGAAGACGAT
GAAGTTTATG ACGAAGAATC TTCGGAAGAA GAGGAAAGTG AAAATGACGA TACAGACAAC
AGCATTAAGC CAGTCAGAGA CAATGAATAC GAAAACTTGA GCTCTTCATC TGAAGAGGAT
AACGATGAAG ATGGAGATGA GAGTGAAGAA AATGGTAAAG ACGAAAACGT GGAATATCCT
AATAACGAAG TCAATGGTGG GTTTGATGGC GTTGAACATA GTCCATTTCC TGAAACAGAT
GACTTCCTTA CAGAACCTCT GAAAGAGAAT AGTTTGCCAA TGTTGCTAAC ACATATTCTT
CCATCATTTT TCAAGAGTTA TCCTTTGAAC CCAATTGCAA AGAACTCACA GAAAAAGATT
CCTATTCCAG AAGATTTGAA TCTTGACGAG CCAATCTACA CCATTCCATT TGATGTGTCT
GCTGATGACG TTGACAGTTT TGTCAATGAT GAATATGATT TATTCATTGA AGACGAAGTT
GATTTACATG CTGAAGAAGC ATCTTTGATC AGTTTGTCTA ACAGAGGCAG TGACGATGAC
TTGAAGAAGA AACAGGAGAG ATTGGAGAAA TTGAGAGATG ATCCATACTA TCTTGGATCC
AAGAAGTCTT CTAAGAAGAA GTCTATTAAC AGGAGAGTTC TCTTGGTCGA TGAAGACAAG
ACCCCAAGCC CAGAAAACTT CAGTGAGAAG GGTTCGATTA ATTCAGGAGT TGCTCCTGTC
AAGGAGAGAA AGAAGAAACC GTTGAAGATG AAGAAGGATA AAGTGGTCAT CTTGTCGGAA
GAGACAATAG AAGGTGGTCC AGATGAAGAA GAAGATGAAG AAGCCACTGC TGTTAAAGCC
AAGTCCAAAA AAAAGAAGAG TAACTTTATG ATTGATTCGT CGAATTTGGA TAATTTCGAT
CTTACTTCTT CGGCCATGTC AGAGTCGGTC TCTGGTTTGG ACAAGGACTA CGAGTACAAC
ATTGATTTGG ACGAGTTAAG AAAGAAATTG GCCCTGTCTT CGTTGAAAGA CAAGGAGAAG
AAGGAAAAAA AGGAGAAAAA GAAAAAGAAA AAGAAGAGTT CCGCTTCTAA CGTTGAAAAA
ATCAAA
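
The gene sequence is printed in blocks of ten bases, so it has to be joined before it can be measured. The following is a minimal sketch of that step together with a GC-fraction calculation. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first printed line of the sequence is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Join a block-formatted sequence into one uppercase string without whitespace."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence (0.0 for an empty one)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence, copied from the record.
first_line = "ATGTCGTTCC AATTGCAGAA CTCCGAGGTC TTGGCGCGGT TGAAGCCCTT CGGTATTCTG"
seq = clean_sequence(first_line)

print(len(seq))  # 60
print(round(gc_fraction(seq), 3))
```

Applied to the whole sequence, the same functions give a total length and GC fraction to compare with the Gene Length and GC content fields. A single line need not match the 40% reported for the full gene.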
 
Protein sequence
MSFQLQNSEV LARLKPFGIS FEKSLSDLIK GIRHQSKESP ESLSNFLDVV IQECKTELST 
TDLETKATAV LKLAYLEMYG FDMAWCNFQI LEVMSSGKFQ QKRIGYLAAI QSFKNEQDLL
ILATNQFKKD LNSHNHTEIG LALSGIATIV TPNLARDIND DVLMKLSHSK PYIRKKAILA
MYKIFLQYPE SLRVNFNRVI AMLDDADISV VSATVNVICE ISKKNPHIFM TSLPKFFSIL
EDTKNNWLII RILKLFQSLS RVEPRMKKKI LPTILDLILR TQASSLIYEC INCIVNGNML
SADSSKDKET AKICIKQIME FFKTKDSNLK FVGLIALISI LKIFPVFMHK VDGVSTIIMD
CLTDPDLIIK RKALEICHYL VQEDNIAEVV KVLLLQLIPS DTNAIPEALK QEVTLKILSI
TSNDKYANVP NFKWYVAVLK DIINLTLLPL PSSSNASTIS PATANVIAAE IGKEFKELAT
KVPSIRPTIL NKVIVEAVQD VRILDVCPSL LRDFYWIMGE YIDELRSPSE EEMFIQAIVK
LYNGIVTDYL VHYSVQGKFK REQFNQLAHY LYKLINFLGN WENHRNYEVQ ERALSWLEFL
KLSLEAMTHE DISAIQKLEK DENSLPMLLT HILPSFFKSY PLNPIAKNSQ KKIPIPEDLN
LDEPIYTIPF DVSADDVDSF VNDEYDLFIE DEVDLHAEEA SLISLSNRGS DDDLKKKQER
LEKLRDDPYY LGSKKSSKKK SINRRVLLVD EDKTPSPENF SEKGSINSGV APVKERKKKP
LKMKKDKVVI LSEETIEGGP DEEEDEEATA VKAKSKKKKS NFMIDSSNLD NFDLTSSAMS
ESVSGLDKDY EYNIDLDELR KKLASSSLKD KEKKEKKEKK KKKKKSSASN VEKIK
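
The record lists translation table 12, the alternative yeast nuclear code, in which the codon CTG is read as serine rather than leucine. The sketch below builds a codon table from the standard code, overrides that one codon, and translates the first 60 bases of the gene sequence. The names `TABLE_12` and `translate` are illustrative. The code is not meant to translate the full genomic sequence, because the gene is marked as spliced and intron positions are not given here.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order under the standard code (table 1).
STANDARD = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
TABLE_1 = dict(zip(CODONS, STANDARD))
TABLE_12 = dict(TABLE_1)
TABLE_12["CTG"] = "S"  # table 12 reads CTG as serine instead of leucine


def translate(seq: str, table: dict = TABLE_12) -> str:
    """Translate a nucleotide string codon by codon, ignoring whitespace and any trailing partial codon."""
    seq = "".join(seq.split()).upper()
    usable = len(seq) - len(seq) % 3
    return "".join(table[seq[i:i + 3]] for i in range(0, usable, 3))


# First line of the gene sequence, copied from the record.
first_60 = "ATGTCGTTCC AATTGCAGAA CTCCGAGGTC TTGGCGCGGT TGAAGCCCTT CGGTATTCTG"
print(translate(first_60))           # MSFQLQNSEVLARLKPFGIS
print(translate(first_60, TABLE_1))  # MSFQLQNSEVLARLKPFGIL
```

The table 12 result matches the first 20 residues of the protein sequence above. The standard code would end that stretch in L instead of S.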