Gene PICST_33544 details


Gene Information

Locus tag: PICST_33544
Symbol: GPI17
ID: 4840673
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009047
Strand:
Start bp: 925507
End bp: 927063
Gene Length: 1557 bp
Protein Length: 518 aa
Translation table: 12
GC content: 38%
IMG OID: 640391988
Product: Glycosyl Phosphatidyl Inositol 17
Protein accession: XP_001386366
Protein GI: 150866691
COG category:
COG ID:
TIGRFAM ID:
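
The coordinate and length fields above can be cross-checked against each other. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the unspliced CDS ends in a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length = gene_length_bp(925507, 927063)
print(length)                     # 1557
print(protein_length_aa(length))  # 518
```

Under these assumptions both values agree with the Gene Length and Protein Length fields of this record.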


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACTGAGA CGGAGAGTGA AAATGGCCTC AAGGTTCAAA ACCAGGAAGT TCCTGAGTCA 
GACACAATTG CATATATCAG ACGTCTCATT GTGTTTATTG TAGCACTCAC AGTATTGGGC
TTAGGTTATC CAGTTTTGCA ATTTACAACA GCCATATACA GAGCAGACTT GCCGGTAGAT
GAAATCACAA GTTTGGCCAG CACATTACAC AACGACATTC TGTTCAAAAT CCCCGTCTAT
TTAGACATAC CGACCACGTT AGACGTCTTT ATTCCTGATT CTCAGGAGAA GTTGAACCAA
TTTGTCAACT CCAAGTATCC AGAACTTGCT AATTTTTGGT CGCTCGACTT GAAAAAAATC
ACACCTGGCA TTGATCCGGA AATTGACTAT GTCGTGAAAC TTGTTCAAGA TGAAAATGAA
AATGGAGACG ATGCTGTTGA TATGTCTCCA TTTTCAAAGG AAACTACGTT GAAAGTGTCG
CAGAATTGGA TAGACTCTAA GTTGGTGGAC CAGGTTTTGT CTTCAGTACT TGTAGACATG
GTTTTCAAAG AGGAAATATC TGAATTAGTG TCTATTATCA ACAATAGGGC CAAAGAGTTG
GATAAAAACA TTGTTGTTCC ATATTCACCG AACTACAATT TAGTCTTTTC TCTACTTGTA
GAAAATGGTA GAACTGTCAA GTGGGATATT GAGACTGCTC TCAAGCAAAT GAAACCATTC
TTAAACAAAT TGACCCACTA CACCAACTTC TCTATCAGCA CTCAGGTTCA GTACTACTCT
AAGACTGAAA AACCTGTGGT GTTTGACGAG AAGAAAAATG CTTACATTTT GAAGGAATCA
GATTTGTCAA CTTTCATCAA CTTTGGTGAC TGGAATTTGA ACACACATGA TATGGATCCT
TCTATTAACT TCTTGGTCTA CTTCCCAGAA TCTAATTACG AAAACAAGCC TTGGGTGATT
GACCACTTGG ATAACGGTGC CTTTTTGGTG AAGCAATGGG GTGGTGTATA CATTTTCAAT
AAGGAAAAGC CGATTCTTGA AGGATATGAT GTCAACATTA CTGAGTTGGA ATTGATCCCA
ATATTGGAAA TTTTTACCTC TCAGCTTTTC CAGTTACTTG GCTTGGCCAC GTTTCCCAAG
TCACCCTCTA TGAGAGTCGA TACCTTGACA AGATTGACCT TATTTAAAAA TTTGAAAAAA
ACATTGGAAA ACTTACATTC TCTTGTCAAG CTCACAGTTT CATTGAATGA AATATCTATT
CCAGATGAAA CTAAAGAACA TGTCTTGAAG TCTATCGAAT TGGTTAAGTT GGCCATTAGC
GAAATTAACC AAAAACAAAA CTACCATAAT TCCATGACCA TATCATCAAA GGCTTTAACG
ATTTCTGACA GAGCCTTCTT TGACAAAGAA ATGGTCCAGC AAGCGTACTT TCCAAATGAA
CATAAGATGG CGGTCTTCTT GCCCCTCCTT GGGCCTGTCA CTTCTATTTT GGCCATAGCG
TTAATCAAGA TCTTAGTTAG TTTCAAAACA GGGTTGAAAA AAAAGAAGGC CGATTAA
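
The gene sequence is printed in space-separated blocks of 10 bases, so it needs to be joined before any analysis. The following is a minimal sketch of how a GC percentage such as the one in the GC content field could be computed; the helper names are hypothetical, and only the first row of the sequence is used here, so the result will differ from the whole-gene figure.

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the bases."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First row of the gene sequence, copied as displayed.
first_row = "ATGACTGAGA CGGAGAGTGA AAATGGCCTC AAGGTTCAAA ACCAGGAAGT TCCTGAGTCA"
seq = clean_sequence(first_row)
print(len(seq))  # 60
print(round(gc_percent(seq), 1))
```

Passing all rows of the gene sequence through `clean_sequence` first would give the value for the full gene.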
 
Protein sequence
MTETESENGL KVQNQEVPES DTIAYIRRLI VFIVALTVLG LGYPVLQFTT AIYRADLPVD 
EITSLASTLH NDISFKIPVY LDIPTTLDVF IPDSQEKLNQ FVNSKYPELA NFWSLDLKKI
TPGIDPEIDY VVKLVQDENE NGDDAVDMSP FSKETTLKVS QNWIDSKLVD QVLSSVLVDM
VFKEEISELV SIINNRAKEL DKNIVVPYSP NYNLVFSLLV ENGRTVKWDI ETALKQMKPF
LNKLTHYTNF SISTQVQYYS KTEKPVVFDE KKNAYILKES DLSTFINFGD WNLNTHDMDP
SINFLVYFPE SNYENKPWVI DHLDNGAFLV KQWGGVYIFN KEKPILEGYD VNITELELIP
ILEIFTSQLF QLLGLATFPK SPSMRVDTLT RLTLFKNLKK TLENLHSLVK LTVSLNEISI
PDETKEHVLK SIELVKLAIS EINQKQNYHN SMTISSKALT ISDRAFFDKE MVQQAYFPNE
HKMAVFLPLL GPVTSILAIA LIKILVSFKT GLKKKKAD
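
The record lists translation table 12, the alternative yeast nuclear code, in which the codon CTG is read as serine rather than leucine. The sketch below shows the effect on one row of this gene. It builds the standard code from its usual compact listing and overrides CTG; the `translate` helper is a hypothetical name, not a library function.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
STANDARD_AA = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

STANDARD_CODE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), STANDARD_AA)
}
# Translation table 12 differs from the standard code at CTG (Ser instead of Leu).
TABLE_12 = dict(STANDARD_CODE, CTG="S")


def translate(seq: str, table: dict) -> str:
    """Translate an in-frame DNA sequence, stopping at the first stop codon."""
    seq = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = table[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# Fourth row of the gene sequence (bases 181-240, in frame), copied as displayed.
excerpt = "GAAATCACAA GTTTGGCCAG CACATTACAC AACGACATTC TGTTCAAAAT CCCCGTCTAT"
print(translate(excerpt, TABLE_12))       # EITSLASTLHNDISFKIPVY
print(translate(excerpt, STANDARD_CODE))  # EITSLASTLHNDILFKIPVY
```

The table 12 result matches the start of the second row of the protein sequence above, while the standard code would give a leucine at the CTG position.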