Gene PICST_51733 details


Gene Information

Locus tag: PICST_51733
Symbol: YHC3
ID: 4851198
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009068
Strand:
Start bp: 1177086
End bp: 1178357
Gene Length: 1272 bp
Protein Length: 406 aa
Translation table:
GC content: 43%
IMG OID: 640392906
Product: vacuolar CLN3 homolog involved in pH homeostasis
Protein accession: XP_001387870
Protein GI: 126274184
COG category:
COG ID:
TIGRFAM ID:
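
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes 1-based, inclusive coordinates and that the gene span includes the stop codon; the record states neither convention explicitly, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 1177086
end_bp = 1178357
gene_length_bp = 1272
protein_length_aa = 406

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1
assert span == gene_length_bp

# Assumption: the gene span includes the stop codon, so the coding bases are
# one codon per residue plus one stop codon.
coding_bp = (protein_length_aa + 1) * 3

# Bases in the gene span that are not accounted for by the coding sequence.
non_coding_bp = gene_length_bp - coding_bp

print(span, coding_bp, non_coding_bp)  # prints: 1272 1221 51
```

Under these assumptions, the 51 bp difference agrees with the record marking the gene as spliced.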


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00839539
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGAGCTTAA TAGTACCAGA ATCCACAAAA ATCTTCACTT CCTTCTTTCT CTTTGGACTC 
CTAAACAATA TCCTTTACGT GGTAATCCTA TCTGCCGCTA TTGACTTGGT TGGTGCTGCT
ACTCCCAAGG CTGTGGTACT ATTGGCTGAT GTTGTCCCAG CTTTTATATT CAAGCTCACT
GCTCCCTTCT TTATTCATAC GATATCGTAT GTCTCGAGAA TATGGTCTCT CGTAGCACTT
TCATCTTTGG GAATGTTATT AATCAGCTTG ACTCCTGAAA GTTCCATCGG CGTCAAAGTG
TTTGGAATAG TTCTAGCTTC GTTTTCCTCG GGGTTTGGAG AAGTTTCGTT TTTGCAATTG
ACCCACTATT ATAACGAAGA AATGTCGCTA GGAGGCTTTG CTAGTGGGAC TGGAGGTGCT
GGTCTTTTTG GCAGCTTCTT GTTCATGTTC ATGACAAACA TCTTGGGAAT CAAGGTATGG
TTAGTACTCT TGCTCTTTGC TTTGGTTCCG CTCGGTTTCC TTGCTACTTT CTACTTGCTC
CTACCATCAC CTGGACTAAG TGAAAATGTG TATGAACCGA TATTCGACGA GGAAACTCAG
ATAGACCCAG TGGAAACAGA GTTAGAGTCT TTGGACCGCA TCGACGGCGA ATTATACGAA
CCTAAGACAT ACAGTCTAGA AAGCTTGAAA CTCCATGTTT CTAAAACTAT AACTCTAATC
ACACCGCTAG TACTTCCGTA CATGCTACCA TTGAGTTCCG TATATGTTTC AGAGTATGTA
ATCAACCAGG GAATATCTCC TACCTTATTG TTTCCTTTGG ACGACTTGCC TCATTGGTTG
TTCTCGACTT ACAGAGATAT TTATGTTGTC TACGGATTCT TGTATCAACT AGGAGTTTTC
ATTCTGCGGT CCTCGATGAA TTTCGGCATA AGAATCAAAC AGCTATATGC GTTGTCATTG
CTTCAGTTCG CCAATGTCGT TATTACACTC TACCAGTCTG TTTACGATGC TCCCTTTAGT
TCCATCTGGC CTCTTATGGG GCTTATTTTC TACGAAGGGC TCCTAGGCGG CTTTCTGTAT
GTAAACACTT TCATGTCAGT TAGTGAAGAC ATACCCAAGA CTGAACGAGA ATTTTCTATG
GGATGTGTTT CCATTAGCGA CAGCTTGGGT ATAGTTTTAG CCGGGTGTAT CAACTGGTGG
CTTGAAACAA AACTTTGCGG CTTGCAAGTG CAAAGGGGCA GGGACTGGTG TCTGAAGGGT
AGCTCCTTTT AG
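
The gene sequence above is printed in space-separated blocks of 10 bases, so it needs to be cleaned before its length or GC content can be compared with the Gene Length and GC content fields. The following is a minimal sketch of that cleanup; the helper names `clean_sequence` and `gc_fraction` are hypothetical and not part of any database tool.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace (block spaces and line breaks) and uppercase the result."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in a cleaned sequence (0.0 if empty)."""
    if not seq:
        return 0.0
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# Usage with the first line of the gene sequence as printed above.
first_line = "ATGAGCTTAA TAGTACCAGA ATCCACAAAA ATCTTCACTT CCTTCTTTCT CTTTGGACTC"
seq = clean_sequence(first_line)
print(len(seq))                    # number of bases in this line
print(round(gc_fraction(seq), 3))  # GC fraction of this line only
```

Pasting the full sequence block into `clean_sequence` should give a string whose length can be compared with the Gene Length field, and `gc_fraction` of that string can be compared with the GC content field.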
 
Protein sequence
MSLIVPESTK IFTSFFLFGL LNNILYVVIL SAAIDLVGAA TPKAVVLLAD VVPAFIFKLT 
APFFIHTISY VSRIWSLVAL SSLGMLLISL TPESSIGVKV FGIVLASFSS GFGEVSFLQL
THYYNEEMSL GGFASGTGGA GLFGSFLFMF MTNILGIKVW LVLLLFALVP LGFLATFYLL
LPSPGLMETE LESLDRIDGE LYEPKTYSLE SLKLHVSKTI TLITPLVLPY MLPLSSVYVS
EYVINQGISP TLLFPLDDLP HWLFSTYRDI YVVYGFLYQL GVFILRSSMN FGIRIKQLYA
LSLLQFANVV ITLYQSVYDA PFSSIWPLMG LIFYEGLLGG FLYVNTFMSV SEDIPKTERE
FSMGCVSISD SLGIVLAGCI NWWLETKLCG LQVQRGRDWC LKGSSF