Gene PICST_31146 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_31146 
Symbol 
ID4838367 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009044 
Strand
Start bp62839 
End bp64194 
Gene Length1356 bp 
Protein Length451 aa 
Translation table12 
GC content42% 
IMG OID640389682 
Productpredicted protein 
Protein accessionXP_001383969 
Protein GI126134890 
COG category[R] General function prediction only 
COG ID[COG0385] Predicted Na+-dependent transporter 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.815833 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTTGATT TCGAAGCTAT AAAGCAATCG AAACCATACA AATGGGGTAA AGCCGTACTT 
GACTTTGCTA TAGCGCAATG GTTTTTTGTG TTTTTGGCGG TTTTCATAGC GTTGGCCCAT
TCTTTTCCAG AGTTCGCTAA ACAGGGAGGT ACTATCAGAG CTGAGTACTC CATTGGTTAC
GGTGCTGTCG CTGTGATTTT CTTGATCTCA GGTATGTCCA TGTCGTCCAA AGACTTGATG
ATCAATGCTC TAAACTGGAG GGCTCATTTC ACAGTTCTTA CCACTTCATT CTTGGTTACC
AGCTCGATCA TTTACGGAAT TGCTACTGGA ATCAAGGCTT CTCATGATGG TCAAATAGAC
GACTGGTTGC TTGTGGGACT TATAGTGACC CACTCGTGTC CTACAACAGT GTCGTCCAAC
GTTGTCATGA CCAAACAGGC ACATGGTAAT GATATATTGA CATTGTGCGA AGTCTTTGTA
GGGAACATCT TGGGTGCTTT CATAACCCCT GCTCTCTTGC AATTGTACAT GTCTGGAACC
TGGGACTTCG GAAATCCCAG CCACCAGCCT GATGGAGATA GTACCATCAC TCACTTATAT
GCTGAAACTA TGAAGCAGTT GGGATTATCT GTATTCATTC CCCTATTCGT AGGTCAAGTC
ATTCAGAACG TCTTCCCCAA GCAATCAAAA TGGACTTTGA CGACATTCAA GTTGAACAAG
GTCGGTAGTT TCATGTTATT GTTGATCATG TACCAGTCAT TCTCAACAGC ATTTGCTCAG
GATGCGTTTA CCTCTGTCAG CCATGCCTCT ATCATCTTCT TGGTGTTCTT CAACATTGGA
ATCTATCTCT TCTTCACAGT GTTAACATAC TTCTACGCCA GACCATACTT CATTAAAGCT
TACTACAAAG AAGAGCCAAA TGAGAACTCA ACCAGATTGT ATACTCTTGG TTACAAGTTC
TTTAGACCTT TTTACTACAA CAGAAGAGAC ACAGTTGCCC TTATGTTGTG TGGACCTGCC
AAGACAGCTG CTTTGGGTGT GTCTCTTGTT TCTTCTCAAT ATGGTTCCAA CAATCCAAAG
TTGGGTATAA TTCTTGTTCC TTTAGTGTTA TACCAATCGG AACAAGTGAT CTCAGCTCAA
ATTCTTGTGA ATTTCATGAG GAAATGGATC TACGCTGGTG ATGCTAAACA TGCTGACGAA
GAGAACCAAT TGATCCAAGA ACAGGATATT AATAATTCAC AGGACACTAC AAACGACTCT
CTTGATGACA GTTCCCATAA TGTTATTAGA TCGACCACCT CCATCTCGGT GGATTCTCAC
GATCATAAGA GACCGGCTAA CATAGCGACA TTGTAG
 
Protein sequence
MVDFEAIKQS KPYKWGKAVL DFAIAQWFFV FLAVFIALAH SFPEFAKQGG TIRAEYSIGY 
GAVAVIFLIS GMSMSSKDLM INALNWRAHF TVLTTSFLVT SSIIYGIATG IKASHDGQID
DWLLVGLIVT HSCPTTVSSN VVMTKQAHGN DILTLCEVFV GNILGAFITP ALLQLYMSGT
WDFGNPSHQP DGDSTITHLY AETMKQLGLS VFIPLFVGQV IQNVFPKQSK WTLTTFKLNK
VGSFMLLLIM YQSFSTAFAQ DAFTSVSHAS IIFLVFFNIG IYLFFTVLTY FYARPYFIKA
YYKEEPNENS TRLYTLGYKF FRPFYYNRRD TVALMLCGPA KTAALGVSLV SSQYGSNNPK
LGIILVPLVL YQSEQVISAQ ILVNFMRKWI YAGDAKHADE ENQLIQEQDI NNSQDTTNDS
LDDSSHNVIR STTSISVDSH DHKRPANIAT L