Gene PICST_30906 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_30906 
Symbol 
ID4838122 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009043 
Strand
Start bp1237336 
End bp1238355 
Gene Length1020 bp 
Protein Length339 aa 
Translation table12 
GC content42% 
IMG OID640389437 
Productpredicted protein 
Protein accessionXP_001383521 
Protein GI126133993 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.407472 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0307188 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTACCG AAAAGATTTC TTTCTTGCTT AATTGGCAGC CAACTCCATA CCATATTCCT 
ATCTATATTG CCCAAACCAA GGGCTACTTC AAGGAACAAG GTATTGATGT TTCAATCTTG
GAGCCATCCA ACCCTTCTGA TGTTACTGAG CTCATTGGTT CTGGAAAAAT TGACATGGGA
TTGAAGGCTA TGGTTCACAC TTTGGCTGCA AAAGCCAGAG GTTTCCCTGT CACATCAATT
GGTTCCTTGT TAGACGAACC ATTCACTGGG GTCTTGTATT TGGAAGGTTC TGGAATCACT
GCAGACTTCC AATCTTTGAA AGGAAAGAGA ATTGGCTACG TTGGTGAGTT TGGCAAGATC
CAAATCGACG AATTGACCAA ACACTACGGA ATGACTCCAG AAGACTATAC CGCTGTGAGA
TGTGGCATGA ATGTAGCTAA GTACATTATC GAAGGTTCCA TCGATGCTGG TATTGGTATT
GAATGTATTC AGCAAGTCGA ATTGGAAGAC TATCTAAGAA AACAAGGAAG GCCAATTTCT
GATGCTAAGA TGTTGAGAAT CGACAAGTTG GCTGAACTTG GGTGCTGTTG TTTCTGTACT
ATCTTGTACA TTGCAAATGA CAACTTCTTA AAAGAGAACC CTGAAAAGAT TAGAAAGTTT
TTGAAGGCGG TGAAGAATGC TACCGACTTT GTTCTCACTA ACCCCAAGCA GGCTTGGGAG
GAGTACAGCG ACTTCAAGCC GCAGATGACT TCGGAATTGA ACAACAAAAT GTTCGAAAGA
TGTTTCGCCT ACTTCTCAGA CTCATTGTAC AATGTACATC GTGACTGGAA GAAGGTAACT
GCCTATGGTA AGAGATTGGA TATCATTCCT CAGGATTTCC AGTCCAACTA TTCGAACGAG
TACTTGTCTT GGCCAGAACC AAAGGAAGCT GAGGACCCAT TGGAAGTTCA AAGGAAGATG
GCTGTTCACC AGGATGAATG TAAAGCTTGT GGAGGTTACA GAAGATTGGT TCTTTCGTAG
 
Protein sequence
MSTEKISFLL NWQPTPYHIP IYIAQTKGYF KEQGIDVSIL EPSNPSDVTE LIGSGKIDMG 
LKAMVHTLAA KARGFPVTSI GSLLDEPFTG VLYLEGSGIT ADFQSLKGKR IGYVGEFGKI
QIDELTKHYG MTPEDYTAVR CGMNVAKYII EGSIDAGIGI ECIQQVELED YLRKQGRPIS
DAKMLRIDKL AELGCCCFCT ILYIANDNFL KENPEKIRKF LKAVKNATDF VLTNPKQAWE
EYSDFKPQMT SELNNKMFER CFAYFSDSLY NVHRDWKKVT AYGKRLDIIP QDFQSNYSNE
YLSWPEPKEA EDPLEVQRKM AVHQDECKAC GGYRRLVLS