Gene PICST_82631 details

Gene Information

Locus tag: PICST_82631
Symbol:
ID: 4837867
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009043
Strand:
Start bp: 227934
End bp: 229797
Gene Length: 1864 bp
Protein Length: 551 aa
Translation table: 12
GC content: 46%
IMG OID: 640389182
Product: predicted protein
Protein accession: XP_001383673
Protein GI: 150864724
COG category:
COG ID:
TIGRFAM ID:
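
The gene length in this record matches the start and end coordinates if they are read as 1-based and inclusive. That reading is an assumption based on the numbers agreeing, since the record does not state its coordinate convention. A minimal sketch of the check, with `span_length` as a hypothetical helper name:

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    # Both endpoints count as part of the feature, hence the +1.
    return end_bp - start_bp + 1


# Coordinates taken from the record above.
print(span_length(227934, 229797))  # 1864
```

Under a 0-based, half-open convention the same coordinates would give 1863 bp, which is why the convention is worth confirming before reusing the values.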


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.293318
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTTTCCT CGGGAAATTC AGCGGGAGGC GGTCTTTTTG GTGCGTCTCA GAACAATCCA 
GGCTCTTCTT CCATGTTTGG ATCAGCACAA CCAGCGTCTG GAACCTTTGG CCAGAATACA
GCTACGGCTA GCGGCCCATT GCAAGTGAAC ACAAACTCTG GTGGTTTATT TGGCGGTGCT
TCCAAAGCTC CCGCCACCTC TGGAGGGCTT TTTGGTTCTT CTAGCGGAGT TGGAGGTTTT
GGTTCAGCTT CCAATTCGAC CGGTTTTGGC AATGCGGCCG GAAATGCCGC TCCAGCTTCG
GGATTTGGTG CTTCTAGTAA TACTTTAGGT GGCGGATTTG GCCAGACGGG TAACAAACCA
GCCACAGCTG CTGGTGGTGG TCTCTTTGGA GGTTCTACCA ATTCTAACAC TGGAGGTTTG
TTTGGGAAGC CTGCCGGAAT TGCTGCCCCA GCGGCTTCCA CAGGTGGAGG ATTGTTTGGC
GGAAATACCC AGCAAAATCT GGCTGCTGGA GGTGGACTTT TTGGAGGCTC TACAGCTACT
TCTGGTGGCT TATTTGGCAA TAAACCCAGT GGAGCTGCTG GAAATAGTGG AGGATTGTTT
GGAAGTGGGA ACACTGCTTC TGGTGGCAAC ACTGCCGGCA TGTTTGGTGC CAATACTGCC
AATTCTGGAG TCAATACCGG TTCTAGTTTA TTTGGAAGCA AGCCGGCAGC ATCTACAAGT
GGAGGACTTT TTGGAACTTC TAATACAGCC TCTAATACAG GAGGGGGATT GTTTGGATCT
CAGCAGCAAC AGCAACAGCA ACAACAGCAG CAACAACAAC AACCACAACA GTCTTCTTTA
TTTGGAGTCA ATTCAACCAA CAATGCCCAG CCAGCTTTTG GCTGGAACAG TAGCCAACAA
CAGAAGTCTA GTTTTGGCAC GTCTCAACCA GCATTAAACA ACACATTTGG TGCAACTAAT
CCTATCGCAT CGGCAGCTCC AGCTTCCAAC ACTAATAACA AGTATACACC AGCAATAAAC
GATCAGTTGA TCAAGATCAA AGAGCAATGG GACCCTAATT CGCCTAAATG TGCATTGAAG
ACTCACTTCT ACAACAAATT CAGTGAGCAG GAAATCAACA TTCTCTTGAA CCAGCAGAGA
CCTAACAACG AAACCCCAGA AGACTGGGAT AACGCTATGA GCAAAAGACC CACGGCTAGC
CATTATCCTA TCAAGATTTC GTCATTTAAC GATGTAGCAC AGAGAATAGA AACTCAGCTT
GAACATGTGT CAAAGTCTAG GGTGATATTG AATGAAATCA ACGAGAAACT GAATGCATTG
TCTTCCAAGC ACGATTTGGA AAATACTACT AGAATTTTAA AGGCGAAAGC CAAACATACA
AAGTTGTCAA GAAGGTTGTT GAGATTGGCC ACAGTGTTGG CCATCTTGAA GTTGAAGGGT
TACCCTCTTT TGCCAGAAGA AGAAGAGATT TCCAAGCAGT TCGATGTGTT ATCTTCTAAG
CTTAATGATC CAAACAGTCC CGTAGGGAAG TTGAGTGACG TCTTTGCCAG ATTGGCAATC
TTGAAAGAGA GAGCCGAGGA CTTGAACTAC CAGTTCGAAG TCTCTATTGG TGGGTTGAAC
GGTTTGGCTA ATGATGACAA ACAGGAACAG AGAGTGGCTG AACAGAAAAA CAGCAATATC
GAAGAGACCA TCAACAAGTT ATCCAAGGTT CTCTTGAAGC AACAAATGGG ATTGAACTAC
TTGAACGAGG TGTTGGAGAA GGATTTGGAG GTGGTGGAAA AGGTTGCTTC TCGCTAAGAG
AAGATCCTAC TAATGTATAT TAAATAGATC ACAACAAAAG TAATAGTAGA TACTGCTAAG
TGCT
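
The gene sequence is printed in blocks of 10 bases separated by spaces, so the whitespace has to be removed before the length or GC content can be recomputed. The sketch below shows one way to do that. The function names `clean_sequence` and `gc_fraction` are hypothetical, and only the first sequence line is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a blocked sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C in an already cleaned sequence."""
    if not seq:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence above, copied with its block spacing.
first_line = "ATGTTTTCCT CGGGAAATTC AGCGGGAGGC GGTCTTTTTG GTGCGTCTCA GAACAATCCA "
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(f"{gc_fraction(seq):.0%}")
```

Running the same two functions over all sequence lines joined together would give values to compare with the Gene Length and GC content fields.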
 
Protein sequence
MFSSGNSAGG GLFGASQNNP GSSSMFGSAQ PASGTFGQNT ATASGPLQVN TNSGGLFGGA 
SKAPATSGGL FGSSSGVGGG GFGQTGNKPA TAAGGGLFGG STNSNTGGLF GKPAGIAAPA
ASTGGGLFGG NTQQNSAAGG GLFGGSTATS GGLFGNKPSG AAGNSGGLFG SGNTASVNTG
SSLFGSKPAA STSGGLFGTS NTASNTGGGL FGSQQQQQQQ QQQQQQQPQQ SSLFGVNSTN
NAQPAFGWNS SQQQKSSFGT SQPALNNTFG ATNPIASAAP ASNTNNKYTP AINDQLIKIK
EQWDPNSPKC ALKTHFYNKF SEQEINILLN QQRPNNETPE DWDNAMSKRP TASHYPIKIS
SFNDVAQRIE TQLEHVSKSR VILNEINEKS NALSSKHDLE NTTRILKAKA KHTKLSRRLL
RLATVLAILK LKGYPLLPEE EEISKQFDVL SSKLNDPNSP VGKLSDVFAR LAILKERAED
LNYQFEVSIG GLNGLANDDK QEQRVAEQKN SNIEETINKL SKVLLKQQMG LNYLNEVLEK
DLEVVEKVAS R
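
The record lists translation table 12 (the alternative yeast nuclear code), in which the codon CTG is read as serine instead of leucine. The sketch below illustrates the effect on a short excerpt of the gene sequence. The reading frame was chosen by hand so that the excerpt lines up with the protein segment NEKSNAL, and the names `build_codon_table`, `TABLE_12` and `translate` are hypothetical. It is not a full gene translation: the gene is marked as spliced, and intron positions are not given in this record.

```python
BASES = "TCAG"
# Amino acids of the standard code (NCBI table 1), with codons ordered
# TTT, TTC, TTA, TTG, TCT, ... ('*' marks a stop codon).
STANDARD_AAS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def build_codon_table(amino_acids: str) -> dict:
    """Map each of the 64 codons to its one-letter amino acid."""
    codons = [a + b + c for a in BASES for b in BASES for c in BASES]
    return dict(zip(codons, amino_acids))


STANDARD = build_codon_table(STANDARD_AAS)
# Table 12 assigns the same amino acids as the standard code except CTG -> Ser.
TABLE_12 = dict(STANDARD, CTG="S")


def translate(dna: str, table: dict) -> str:
    """Translate DNA in frame 0, ignoring whitespace and any trailing partial codon."""
    dna = "".join(dna.split()).upper()
    usable = len(dna) - len(dna) % 3
    return "".join(table[dna[i:i + 3]] for i in range(0, usable, 3))


# Excerpt from the gene sequence above (frame chosen by hand, an assumption).
excerpt = "AACGAGAAACTGAATGCATTG"
print(translate(excerpt, TABLE_12))  # NEKSNAL
print(translate(excerpt, STANDARD))  # NEKLNAL
```

The table 12 result agrees with the listed protein sequence, while the standard code would place a leucine at the CTG position.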