Gene PICST_31380 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_31380 
Symbol 
ID4838913 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009044 
Strand
Start bp650115 
End bp651434 
Gene Length1320 bp 
Protein Length439 aa 
Translation table12 
GC content38% 
IMG OID640390228 
Productpredicted protein 
Protein accessionXP_001384090 
Protein GI150865040 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.593991 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTTCCTG ATAACCAGGA AAGTCCTGAT GGATCTATTA AGAAAGAGGA AGAGTTCTTC 
GAATTTGTTT CGAAGCTACG AAAAGAATTG AGCAGTTCCT GTCCCCCAAA GCTCAGATCG
ATTATGCGCT TTTCGTTATT TGGGATCTTG CTTCTATCAT TCAACATTAT GAACCTCTCG
TATTGTTCAA GATTTGACTT CAAGTATCCT TTGGTTGAAA TAATCACTCC TCTTGTGAAT
ATGAAAGATT TGCCCACAAG CAGTTACAGT TTGTACACCA AACAGAACTA CACTTTTGAC
AATGATTTCT TGACTGAAAT ATTGAAGGAT GTCTACTTAA GACACGAACA AGAATACCTA
CATTCCATGA GCAACAGAAC CATTATTGTC CAGCCTAGCA TTACTTGGGA TATCGAGAGG
ATAATAATTC AACTTAGAGA GGGCACGTAC GATTTTGAGT CCAAAAAATT GATATTCTCT
AACCTCACCG TAATTGATCA AATTGTGGAA GAAAGTCCCA GAAGCACCGC CATTTTCAAT
CTAAAAATAC CAAGCAAGTC CAGTAATGGA TTCTTTCAAC AAAGATTTCT TAGACAGATA
ATCGGCACTT TGTTCTACTT TTCAGAAAAG TCAAAAATTT CTAAGGGCAT CGACTTGTTG
CAGAAACTAC TATTCTTTGC GGATGTAGCA AATGTGTCTA TTCTAGTCTC TGCTATGGTA
CTTTTTTTAA CATCTGTTCC CATGGTTTTA GTCCAAAAGA AAACTACTTA TATGAAGTTG
ATTCTAGAAA GCCTTCCATT TGTGCTGATT TTAACTAATT TTGGCTTTTC GTTATTCAAC
ACCGTTGTTT GGATCATCAT TAGTTACTCT TTCCCATATT GGAAATCTGT TTCATTCGTT
ATCTTCTACT TTTTTCAGTC GTTAGTAACG GTTGTGATTA TATTCAAGTT ATTCTTAAGA
AACACCCAAG ATGATAATCA GAAGTTGTTA TCAGCATTAC CTTCTGTCTT TGCTTCAGAT
AGTTCTTATC CTTCAGTCAC TACTCAAAAT GACAGCCCTG ACGCCATGCA CTCATTCGAG
ATCGAACCAC AAAGTAAGCC GTATTCCAAC AGAATCCGTA TTCCATATAC TGCAGCTGCT
CCCACGAATA CATCAGCGGC CAGAGATACT GATGATATTA TTAGAAGGTG TGCAACAGCT
CCTGCTAGAA GTGAATGCAT AGATGTTGTA GAACCATTGA GAGCATCGTT CGACACTTTG
GATATCCGCT CGCTTACTAG AAGAAGAACA GCCTCTCGAT TTACGGAAGA GTTAGATTAA
 
Protein sequence
MVPDNQESPD GSIKKEEEFF EFVSKLRKEL SSSCPPKLRS IMRFSLFGIL LLSFNIMNLS 
YCSRFDFKYP LVEIITPLVN MKDLPTSSYS LYTKQNYTFD NDFLTEILKD VYLRHEQEYL
HSMSNRTIIV QPSITWDIER IIIQLREGTY DFESKKLIFS NLTVIDQIVE ESPRSTAIFN
LKIPSKSSNG FFQQRFLRQI IGTLFYFSEK SKISKGIDLL QKLLFFADVA NVSILVSAMV
LFLTSVPMVL VQKKTTYMKL ILESLPFVSI LTNFGFSLFN TVVWIIISYS FPYWKSVSFV
IFYFFQSLVT VVIIFKLFLR NTQDDNQKLL SALPSVFASD SSYPSVTTQN DSPDAMHSFE
IEPQSKPYSN RIRIPYTAAA PTNTSAARDT DDIIRRCATA PARSECIDVV EPLRASFDTL
DIRSLTRRRT ASRFTEELD