Gene PICST_58162 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_58162 
Symbol 
ID4838623 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009044 
Strand
Start bp978361 
End bp980466 
Gene Length2106 bp 
Protein Length656 aa 
Translation table12 
GC content42% 
IMG OID640389938 
Productpredicted protein 
Protein accessionXP_001384142 
Protein GI126135236 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0479106 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.311392 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGACT ACTACGAACC TACACTTTTG TTCAGGCAAA ACGCCTTAAG AAGATACTGT 
CCAAGTCTTT CCCCAATTTC GAGTGTAGAA ACCTTGAGCT CCAGTATCTT GACAAATGAA
AACATTAAGG GAAATGTATC TTCTTGGATG TTCAATAGTG CCAATCCTAA AGATACAGAG
GTGTTGAACC AGTCCTGTTC CTTGAACAGA ATGAATATCA AGTCTAACTA CTGGAAGATC
CCAGACACTA ATATGAACCT TACGGCAATG GCCATCACAG ACACACATAC CGACAACCCG
TTATTTTCTG TGTCGAGTGC CAACAATGAG TCCAACTTGT TCATCTATGA ATTGGATCTT
CTTGGCAATT ATTTGACCCA CCACAACACA ATCAGTCTTC CAAATATCAA CGGAATGAAA
TGGTTACCAA ATAGCAATAG GCATTTGGTC ACTGGCAATA GCAAAGGCTA TGCTCATTTA
GTTTCCATCC CTGAAGTAAA CAGAACGGGC AACGAAGATT CTGAAGAACA ATCGGCCGAG
ATCTGCAAGA GATTCAACCA CAGAAAGCAC ATCAAGAGCA AACAAGACAT CAACAAACAT
AGTACAATTT CCAAACTCGA TTTCATGAAT AACGACAACA GCAGCCTCCT TTCCATTTAC
AACAACAACC TCTTCTACTG GGACATGAAT GATGCCGAAG CCCAAAGAAG ACCTACTCCA
ATATCCATAT CGTCCATATC TGGTCTTGCC AATTTCGACC CATTACCTAC TCACAATGCC
AACTTGGTAG GAATTTGCGG TAAGTTTGGT GTCTCTTTGT TTGACTTGAG ACAGCCCAAG
TTCACTGTTC CTCCTTCCAT TTTGGAGTAT GCATCCAAGA AGAAATTAGG TGCAAACCAA
ATGAGATGGA ATCCCAACAA TGAAAATGTT TTTGCAGCAG CTCACAGAGA TGGAGTTGTA
CGGTTGTGGG ATATCAGAAA GCAAGACAAC TTTGCCAATT TGAGTGGACA CACCGATAAG
ATCAGCAGTT TGGAGTGGAA CGATGGTGAT TTGTTCAGTG GATCCAGAGA CGGTAATATA
GTGCATTGGG ACTTGACCAG TGATTTAAGT GCCAACAACC AATTCATGAA CTGTGGATTG
AAAGAAGGCT TGGATAGCGT TCATTTCAAC CCACATATGA ACAGGTTGGA GAGAGCCATC
AACGAAAGAC AATGTGGTAC AGTTTTGCCA GCTTCTAACA CCAACATCAT AAGCATGTGT
TCTGTAACCG GCAGTGACAA TTCTAAAGAC GACATGAAAG TGCTATCTAT CGATGGTAGT
TCGTTCTTTG GTGTGCACTC CAAGATATTC GATGCTGTGA ATATTTCCAT GACTTCAGAC
AAGTTGTACT ATACTGAATC TGATATTCAA TTGATGATGA AGAGCGAGAA TTCCAACAAT
ACATTAGTAG GCTCTACCGA TAGCATCAAC GAGCAAGTAA CTGCTCCTCT TGCCATTACT
AGAAAGTCTA CTTTGAAGGA TTTCGCTCAA GCCGCCGATG CTGCCAGACC ATCCAACCTT
TCCAAAGACA CTTTGTTGGG ATCAGTTGAA GATTTGAAAT TGGCTCCTGA GCCTATTGTG
GTGGATGATG ATGATTTGAA AATCACCAAG GAGATTGAAG TTATCGATGT AGATGCTGAA
GCTGGAGAAG CGCAAGAAAT AATAGCAGAG GACGATTTAG AAGATTATAA CGATTTCACA
TTTGCTCCAC CTTCGTTCAT TCCTATACAG AATGGCAATG TGTCTACGTG CTCTGTCGAC
TACAGTGAAA CAAGTTCACG TAATGCTAAA GAGATGTTTA ACGACTCCAC TGATACTCTT
ACTACAGACC CTACCGAGCA CGAGCTTGAT AGTGAAGAAG ACAGCGGGAT CTCTTCCGTC
GAATCGTCGC CTTTAAAGAG GGAAGCTTCA TTCAAGTTCC AGTTATTGGA TTCTCTTGAT
TTCGAGGAGA AGAAGTTGCC TCGTGACGAT TCGTTTAATA CTGAAATGTT TAATGACTTA
AGAATGGCCA GGCAGGCATC TGTCAGAACC ATCGGGACAC ACTATCGCAA TGTTTACAAT
GGTTAG
 
Protein sequence
MTDYYEPTLL FRQNALRRYC PSLSPISSVE TLSSSILTNE NIKGNVSSWM FNSANPKDTE 
VLNQSCSLNR MNIKSNYWKI PDTNMNLTAM AITDTHTDNP LFSVSSANNE SNLFIYELDL
LGNYLTHHNT ISLPNINGMK WLPNSNRHLV TGNSKGYAHL VSIPEVNRTG NEDSEEQSAE
ICKRFNHRKH IKSKQDINKH STISKLDFMN NDNSSLLSIY NNNLFYWDMN DAEAQRRPTP
ISISSISGLA NFDPLPTHNA NLVGICGKFG VSLFDLRQPK FTVPPSILEY ASKKKLGANQ
MRWNPNNENV FAAAHRDGVV RLWDIRKQDN FANLSGHTDK ISSLEWNDGD LFSGSRDGNI
VHWDLTSDLS ANNQFMNCGL KEGLDSVHFN PHMNRLERAI NERQCGTVLP ASNTNIISMC
SVTGSDNSKD DMKVLSIDGS SFFGVHSKIF DAVNISMTSD KLYYTESDIQ LMMKSENSNN
TLVGSTDSIN EQVTAPLAIT RKSTLKDFAQ AADAARPSNL SKDTLLGSVE DLKLAPEPIV
VDDDDLKITK EIEDDLEDYN DFTFAPPSFI PIQNGNVSTC SHELDSEEDS GISSVESSPL
KREASFKFQL LDSLDFEEKK LPRDDSFNTE MFNDLRMARQ ASVRTIGTHY RNVYNG