Gene PICST_31162 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_31162 
Symbol 
ID4838601 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009044 
Strand
Start bp98469 
End bp99791 
Gene Length1323 bp 
Protein Length440 aa 
Translation table12 
GC content49% 
IMG OID640389916 
Productpredicted protein 
Protein accessionXP_001384318 
Protein GI126135588 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2873] O-acetylhomoserine sulfhydrylase 
TIGRFAM ID[TIGR01326] OAH/OAS sulfhydrylase 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCTTCCC ACTTCGATAC CCTTCAGCTT CACGCTGGCC AGCAGCTTGA AAAGCCTCAC 
AGAGCAAGAG CTCCTCCTAT CTATGCTACT ACTTCGTATG TCTTTGATGA CTCCAAGCAC
GGCGCCCAAT TGTTTGGTTT GGAGACCCCA GGTATGATCT ACTCCAGAAT CATGAACCCA
ACCAACGACG TGTTTGAACA AAGAATTGCT GCCCTTGAAG GCGGTGTTGG AGCCTTGGCT
ACTTCTTCTG GTCAATCTGC CCAGTTCTTG GCCATTGCTG GTTTGGCTCA CGCCGGTGAC
AACGTTTTGT CCACCTCGTA CTTGTATGGA GGTACCTACA ACCAGTTTAA GGTAGCCTTC
AAGCGTTTGG GAATCGAAGC CAGATTTGTC AACGGTGACA GCGCTGAAGA CTTTGCCAAG
TTGATCGACG ACAAGACCAA GGCTATCTAC ATTGAGTCTA TTGGTAACCC CAAATACAAC
GTTCCTGATT TTGAAAAGAT TGCCAAGTTG GCTCACGACA ACGGTATTCC TCTCGTTGTT
GACAACACCT TCGCTGCAGG TGGTTTCTTG ATCAACCCTT TCAAACACGG TGCTGACATC
ATTGTCCACT CTGCTACCAA GTGGATTGGA GGACACGGTA CCACCATTGC TGGTGTCATT
GTCGACTCTG GTAAATTCCC TTGGAAGAAG TACCCTAAGA AGTACCCTCA ATTCTCTGAA
CCTTCTGAGG GTTACCATGG CTTGATTTTG AACGACGCCT TGGGTGAACA AGCCTTCATT
GGCCACGCCA GAATCGAGTT GTTGAGAGAC TTGGGTCCAG CTTTGAACCC ATTCGGTTCC
TTCTTGTTGT TGCAGGGTTT GGAAACCTTG TCTTTGCGTG TAGAAAGACA ATCTTACAAT
GCTCTCAAGC TTGCCCAGTA CTTGGAAAAG TCGCCATACG TTTCCTGGGT TTCGTACTTG
GGTTTGCCTT CTCACGAAAG CTACGAGCTT TCTAAGAAGT ACTTGAACAA CCCTGACCTT
GCTGGTGGTG CTTTGTCCTT CGGTGTCAAG CCTTTGGAAG GCACTAAGTC CGATGATCCA
TTCCAGGCTG CTTCTCCAAG GGTTGTCGAC AACTTGGAAA TCGCCTCCAA CTTGGCCAAC
GTCGGTGACT CAAAGACTTT AGTCATTGCT CCTTACTACA CCACCCACCA GCAATTGTCG
GAAAGCGAGA AGGTTAACTC TGGTGTCACC GAAGACTTGA TCAGAGTCTC CATCGGTACT
GAGTTCATTG ACGACATCAT TGCTGACTTT GAAAAGGCCT TCAAGACCGT CTACGGTGCC
TAG
 
Protein sequence
MPSHFDTLQL HAGQQLEKPH RARAPPIYAT TSYVFDDSKH GAQLFGLETP GMIYSRIMNP 
TNDVFEQRIA ALEGGVGALA TSSGQSAQFL AIAGLAHAGD NVLSTSYLYG GTYNQFKVAF
KRLGIEARFV NGDSAEDFAK LIDDKTKAIY IESIGNPKYN VPDFEKIAKL AHDNGIPLVV
DNTFAAGGFL INPFKHGADI IVHSATKWIG GHGTTIAGVI VDSGKFPWKK YPKKYPQFSE
PSEGYHGLIL NDALGEQAFI GHARIELLRD LGPALNPFGS FLLLQGLETL SLRVERQSYN
ALKLAQYLEK SPYVSWVSYL GLPSHESYEL SKKYLNNPDL AGGALSFGVK PLEGTKSDDP
FQAASPRVVD NLEIASNLAN VGDSKTLVIA PYYTTHQQLS ESEKVNSGVT EDLIRVSIGT
EFIDDIIADF EKAFKTVYGA