Gene PICST_29419 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_29419 
Symbol 
ID4836957 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009042 
Strand
Start bp212707 
End bp213969 
Gene Length1263 bp 
Protein Length420 aa 
Translation table12 
GC content42% 
IMG OID640388272 
Productpredicted protein 
Protein accessionXP_001382805 
Protein GI126132560 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG2175] Probable taurine catabolism dioxygenase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.106431 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACATACA AAGTTGCTGG TTCAGAATGG GTTGACCCTT TGGTAGCCAA AGCCATCAAA 
CAAAGTACAT TGGGCGGTTT CATTTCTCCT GACAAAATCG TAGAAGAGAG AAAGAAAATC
AAGTTCCCAG AGTTCTTTGG TCCTTTCAAC CCAGAAGAGG CTGGAAAGTA TGAATTTCTT
AAGAATCCTG ATGGAGCTAA GAATGACAAG GGTCACTTGG GAGATCCAAC ATTCAAAAAC
TTATTCAAGC CAGGGTCCAA GGTGAAGACC ATTGACTTGT CACCAAACTA CGGTACTGAA
ATTGACGGTA TTCAATTGAG TGAATTGGAT GAAGCCGGAA AGAACGACTT GGCTTTATAC
TTAGAAACAA GAGGTTTGGC TGTCTTCAGA AATCAGGATT TTAGGGAGAA AGGTCCAGCT
TTTGCAAAGA AATTTGGTCA ACATTTTGGT CCATTACATA TCCACCCATC GGTAAGCTAT
TCAGCAGAAG AATCCCCAGA GTTGTTGGTT ACATACAGAC CAGCGGGAGG TCCAGAAAGG
TACAATGCAC AATTTGCAGG TACTACTACA ACTACTGGGT GGCATTCGGA CGTCAGTTTT
GAAGAATACC CAGCTTCTTT CAGTTTTTTC GTTGCTTTGG AAGCACCAGA AACTGGCGGA
GATACTGTAT TCCTTGACTT GAGAGAAGCC TATAGGAGAT TGTCTCCCCC AATTCAAAAG
TTCTTTGAAT CTTTGACAAT CATTCACACC AACTATTACC TAAACCAACT TGCTAAATTG
AAGGACTTGG ATACGCGTGT CAATGCTGAT TCTTTTGCTG AACATCCATT GGTCAGAACT
CATCCTGTTA CAGGTGAGAA GTCGTTGTTC TACTCTAAAG GATTTGCCCT CAGAGTAAAG
GGACTCAAGC AGCAAGAGTC AGATGCCATT CTTAGTTTCT TGGAAGACCA TATTAACAAC
AACCCTGAAA TTCAAGTGAG AGCAAGTCAT AGAGGAACCA ATTCGGGAAC TATTATTGCC
TGGGATAATA GAATCTCTAT ACATACTGCT GTTGCTGATT TCTTACAACA CGAGACCGGA
CCTCGTCACC ATTTCAGAAT TACTGTTGTA GGCGAAAAGC CTTACTTCGA GGAAGCTGCC
GAAGAGAAAG TTGCAAATGG ACACATCAAA AGCTCATCTA ATGGACACTC TAACGGACAC
TCTAATGGAA ACTCTAACGG TCACACAATT GCTTCTAATG GCTCCAATGG TAGTGAGAAC
TAA
 
Protein sequence
MTYKVAGSEW VDPLVAKAIK QSTLGGFISP DKIVEERKKI KFPEFFGPFN PEEAGKYEFL 
KNPDGAKNDK GHLGDPTFKN LFKPGSKVKT IDLSPNYGTE IDGIQLSELD EAGKNDLALY
LETRGLAVFR NQDFREKGPA FAKKFGQHFG PLHIHPSVSY SAEESPELLV TYRPAGGPER
YNAQFAGTTT TTGWHSDVSF EEYPASFSFF VALEAPETGG DTVFLDLREA YRRLSPPIQK
FFESLTIIHT NYYLNQLAKL KDLDTRVNAD SFAEHPLVRT HPVTGEKSLF YSKGFALRVK
GLKQQESDAI LSFLEDHINN NPEIQVRASH RGTNSGTIIA WDNRISIHTA VADFLQHETG
PRHHFRITVV GEKPYFEEAA EEKVANGHIK SSSNGHSNGH SNGNSNGHTI ASNGSNGSEN