Gene PICST_33987 details


Gene Information

Locus tag: PICST_33987
Symbol:
ID: 4841118
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009048
Strand:
Start bp: 942466
End bp: 943764
Gene Length: 1299 bp
Protein Length: 432 aa
Translation table: 12
GC content: 43%
IMG OID: 640392433
Product: predicted protein
Protein accession: XP_001386601
Protein GI: 126140158
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG2175] Probable taurine catabolism dioxygenase
TIGRFAM ID:
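
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is only illustrative: it assumes the Start bp and End bp values are 1-based and inclusive (the record does not say so explicitly), and the variable names are made up for this example.

```python
# Values copied from the Gene Information fields above.
start_bp = 942466
end_bp = 943764
gene_length_bp = 1299
protein_length_aa = 432

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# An unspliced CDS has one codon per residue plus one stop codon.
expected_cds_bp = (protein_length_aa + 1) * 3

print(span_bp == gene_length_bp)
print(expected_cds_bp == gene_length_bp)
```

Under that assumption, both comparisons agree with the reported gene length of 1299 bp.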


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACATTCA AAGTAAGCGG TGCTGAATTG TTTGACCCTT TAGTGGTCAA GGCCATTAAG 
AGTGGTGCCT ATGGTGCCAT TTCTGCTGAT AAGATCCTAG AAGAAAGAAC CAAGATCAAA
TACCCTGAAT ACTTGAGCCC TTATAAGGCC GAAGACGCCG GGAAATACAA CAACTTCAAG
AACGTTGACG GGGCTAAAAA CGACAAGGGT CACTTGGGAG ATCCTACTTT CAAGAATTTG
TTCAAACCTG GTACTAAGGT CAAGACGGTG GATTTGTCGC CCAACTACGG AACAGAAATA
GACGGTATTC AATTGAGCGA ATTAGATGAT GCTGGGAAAA ATGACTTGGC TCTCTACTTG
GAGACGAGAG GGTTGGCTGT TTTCAGAAAT CAGGATTTCA GAGACAAGGG TCCAGCTTTC
GCTAAACAGT TCGGAGAATA TTTTGGTCCT TTACACATCC ATCCAGTTAG TTTTGCGGCT
GAGAATTATC CTGAGTTGTT GGTGACCTAC AGACCAGCGG GTGGTGCAGA GAGATACCCT
GTACAGTTTG CCAACTCAAC GAATACCGCA GGCTGGCACT CGGATATCAG TTTTGAAGAG
TATCCATCTT CTTTCAGTTT TTTCGTTGCT TTGGAAGCCC CAGAAAGCGG GGGTGACACT
GTGTTCCTTG ATTTGAGAGA AGCATACAAG AGATTGTCAC CTCAAATACA GAAATTCTTT
GAAACTTTGA CAATTATTCA TACCAACTAT TACCAGAACC AGTTTGCCAA GTTGAAGAAC
TACGAAGCAA GAGTGAAGGG CGATTACTTC ACGGAACATC CTTTAGTCAG AACCCACCCG
GTTACTGGCG AAAAATCTTT GTTCTTCTCC AGAGGTTTTG CTCTTAGAAT TAAGGGTCTC
AAGCAGCAAG AATCGGACTC GATTCTTAGT TTCTTGGAAA GTCACGTTTT GAACAACCCT
GAAATTCAAG TTAGAGCTAG CCATCAAGGC ACAGAATCTA GAACTGTTAT TGCCTGGGAC
AACAGAATCT CATTGCATAC TGCAATTGCA GACTTCTTGC AACATGAGAC TCCTGCGCGT
CACCACTATA GAATCACTGT TCTAGGTGAA AAGCCATTTT TTGATGGTTC AGTTGAGGCG
AAGACTATTA ATGGTCACTC GAATGGTCAC TCCAATGGTC ACTCCAATGG TCACTCCAAT
GGTAATTCGA ATGGCCATTC AAATGGTCAC TCGAACGGAA AGTCCAATGG AAAGTCCAAT
GTAGGAGATG TTAGTCTTGA CAAATTGACT ATTTCTTAA
 
Protein sequence
MTFKVSGAEL FDPLVVKAIK SGAYGAISAD KILEERTKIK YPEYLSPYKA EDAGKYNNFK 
NVDGAKNDKG HLGDPTFKNL FKPGTKVKTV DLSPNYGTEI DGIQLSELDD AGKNDLALYL
ETRGLAVFRN QDFRDKGPAF AKQFGEYFGP LHIHPVSFAA ENYPELLVTY RPAGGAERYP
VQFANSTNTA GWHSDISFEE YPSSFSFFVA LEAPESGGDT VFLDLREAYK RLSPQIQKFF
ETLTIIHTNY YQNQFAKLKN YEARVKGDYF TEHPLVRTHP VTGEKSLFFS RGFALRIKGL
KQQESDSILS FLESHVLNNP EIQVRASHQG TESRTVIAWD NRISLHTAIA DFLQHETPAR
HHYRITVLGE KPFFDGSVEA KTINGHSNGH SNGHSNGHSN GNSNGHSNGH SNGKSNGKSN
VGDVSLDKLT IS
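
The record lists translation table 12 (the alternative yeast nuclear code), which differs from the standard code in reading CTG as serine rather than leucine. As a minimal sketch of how the protein sequence relates to the gene sequence, the following builds that codon table by hand and translates the first 60 bases shown above. The helper names `build_table_12` and `translate` are hypothetical, chosen for this example; they are not part of any database tool.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code (NCBI table 1), with codons ordered TTT, TTC, TTA, TTG, TCT, ...
STANDARD_AAS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def build_table_12():
    """Return a codon -> amino acid dict for NCBI translation table 12."""
    codons = ["".join(c) for c in product(BASES, repeat=3)]
    table = dict(zip(codons, STANDARD_AAS))
    table["CTG"] = "S"  # the one amino acid assignment that differs from table 1
    return table


def translate(dna, table):
    """Translate in frame 0, ignoring whitespace and stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = table[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence above (spaces are layout only).
first_line = "ATGACATTCA AAGTAAGCGG TGCTGAATTG TTTGACCCTT TAGTGGTCAA GGCCATTAAG"
print(translate(first_line, build_table_12()))  # MTFKVSGAELFDPLVVKAIK
```

The printed peptide matches the first 20 residues of the protein sequence. Passing the full gene sequence to the same function should reproduce the complete protein, since translation stops at the terminal TAA codon.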