Gene PICST_51444 details


Gene Information

Locus tag: PICST_51444
Symbol: TCD5.2
ID: 4851225
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009068
Strand:
Start bp: 1248091
End bp: 1249365
Gene Length: 1275 bp
Protein Length: 424 aa
Translation table:
GC content: 46%
IMG OID: 640392933
Product: taurine catabolism dioxygenase
Protein accession: XP_001387469
Protein GI: 126274211
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG2175] Probable taurine catabolism dioxygenase
TIGRFAM ID:
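
The start, end, gene length and protein length values in this record are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the unspliced CDS ends in a stop codon; neither convention is stated in the record itself, and the function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(1248091, 1249365)
print(length, protein_length_aa(length))  # prints: 1275 424
```

Both results match the Gene Length (1275 bp) and Protein Length (424 aa) fields of the record.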


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.353625
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.08977
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCTCCTC CTGCTGCTAC CTCATCTTCC GCCCCAACTG AAGACGACAT TGAACAAACC 
GTCAGAAAGT TGGCTGCCTT GAAGCCAATC GGCCACAGAT TTTCCAACAA TGCTAAGACT
GGTCCTCAAT TGGAATGGTT AGCAAAGTTG CCAGAACCTG CTAGAAAAAG ATTTGAAAAG
GCAGGTATTG ATTTGTCCAA CGGTTATCCT GTTATTCCTA AATCGGAAGA TATTCCAAAG
TTTGTTGATG AAGCATTTGA AATCAGAAAC AAGGACTATC CATACATTGA AAGGGGTGCA
AATGCCGACC CTGAGAAGAA GGCATTGTTT GGAGCTGCCA AAGAAGTTAG ACACTTGACC
AAGCACCTTG GTACAGAAAT TGTAGGTTTG CAGTTGAGCG ACTTGAACGA CAAGCAAAAA
GACGAATTGG CCTTGTTGGT GGCTGAAAGA GTTGTCGTCT TTTTCAGAAA CCAAGACTTG
TCTCCTCAGA AGCAATTGGA ATTGGGTGAA TACTGGGGTC AAGTTGAAAG ACACCCACAA
GCTCCACACG TTCCATTGCC AATCCCTGAA GGTACTGAAA CTATTGCCAA GGGTAGTGGT
GTCAGTGTAA TCTGGAGAAA GTTTTTCAGC GAATTCTATG GATTCCCTGG TGGTTTCAGG
AAGAAGTCCA TCACCTCAGG CTGGCACACT GATTTGGTCC ATGAGCATCA ACCAGCAGGT
ATCACCCACT TGCACAACGA CACGATTCCA AAGACTGGAG GTGACACTGC ATGGGCATCT
GGTTATGCTG CATACGACAA GTTGTCTCCA GCCTTGCAAA AGTTCCTTGA CGGAAAGACA
GCTATCTACC GTTCCGCTCA CCAGTACCTT GACCGTGAAA ATCCATTGAA GGGACCAAAG
TACATCGAAA GAGAGCACCC TATTGTGAGA ACCCATCCTG CCACTGGCTG GAAGTACTTG
TTCGTCAACA GATCCATGAC TGACAGAATT GTGGGTTTGG AACCAGGTGA ATCCAAGGTT
ATTTTGGAGT ACTTGTTCTC AGTCTACGAG AAGAACTTGG ACATTCAAGT GAGATTCCAA
TGGCAACCTA CAAACGAAGG CTTTGGAACT TCTGCTATCT GGGATAACAG AGTTTCTCAG
CACAATGCTA TTTCTGACTA CGACTTCGAT GGCGATGAAC GTCATGGAAC TAGAGTCACT
TCTTTAGCTG AGCTTCCTTA CTTCGACCCC AAGTCCAAGT CTCAAAGAGA AGCATTGGGC
TTGTCGTTAG ATTAG
 
Protein sequence
MAPPAATSSS APTEDDIEQT VRKLAALKPI GHRFSNNAKT GPQLEWLAKL PEPARKRFEK 
AGIDLSNGYP VIPKSEDIPK FVDEAFEIRN KDYPYIERGA NADPEKKALF GAAKEVRHLT
KHLGTEIVGL QLSDLNDKQK DELALLVAER VVVFFRNQDL SPQKQLELGE YWGQVERHPQ
APHVPLPIPE GTETIAKGSG VSVIWRKFFS EFYGFPGGFR KKSITSGWHT DLVHEHQPAG
ITHLHNDTIP KTGGDTAWAS GYAAYDKLSP ALQKFLDGKT AIYRSAHQYL DRENPLKGPK
YIEREHPIVR THPATGWKYL FVNRSMTDRI VGLEPGESKV ILEYLFSVYE KNLDIQVRFQ
WQPTNEGFGT SAIWDNRVSQ HNAISDYDFD GDERHGTRVT SLAELPYFDP KSKSQREALG
LSLD
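
The sequences above are laid out in space-separated blocks of ten characters, so they need to be joined before any further use. The following is a minimal sketch of that clean-up, with a GC-fraction helper and a stop-codon check added for illustration. The helper names are hypothetical, and the stop codon set (TAA, TAG, TGA) is an assumption, since the Translation table field of this record is empty.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # assumed; the record does not state a translation table


def normalize_sequence(block: str) -> str:
    """Remove all whitespace from a block-formatted sequence and upper-case it."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def ends_with_stop(seq: str) -> bool:
    """True if the last three bases form one of the assumed stop codons."""
    return seq[-3:] in STOP_CODONS


# Last line of the gene sequence as printed in this record.
tail = normalize_sequence("TTGTCGTTAG ATTAG")
print(tail, ends_with_stop(tail))  # prints: TTGTCGTTAGATTAG True
```

Applying `normalize_sequence` to the full gene sequence and passing the result to `gc_fraction` gives a value that can be compared with the GC content field of the record.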