Gene PICST_30111 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_30111 
SymbolTCD5.1 
ID4836696 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009042 
Strand
Start bp1965067 
End bp1966311 
Gene Length1245 bp 
Protein Length414 aa 
Translation table12 
GC content46% 
IMG OID640388011 
Producttaurine catabolism dioxygenase 
Protein accessionXP_001383149 
Protein GI126133248 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG2175] Probable taurine catabolism dioxygenase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTCCAG CTGCTACTTC CCAAAAGATC TCAGAAAAGG ACGACTTAGA TGCTACCATC 
AAGAAGTTGG CTTCCTTGAA GCCAATCGGT CACTCCAGTT ACACTGGTGA GATCAAGTCT
GGTTACTCTG GTTCTTGGGC CGAAAAGCTT CCAGAAACTA CAAAGGCTAG ATACGCTAGA
CATGGTGTTG ACATTTCCAA GGGTTACCCT TACGTCCCTG AAGTTGACAA GATTCCTAAG
TTTGTCAACG AAGCTTATGC TATTAGAAAC GAAGTGTACC CATATGTCGA GAGAGGTGCC
AAAGCTGATC CAGAAAAGAA GGCATTATTC GGTGCTGCCA AGGAAGTAAT CAACTTGACC
AAGCATCTCG GTACTGAGAT TGTTGGTTTG CAATTGAGCG ACTTGAATGA CCAACAAAAG
GACGAATTGG CTTTATTGGT AGCTGAAAGA GTTGTCGTTT TCTTCAGAGA CCAAGACTTG
TCTCCCCAGA AGCAATTGGA ATTGGGCCAT TACTGGGGCC AAGTTGAAGT TCATCCACAA
GTTCCTCGTA TAAGTGAAGA ATTCAACGGT GTCTCCGTGA TCTGGCAAGA TTACTACCGT
GCCAAGTATG GTTTGCACCT TAGTTTCAAG AAGGCTATTG GTGGTAATGC GCAATGGCAC
ACTGATTTGG TTCACGAGCT TCAGCCAGCT GGTATCACGC ACTTGCACAA CGATGCTATT
CCATCTGTTG GCGGTGACAC TTTATGGGCT TCAGGTTATG CTGCTTACGA TAAGTTGTCT
CCAGCCTTCC AGAAGTTCTT GGACGGCAAG ACTGCCATCT ACAGATCGGC CCATCAATAT
GTTGACCCAG AAAACCCATT GAAGGGTCCT AAGTATGTTG AAAGAGAACA CCCTATTGTT
AGAACTCATC CTGCTACTGG ATGGAAGTTC TTGTTCGTCA ACCGTTCCAT GACTGTCAGA
ATTGTCGGCT TAGAGCCAGA AGAGTCTAAG ACTATTTTGG AATACTTGTT TAGCGTCTAC
GAGAAGAACT TGGATATCCA GGTCAGATTC AACTGGAGAC CAACCAAGGA AGGCTTGGGT
ACTTCTGCTA TTTGGGACAA CAGAGCTTCG CAGCACTTCG CTGTCTGGGA CCACGAAGGC
AAAGAAAACA GACACGGCAC CAGAGTCACT TCTTTGGCCG AAATTCCATT CTTTGACGAA
AACTCAAAGT CTCAGAGAGA AGCCTTGGGC TTATCGTTGG ATTAG
 
Protein sequence
MAPAATSQKI SEKDDLDATI KKLASLKPIG HSSYTGEIKS GYSGSWAEKL PETTKARYAR 
HGVDISKGYP YVPEVDKIPK FVNEAYAIRN EVYPYVERGA KADPEKKALF GAAKEVINLT
KHLGTEIVGL QLSDLNDQQK DELALLVAER VVVFFRDQDL SPQKQLELGH YWGQVEVHPQ
VPRISEEFNG VSVIWQDYYR AKYGLHLSFK KAIGGNAQWH TDLVHELQPA GITHLHNDAI
PSVGGDTLWA SGYAAYDKLS PAFQKFLDGK TAIYRSAHQY VDPENPLKGP KYVEREHPIV
RTHPATGWKF LFVNRSMTVR IVGLEPEESK TILEYLFSVY EKNLDIQVRF NWRPTKEGLG
TSAIWDNRAS QHFAVWDHEG KENRHGTRVT SLAEIPFFDE NSKSQREALG LSLD