Gene PICST_28893 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_28893 
Symbol 
ID4851633 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009068 
Strand
Start bp2389968 
End bp2391206 
Gene Length1239 bp 
Protein Length412 aa 
Translation table 
GC content46% 
IMG OID640393341 
Productpredicted protein 
Protein accessionXP_001387032 
Protein GI126275106 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG2175] Probable taurine catabolism dioxygenase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.284292 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTCCAT CTGTTCAAGT CCAGCAAGTG GACACCTCCG CAGACTCTAT CACTGAGAGT 
GTTAAGAAGA TCGCTTTAGG CGTTGGAAGG ACTGGTGATG GAACTTCCAA CTTCAAGGGA
GGATTTGCCG ATTCCTCCGT AGACAAGCTT CCAGAGCCTA CCAGAAAAAG ATTCGAAAAG
TACGGAATCG ACATTTCTCG TGGTTACCCT GAAAGACCTC CTACTGAGGA GATTCCGGTT
TTCATTGACG ACGCTACTGC CATCAGAAAC ACACCTTGGG AGTTTATTGA CAGAGGTTCC
AAGGCCGATC CGGAGAAGAA GGCATTGTTG GGGGCTGCTA AGGAAGTTAA ACATTTGACA
AAGCACATTG GTACTGAGAT TGTAGGTTTG CAATTGAGCG AACTCACGGA CCAACAGAGA
GATGAATTGG CCCTCTTGAT TGCCGAGAGA GTCGTAGTCT TCTTCAGAGA TCAGGATTTG
TCTCCACAGA AACAGTTCGA ATTGGGCGAA TACTTCGGCA AAGTTGAAGT TCATCCTCAA
CAGGTTCACG TTCCTGGCAT TCGTGGTATT ACGGTCATCT GGCCTGAGCT TTTTAAGAAA
TTTGGTCCTA TCACCTTCAG AAAGACTTTG AACCATTTCA CCTCGAGGTG GCACACTGAC
TTGGTTCACG AATTGCAACC TCCAGGGATC ACTCATTTGC ACAATGATAC CATTCCTGAA
GTTGGGGGAG ACACCGTTTG GGCTTCTGGT TATGCCGCTT ACGACAAGCT TTCTCCAGCT
TTGCAAGAAT TCCTTGATGG GAAGAAGGCT GTATACTTCT CTGCTAACAA GTACGTTGAT
CGTGAGAACC CATTGAAGGG TACTGTTCAC ATTGAAAGGG AACACCCAAT CATCAGAACC
CATCCTGTTA CCGGCTGGAA GTCCTTGTAT GTCAACCGTG CTATGACCAG CAGAATTGTA
GGTTTAGAGC CAGGTGAATC AAAGGTCATC TTAGAGTATT TGTTTGATGT CTTTGAAAAG
AACTTGGACA TCCAGGTCAG GTTCAACTGG AAGCCATCCC AGCCAGGCTT GGGTACTTCT
GCTCTTTGGG ATAACAGAAT CAGTCAGCAT TTTGCTGTTC TTGATTACGA GGGCCAAGAA
CCAAGACACG GTACGAGAGT AAGTTCATTG GCTGAGGTTC CTTTCTACGA TGCCGAATCC
AAGTCTCAGA GAGAAGCTTT GGGATTGTCC TTAGATTAG
 
Protein sequence
MAPSVQVQQV DTSADSITES VKKIALGVGR TGDGTSNFKG GFADSSVDKL PEPTRKRFEK 
YGIDISRGYP ERPPTEEIPV FIDDATAIRN TPWEFIDRGS KADPEKKALL GAAKEVKHLT
KHIGTEIVGL QLSELTDQQR DELALLIAER VVVFFRDQDL SPQKQFELGE YFGKVEVHPQ
QVHVPGIRGI TVIWPELFKK FGPITFRKTL NHFTSRWHTD LVHELQPPGI THLHNDTIPE
VGGDTVWASG YAAYDKLSPA LQEFLDGKKA VYFSANKYVD RENPLKGTVH IEREHPIIRT
HPVTGWKSLY VNRAMTSRIV GLEPGESKVI LEYLFDVFEK NLDIQVRFNW KPSQPGLGTS
ALWDNRISQH FAVLDYEGQE PRHGTRVSSL AEVPFYDAES KSQREALGLS LD