Gene PICST_43041 details


Gene Information

Locus tag: PICST_43041
Symbol:
ID: 4837812
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009043
Strand:
Start bp: 1643892
End bp: 1645043
Gene Length: 1152 bp
Protein Length: 383 aa
Translation table: 12
GC content: 44%
IMG OID: 640389127
Product: predicted protein
Protein accession: XP_001383599
Protein GI: 150864666
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG2175] Probable taurine catabolism dioxygenase
TIGRFAM ID:
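
The start and end coordinates, the gene length, and the protein length listed above can be cross-checked with a little arithmetic. The Python sketch below assumes the coordinates are 1-based and inclusive, and that the unspliced CDS ends with a single stop codon (the listed gene sequence ends in TAA). The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 1643892
end_bp = 1645043

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: unspliced CDS whose final codon is a stop codon,
# so the protein has one residue fewer than the codon count.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, protein_length)  # prints "1152 383"
```

Both values agree with the Gene Length (1152 bp) and Protein Length (383 aa) fields.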


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCTCCCG TTGCTGCTGC TACTAGATAC GACCCCTCGC AGTTCAACCC TGAACATAAT 
GCCACAACGG AATCTGGGTC TTATACCATC AGCAAGGACA ATAGGGCTAT TGCCAAATTC
CCTGATTTTG TTCCAACCTG GAACCCCAAC CAGAAGTTCC CACCTTTGAA GTTCTTCAAA
CATACCGACA AAGGAACATT GGCTGATCCA GAATTAAGGA ACTTGTTCCC AGCCAATGGT
ACCCATAAAG TCAAGAAGGT TACTCCCAAG CTTGGCTCCG AAGTCCATGG AATTCAGTTG
TCTCAACTTG ATGATAAGGG CAAAAACGAC TTGGCTCTCT TTTTAGCCCA GAGAGGTGTT
GCTATTTTCA GAGACCAAGA CTTCAGCAGT TATGGTCCTG AATTTGCTGT AGAATACGGC
AAGTACTTTG GTCCATTGCA TGTTCATCCT ACCTCGGGGT CTCCAGAAGG GTTTCCTCAG
TTGCATATTA CGTTCAGAGG CGCCTCTCAG AATGAATTGG ACAGTGCCTT CGAGACTAGA
ACGAACAACA TTGGCTGGCA TTCTGACGTT TCATACGAGC TCAACCCTCC TCAAATAACA
TTTTTCAGCG TTCTTGAGGG ACCTGAATCT GGGGGTGATA CCATTTTCGC CGACACTCAA
GAAGCGTATA AGAGATTGAG CCCAACCATG CAAAAGATGT TGGAAGGTTT ACACGTTTTG
CATACTTCTG AAGATCAAGC CCATATCAAC CAGGCTGCAG GTGGAATCTG TAGAAGAGCT
CCTGTTTCTA ACATACACCC TCTTGTAAGA CAACACCCGG TGACAAAAGA AAAATTCTTG
TTTTTGAATA GGGAGTTCGG TAGAAGAATT GTAGAGTTGA AAGAGGAAGA ATCAGAGAAT
TTGCTTGAAT TCTTGTTCAA CCATGTTGAG CTGGCTCACG ACTTGCAACT CAGAGCCAAC
TGGGAACCTA ACACGGTGGT TTTATGGGAC AACAGAAGAA CTGTCCACTC AGCCATTATC
GATTGGGATA CTCCAGTGTT AAGACACGCA TTTAGAATCA GTCCCCAAGG AGAAAGGCCC
GTGGAAGACT TGAAGGATTT GAATAATGAG AGTTATTTAA AAGAAAAGTA CTCCGTTATT
AAGAGAGGTT AA
 
Protein sequence
MAPVAAATRY DPSQFNPEHN ATTESGSYTI SKDNRAIAKF PDFVPTWNPN QKFPPLKFFK 
HTDKGTLADP ELRNLFPANG THKVKKVTPK LGSEVHGIQL SQLDDKGKND LALFLAQRGV
AIFRDQDFSS YGPEFAVEYG KYFGPLHVHP TSGSPEGFPQ LHITFRGASQ NELDSAFETR
TNNIGWHSDV SYELNPPQIT FFSVLEGPES GGDTIFADTQ EAYKRLSPTM QKMLEGLHVL
HTSEDQAHIN QAAGGICRRA PVSNIHPLVR QHPVTKEKFL FLNREFGRRI VELKEEESEN
LLEFLFNHVE SAHDLQLRAN WEPNTVVLWD NRRTVHSAII DWDTPVLRHA FRISPQGERP
VEDLKDLNNE SYLKEKYSVI KRG
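
The record lists translation table 12 (the alternative yeast nuclear code), in which the codon CTG is read as serine rather than leucine. This is why the protein sequence shows S where the gene sequence has CTG. The following is a minimal Python sketch of translation under that table, built from the NCBI-style amino acid string in TCAG codon order. The function name `translate_table12` is hypothetical, and only short excerpts of the gene sequence are used as input.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for NCBI translation table 12, in TCAG codon order.
# Its amino acid assignments differ from the standard code only at CTG (S, not L).
TABLE_12_AAS = "FFLLSSSSYY**CC*WLLLSPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), TABLE_12_AAS)
}

def translate_table12(dna):
    """Translate DNA codon by codon, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the display spaces and newlines
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 30 bases of the gene sequence.
print(translate_table12("ATGGCTCCCG TTGCTGCTGC TACTAGATAC"))  # prints "MAPVAAATRY"
# In-frame excerpt containing CTG, which table 12 reads as serine.
print(translate_table12("CATGTTGAG CTGGCTCAC"))  # prints "HVESAH"
```

Under the standard code the second excerpt would translate to HVELAH. The S in the listed protein sequence is therefore consistent with translation table 12.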