Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_43041 |
Symbol | |
ID | 4837812 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009043 |
Strand | + |
Start bp | 1643892 |
End bp | 1645043 |
Gene Length | 1152 bp |
Protein Length | 383 aa |
Translation table | 12 |
GC content | 44% |
IMG OID | 640389127 |
Product | predicted protein |
Protein accession | XP_001383599 |
Protein GI | 150864666 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG2175] Probable taurine catabolism dioxygenase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTCCCG TTGCTGCTGC TACTAGATAC GACCCCTCGC AGTTCAACCC TGAACATAAT GCCACAACGG AATCTGGGTC TTATACCATC AGCAAGGACA ATAGGGCTAT TGCCAAATTC CCTGATTTTG TTCCAACCTG GAACCCCAAC CAGAAGTTCC CACCTTTGAA GTTCTTCAAA CATACCGACA AAGGAACATT GGCTGATCCA GAATTAAGGA ACTTGTTCCC AGCCAATGGT ACCCATAAAG TCAAGAAGGT TACTCCCAAG CTTGGCTCCG AAGTCCATGG AATTCAGTTG TCTCAACTTG ATGATAAGGG CAAAAACGAC TTGGCTCTCT TTTTAGCCCA GAGAGGTGTT GCTATTTTCA GAGACCAAGA CTTCAGCAGT TATGGTCCTG AATTTGCTGT AGAATACGGC AAGTACTTTG GTCCATTGCA TGTTCATCCT ACCTCGGGGT CTCCAGAAGG GTTTCCTCAG TTGCATATTA CGTTCAGAGG CGCCTCTCAG AATGAATTGG ACAGTGCCTT CGAGACTAGA ACGAACAACA TTGGCTGGCA TTCTGACGTT TCATACGAGC TCAACCCTCC TCAAATAACA TTTTTCAGCG TTCTTGAGGG ACCTGAATCT GGGGGTGATA CCATTTTCGC CGACACTCAA GAAGCGTATA AGAGATTGAG CCCAACCATG CAAAAGATGT TGGAAGGTTT ACACGTTTTG CATACTTCTG AAGATCAAGC CCATATCAAC CAGGCTGCAG GTGGAATCTG TAGAAGAGCT CCTGTTTCTA ACATACACCC TCTTGTAAGA CAACACCCGG TGACAAAAGA AAAATTCTTG TTTTTGAATA GGGAGTTCGG TAGAAGAATT GTAGAGTTGA AAGAGGAAGA ATCAGAGAAT TTGCTTGAAT TCTTGTTCAA CCATGTTGAG CTGGCTCACG ACTTGCAACT CAGAGCCAAC TGGGAACCTA ACACGGTGGT TTTATGGGAC AACAGAAGAA CTGTCCACTC AGCCATTATC GATTGGGATA CTCCAGTGTT AAGACACGCA TTTAGAATCA GTCCCCAAGG AGAAAGGCCC GTGGAAGACT TGAAGGATTT GAATAATGAG AGTTATTTAA AAGAAAAGTA CTCCGTTATT AAGAGAGGTT AA
|
Protein sequence | MAPVAAATRY DPSQFNPEHN ATTESGSYTI SKDNRAIAKF PDFVPTWNPN QKFPPLKFFK HTDKGTLADP ELRNLFPANG THKVKKVTPK LGSEVHGIQL SQLDDKGKND LALFLAQRGV AIFRDQDFSS YGPEFAVEYG KYFGPLHVHP TSGSPEGFPQ LHITFRGASQ NELDSAFETR TNNIGWHSDV SYELNPPQIT FFSVLEGPES GGDTIFADTQ EAYKRLSPTM QKMLEGLHVL HTSEDQAHIN QAAGGICRRA PVSNIHPLVR QHPVTKEKFL FLNREFGRRI VELKEEESEN LLEFLFNHVE SAHDLQLRAN WEPNTVVLWD NRRTVHSAII DWDTPVLRHA FRISPQGERP VEDLKDLNNE SYLKEKYSVI KRG
|
| |