Gene Information |
Locus tag | PICST_53191 |
Symbol | |
ID | 4851655 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009068 |
Strand | - |
Start bp | 2463938 |
End bp | 2465074 |
Gene Length | 1137 bp |
Protein Length | 378 aa |
Translation table | |
GC content | 47% |
IMG OID | 640393363 |
Product | predicted protein |
Protein accession | XP_001387045 |
Protein GI | 126275171 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG2175] Probable taurine catabolism dioxygenase |
TIGRFAM ID | |
|
|
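The coordinate and length fields above are mutually consistent, and the arithmetic can be sketched as follows. This is a minimal illustration, not the database's own method: it assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS is unspliced and ends in a single stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are hypothetical helpers introduced here.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def protein_length(cds_length: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length.

    Assumption: if has_stop_codon is True, the final codon is a stop codon
    and contributes no residue to the protein.
    """
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = cds_length // 3
    return codons - 1 if has_stop_codon else codons


# Values taken from the record above (Start bp, End bp).
length_bp = gene_length(2463938, 2465074)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1137 378
```

Under these assumptions the result matches the Gene Length (1137 bp) and Protein Length (378 aa) fields. The strand does not affect the calculation, because the length depends only on the span between the two coordinates.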
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.148721 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTCCAG TTCGTTACAC AAACCCAAAC ACCGTCTACA GCAGAGTAGA AGACGAAGAC GCAGACCACG AAGGCTACTT GTCGGTCGCT AAGGAATTCA AGGTCATTTC CAAGGCTCCA GACTACTTGC CAACATGGAA CAAGGCTGAA AAGTACGAAC CCTTGCAATT CCAAGAGCAC ATCGATGCTG GTACTAAGGC TGACCCAGCC CTTCCAAACT TGTTCAACAA GGGAACCGAG TTCAAGACTA AGAACATCAC TCCAAAGTTG GGAACCGAAG TCTTTGGTGT TCAGCTTTCT CAGTTGGACT CTGCCGGTAA GGATGAATTA GCTTTGTTCG TAGCAAAGAG AGGTCTCGTT GTCTTCAGAG ACCAAGACCT CGCTTCCAAG GGTCCAGCCT TCCAGACTGA GCTCGGTAGA CACTTCGGTC CTTTGCACAT CCACCCAACT TCAGGTGCTC CAAAGGACCA CCCAGAGTTG CACGTTGTGT ACAGACGTCC AGATGTAAAG GATCTCTTTG AACACAGAAA CAACTTGGTT GGCTTCCATT CCGATGTTAC CTACGAATTG CAACCTCCAG GAACTACTTT CCTCGCCGTT GTCGAAGGAC CAGAATCAGG TGGTGACACT TTGTTTGCTG ACACCGTTGA AGCTTACAAC CGTTTGTCTC CAGAATTCCA GAAGAGATTG GAAGGTTTGC ACGTCTTGCA CAGTGCTGTG GAACAAGCCA ACTTTTCTAG AAAGAATGGT GGTGTCGTCA AGAGAGACCC AGTACAGAAC ATTCATCCTT TGGTCAGAAC TCACCCAGTC ACTGGTGAAA AGGCTCTTTT CATCAACTCT GGTTTCTCCA GAAAGATTGT CGAATTGAAA GAAGAAGAAT CCGACTACTT GTTGACTTTC TTGTTGAACC ACATCAACAA CTCTCACGAC TTGCAAGCAA GAGCTAAGTG GGAGGCTAAC ACTGTTGTTG TCTGGGATAA CAGAAGAGTC GTTCACTCTG CTATCTTGGA CTGGGACACT AGCGAAGCTA GATTGGCTTA CAGAATCACT CCTCAAGCTG AAAGACCAGT CTACGACTTG AAGGACTTGA ACACTCCAGA CGAAAATAAG TTGTACAAGG GTCCAGACTA CCAGTAG
|
Protein sequence | MAPVRYTNPN TVYSRVEDED ADHEGYLSVA KEFKVISKAP DYLPTWNKAE KYEPLQFQEH IDAGTKADPA LPNLFNKGTE FKTKNITPKL GTEVFGVQLS QLDSAGKDEL ALFVAKRGLV VFRDQDLASK GPAFQTELGR HFGPLHIHPT SGAPKDHPEL HVVYRRPDVK DLFEHRNNLV GFHSDVTYEL QPPGTTFLAV VEGPESGGDT LFADTVEAYN RLSPEFQKRL EGLHVLHSAV EQANFSRKNG GVVKRDPVQN IHPLVRTHPV TGEKALFINS GFSRKIVELK EEESDYLLTF LLNHINNSHD LQARAKWEAN TVVVWDNRRV VHSAILDWDT SEARLAYRIT PQAERPVYDL KDLNTPDENK LYKGPDYQ
|
| |
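The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any computation. The sketch below shows one way to do that and to compute a GC percentage and a basic CDS sanity check. It is an illustration under stated assumptions: the helper names are hypothetical, the GC figure is a plain count of G and C over the total length (the database may round or count differently), and the stop codon set TAA/TAG/TGA is assumed because the Translation table field in the record is blank.

```python
def clean_sequence(grouped: str) -> str:
    """Join a sequence printed in whitespace-separated blocks into one string."""
    return "".join(grouped.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


def looks_like_complete_cds(seq: str) -> bool:
    """Rough check: length divisible by 3, ATG start, and an assumed stop codon."""
    assumed_stops = {"TAA", "TAG", "TGA"}  # assumption, not taken from the record
    return len(seq) % 3 == 0 and seq.startswith("ATG") and seq[-3:] in assumed_stops


# First two blocks of the gene sequence shown above, used as a short example.
fragment = clean_sequence("ATGGCTCCAG TTCGTTACAC")
print(len(fragment), gc_percent(fragment))  # 20 50.0
```

Applying `clean_sequence` to the full Gene sequence field and passing the result to `gc_percent` would give a value to compare against the GC content field. The gene sequence in the record starts with ATG and ends with TAG, which suggests it is already given in the coding orientation despite the minus-strand annotation.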