Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_33987 |
Symbol | |
ID | 4841118 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009048 |
Strand | + |
Start bp | 942466 |
End bp | 943764 |
Gene Length | 1299 bp |
Protein Length | 432 aa |
Translation table | 12 |
GC content | 43% |
IMG OID | 640392433 |
Product | predicted protein |
Protein accession | XP_001386601 |
Protein GI | 126140158 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG2175] Probable taurine catabolism dioxygenase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACATTCA AAGTAAGCGG TGCTGAATTG TTTGACCCTT TAGTGGTCAA GGCCATTAAG AGTGGTGCCT ATGGTGCCAT TTCTGCTGAT AAGATCCTAG AAGAAAGAAC CAAGATCAAA TACCCTGAAT ACTTGAGCCC TTATAAGGCC GAAGACGCCG GGAAATACAA CAACTTCAAG AACGTTGACG GGGCTAAAAA CGACAAGGGT CACTTGGGAG ATCCTACTTT CAAGAATTTG TTCAAACCTG GTACTAAGGT CAAGACGGTG GATTTGTCGC CCAACTACGG AACAGAAATA GACGGTATTC AATTGAGCGA ATTAGATGAT GCTGGGAAAA ATGACTTGGC TCTCTACTTG GAGACGAGAG GGTTGGCTGT TTTCAGAAAT CAGGATTTCA GAGACAAGGG TCCAGCTTTC GCTAAACAGT TCGGAGAATA TTTTGGTCCT TTACACATCC ATCCAGTTAG TTTTGCGGCT GAGAATTATC CTGAGTTGTT GGTGACCTAC AGACCAGCGG GTGGTGCAGA GAGATACCCT GTACAGTTTG CCAACTCAAC GAATACCGCA GGCTGGCACT CGGATATCAG TTTTGAAGAG TATCCATCTT CTTTCAGTTT TTTCGTTGCT TTGGAAGCCC CAGAAAGCGG GGGTGACACT GTGTTCCTTG ATTTGAGAGA AGCATACAAG AGATTGTCAC CTCAAATACA GAAATTCTTT GAAACTTTGA CAATTATTCA TACCAACTAT TACCAGAACC AGTTTGCCAA GTTGAAGAAC TACGAAGCAA GAGTGAAGGG CGATTACTTC ACGGAACATC CTTTAGTCAG AACCCACCCG GTTACTGGCG AAAAATCTTT GTTCTTCTCC AGAGGTTTTG CTCTTAGAAT TAAGGGTCTC AAGCAGCAAG AATCGGACTC GATTCTTAGT TTCTTGGAAA GTCACGTTTT GAACAACCCT GAAATTCAAG TTAGAGCTAG CCATCAAGGC ACAGAATCTA GAACTGTTAT TGCCTGGGAC AACAGAATCT CATTGCATAC TGCAATTGCA GACTTCTTGC AACATGAGAC TCCTGCGCGT CACCACTATA GAATCACTGT TCTAGGTGAA AAGCCATTTT TTGATGGTTC AGTTGAGGCG AAGACTATTA ATGGTCACTC GAATGGTCAC TCCAATGGTC ACTCCAATGG TCACTCCAAT GGTAATTCGA ATGGCCATTC AAATGGTCAC TCGAACGGAA AGTCCAATGG AAAGTCCAAT GTAGGAGATG TTAGTCTTGA CAAATTGACT ATTTCTTAA
|
Protein sequence | MTFKVSGAEL FDPLVVKAIK SGAYGAISAD KILEERTKIK YPEYLSPYKA EDAGKYNNFK NVDGAKNDKG HLGDPTFKNL FKPGTKVKTV DLSPNYGTEI DGIQLSELDD AGKNDLALYL ETRGLAVFRN QDFRDKGPAF AKQFGEYFGP LHIHPVSFAA ENYPELLVTY RPAGGAERYP VQFANSTNTA GWHSDISFEE YPSSFSFFVA LEAPESGGDT VFLDLREAYK RLSPQIQKFF ETLTIIHTNY YQNQFAKLKN YEARVKGDYF TEHPLVRTHP VTGEKSLFFS RGFALRIKGL KQQESDSILS FLESHVLNNP EIQVRASHQG TESRTVIAWD NRISLHTAIA DFLQHETPAR HHYRITVLGE KPFFDGSVEA KTINGHSNGH SNGHSNGHSN GNSNGHSNGH SNGKSNGKSN VGDVSLDKLT IS
|
| |