Gene Information |
Locus tag | PICST_28893 |
Symbol | |
ID | 4851633 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009068 |
Strand | - |
Start bp | 2389968 |
End bp | 2391206 |
Gene Length | 1239 bp |
Protein Length | 412 aa |
Translation table | |
GC content | 46% |
IMG OID | 640393341 |
Product | predicted protein |
Protein accession | XP_001387032 |
Protein GI | 126275106 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG2175] Probable taurine catabolism dioxygenase |
TIGRFAM ID | |
|
|
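The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the unspliced CDS length includes the terminal stop codon. The function names `feature_length` and `protein_length` are hypothetical helpers introduced only for illustration.

```python
def feature_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming the final codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the record for PICST_28893.
gene_bp = feature_length(2389968, 2391206)
print(gene_bp)                  # 1239
print(protein_length(gene_bp))  # 412
```

Under these assumptions both results agree with the Gene Length (1239 bp) and Protein Length (412 aa) fields of the record.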
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.284292 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTCCAT CTGTTCAAGT CCAGCAAGTG GACACCTCCG CAGACTCTAT CACTGAGAGT GTTAAGAAGA TCGCTTTAGG CGTTGGAAGG ACTGGTGATG GAACTTCCAA CTTCAAGGGA GGATTTGCCG ATTCCTCCGT AGACAAGCTT CCAGAGCCTA CCAGAAAAAG ATTCGAAAAG TACGGAATCG ACATTTCTCG TGGTTACCCT GAAAGACCTC CTACTGAGGA GATTCCGGTT TTCATTGACG ACGCTACTGC CATCAGAAAC ACACCTTGGG AGTTTATTGA CAGAGGTTCC AAGGCCGATC CGGAGAAGAA GGCATTGTTG GGGGCTGCTA AGGAAGTTAA ACATTTGACA AAGCACATTG GTACTGAGAT TGTAGGTTTG CAATTGAGCG AACTCACGGA CCAACAGAGA GATGAATTGG CCCTCTTGAT TGCCGAGAGA GTCGTAGTCT TCTTCAGAGA TCAGGATTTG TCTCCACAGA AACAGTTCGA ATTGGGCGAA TACTTCGGCA AAGTTGAAGT TCATCCTCAA CAGGTTCACG TTCCTGGCAT TCGTGGTATT ACGGTCATCT GGCCTGAGCT TTTTAAGAAA TTTGGTCCTA TCACCTTCAG AAAGACTTTG AACCATTTCA CCTCGAGGTG GCACACTGAC TTGGTTCACG AATTGCAACC TCCAGGGATC ACTCATTTGC ACAATGATAC CATTCCTGAA GTTGGGGGAG ACACCGTTTG GGCTTCTGGT TATGCCGCTT ACGACAAGCT TTCTCCAGCT TTGCAAGAAT TCCTTGATGG GAAGAAGGCT GTATACTTCT CTGCTAACAA GTACGTTGAT CGTGAGAACC CATTGAAGGG TACTGTTCAC ATTGAAAGGG AACACCCAAT CATCAGAACC CATCCTGTTA CCGGCTGGAA GTCCTTGTAT GTCAACCGTG CTATGACCAG CAGAATTGTA GGTTTAGAGC CAGGTGAATC AAAGGTCATC TTAGAGTATT TGTTTGATGT CTTTGAAAAG AACTTGGACA TCCAGGTCAG GTTCAACTGG AAGCCATCCC AGCCAGGCTT GGGTACTTCT GCTCTTTGGG ATAACAGAAT CAGTCAGCAT TTTGCTGTTC TTGATTACGA GGGCCAAGAA CCAAGACACG GTACGAGAGT AAGTTCATTG GCTGAGGTTC CTTTCTACGA TGCCGAATCC AAGTCTCAGA GAGAAGCTTT GGGATTGTCC TTAGATTAG
|
Protein sequence | MAPSVQVQQV DTSADSITES VKKIALGVGR TGDGTSNFKG GFADSSVDKL PEPTRKRFEK YGIDISRGYP ERPPTEEIPV FIDDATAIRN TPWEFIDRGS KADPEKKALL GAAKEVKHLT KHIGTEIVGL QLSELTDQQR DELALLIAER VVVFFRDQDL SPQKQFELGE YFGKVEVHPQ QVHVPGIRGI TVIWPELFKK FGPITFRKTL NHFTSRWHTD LVHELQPPGI THLHNDTIPE VGGDTVWASG YAAYDKLSPA LQEFLDGKKA VYFSANKYVD RENPLKGTVH IEREHPIIRT HPVTGWKSLY VNRAMTSRIV GLEPGESKVI LEYLFDVFEK NLDIQVRFNW KPSQPGLGTS ALWDNRISQH FAVLDYEGQE PRHGTRVSSL AEVPFYDAES KSQREALGLS LD
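The sequences above are displayed in space-separated blocks of ten characters, and the gene lies on the minus strand of the replicon. The sketch below, which is not part of the original record, shows one way to strip the block spacing, compute a GC fraction, and reverse-complement a sequence. The helper names `clean_sequence`, `gc_fraction`, and `reverse_complement` are hypothetical. It is an assumption that the displayed gene sequence is given in coding orientation (it begins with ATG and ends with TAG), so a reverse complement would be needed only to compare it against the plus strand of NC_009068.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a non-empty nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# Complement map for unambiguous DNA bases only.
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Reverse complement of an unambiguous DNA sequence."""
    return seq.translate(_COMPLEMENT)[::-1]


# First two ten-base blocks of the gene sequence shown in the record.
prefix = clean_sequence("ATGGCTCCAT CTGTTCAAGT")
print(len(prefix))                # 20
print(gc_fraction(prefix))        # 0.45
print(reverse_complement("ATGGC"))  # GCCAT
```

The 0.45 printed here covers only the first 20 bases and is shown to illustrate the calculation. Applying `gc_fraction` to the full cleaned gene sequence is how one would compare against the GC content field of the record.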