Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_29465 |
Symbol | THI7 |
ID | 4836783 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009042 |
Strand | - |
Start bp | 331038 |
End bp | 332705 |
Gene Length | 1668 bp |
Protein Length | 555 aa |
Translation table | 12 |
GC content | 39% |
IMG OID | 640388098 |
Product | Thiamine Metabolism |
Protein accession | XP_001382829 |
Protein GI | 150864123 |
COG category | [F] Nucleotide transport and metabolism [H] Coenzyme transport and metabolism |
COG ID | [COG1953] Cytosine/uracil/thiamine/allantoin permeases |
TIGRFAM ID | [TIGR00800] NCS1 nucleoside transporter family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTTTCT TGCAAAAACT TGAAGTAAAA CCAAAGGACG GTGGTCAAGA AGTAGACAAT CTACAAAACC ATGATTTGAT ACCTATGACT CCTCTGAGAA GGCTCTGGAA CTACGCCAGT TACTTTAGTT TCTGGACTGT TTCAGAGTGT AGTATTTCAA CTTGGTCGTC AGGTGCATCC TTGCTTTCGT TGGGCTTAAA CGTTAAGGAA TCTATTGGTG TTATTATTGT TGGTAATACT ATCATCAGTA CATTGTCTGT GTTGAACGGT GGTCCAGGTT ACTACTTTCA TGTCGGTTAC ACTGTCTGTC AAAGACTTGT ATTTGGTATC AGAGGCAGTT ACTTTGGAGT AGCAATTAGA ACAATCTTGA GCATTGTATG GTATGGTTCA CAGGCATGGC TTGGAGGTCA ATGTTTAGGT ATCATATTCA GTTCCTGGTC CTATTCGTAT TTACATATGG AAAATACGCT CCCGTTATCT GTGCATTTGA CGACTAGAGA TTTGATTTCA TTCCTCCTTT TCCAACTTAT ATCGATTCCA ATGCTTCTTA TCAGACCTGA AAAACTTAGT ATGTTTCTTC ACGTCTCTTC GGTGGCAGTA TTTGTTGCAA TGATCTCGGT TTTTGCATGG TCGATTGGCC ATAATGGAGG TGCTGGGCCA TTGTTGAATG CACAAAGTAA CTTCTCCTCC AAGTCAGCTC ATGCTTGGGC ATGGATATAT GGTATCACTT CATGGTATGG ATCTTTATCT TCTGGCATAA CCAACATGTC TGATTTCACT AGATACTCCA AGAGAAAGTC AAGTTGTGTA CCTGGTACTT TTGGTGCTAT AATGACATTT GGAACTGTTA TGCCTCTTTT TGGCTTGCTT TCTGCATCAG CTACATCTGA GATATACGGT CAGGCTCTTT GGATGCCACA TATGATCGTT GAACAATGGA TTATTGCTGA CTACAGTTCT AGGTCAAGAG TAGCCGCATT CTTTGCGTCT TTATGTTTCC TCTCCTCCCA ATTAGCCCTT AACTTATTGT CCAATGGTAT TGCAGGTGGT ATGGATATGT CTGGATTGTG CCCTAAATAC ATAAACATTA AACGAGGAGC TGTGCTTACT TCCCTATTAT CTTGGGTAGT TCAACCATGG TTATTCTACA ACACGTCTTC GAGATTTGTG GTAGTTATGT CGTCATTCTC AGTATTCATG TCACCAATTA TTGCTATTAT TATGTCGGAA TTCTGGATAA TCAGGAAGAG AAAACTTAAG CTAAGCGATT TGTATTCTAA TGAAGTGGAT TCAATCTACT GGTACTGGAA TGGATTCAAC TTGAAGAGTT TCTTCATATT CATTGTTGTT GCCACACCTG GTCTTCCAGG TTTGATTCAT ATGGCAAACC CAAATATTTC AATTAACCAG GGAATACTTC ACTACTACTA TGGTAACTGT ATCTTTGGAT TCTGTATTGC TTTCTTCTTG AATATCGCTT TGAATTACAT TTTTCCTTCC AAGGCTATAC ATGCACTTGA TTCCGTTGAT TACTTCCACA CCTTTACTAA CGAAGAATGT CTCAAGATGG GCATTACACC AGCTGAAAAC GAGAGTGACC GCCAGTCCAA TCAATCTAAG GATGTGGAAT TAATTCAAGA AATTAATGTT GAGAAGAACT CTGTTTAA
|
Protein sequence | MSFLQKLEVK PKDGGQEVDN LQNHDLIPMT PSRRLWNYAS YFSFWTVSEC SISTWSSGAS LLSLGLNVKE SIGVIIVGNT IISTLSVLNG GPGYYFHVGY TVCQRLVFGI RGSYFGVAIR TILSIVWYGS QAWLGGQCLG IIFSSWSYSY LHMENTLPLS VHLTTRDLIS FLLFQLISIP MLLIRPEKLS MFLHVSSVAV FVAMISVFAW SIGHNGGAGP LLNAQSNFSS KSAHAWAWIY GITSWYGSLS SGITNMSDFT RYSKRKSSCV PGTFGAIMTF GTVMPLFGLL SASATSEIYG QALWMPHMIV EQWIIADYSS RSRVAAFFAS LCFLSSQLAL NLLSNGIAGG MDMSGLCPKY INIKRGAVLT SLLSWVVQPW LFYNTSSRFV VVMSSFSVFM SPIIAIIMSE FWIIRKRKLK LSDLYSNEVD SIYWYWNGFN LKSFFIFIVV ATPGLPGLIH MANPNISINQ GILHYYYGNC IFGFCIAFFL NIALNYIFPS KAIHALDSVD YFHTFTNEEC LKMGITPAEN ESDRQSNQSK DVELIQEINV EKNSV
|
| |