Gene PICST_29465 details


Gene Information

Locus tag: PICST_29465
Symbol: THI7
ID: 4836783
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009042
Strand:
Start bp: 331038
End bp: 332705
Gene Length: 1668 bp
Protein Length: 555 aa
Translation table: 12
GC content: 39%
IMG OID: 640388098
Product: Thiamine Metabolism
Protein accession: XP_001382829
Protein GI: 150864123
COG category: [F] Nucleotide transport and metabolism; [H] Coenzyme transport and metabolism
COG ID: [COG1953] Cytosine/uracil/thiamine/allantoin permeases
TIGRFAM ID: [TIGR00800] NCS1 nucleoside transporter family
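
The coordinate and length fields in this record can be cross-checked against each other. The following is a minimal sketch in Python, assuming the coordinates are 1-based and inclusive and that the gene length includes one untranslated stop codon; neither assumption is stated in the record, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 331038
end_bp = 332705
gene_length_bp = 1668
protein_length_aa = 555

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS ends with a single stop codon that is not translated.
codon_count = span_bp // 3
expected_protein_aa = codon_count - 1

print(span_bp == gene_length_bp)                 # span matches the listed gene length
print(span_bp % 3 == 0)                          # an unspliced CDS is a whole number of codons
print(expected_protein_aa == protein_length_aa)  # codons minus stop matches the protein length
```

Under these assumptions all three checks print True, which agrees with the record listing the gene as unspliced.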


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGTTTCT TGCAAAAACT TGAAGTAAAA CCAAAGGACG GTGGTCAAGA AGTAGACAAT 
CTACAAAACC ATGATTTGAT ACCTATGACT CCTCTGAGAA GGCTCTGGAA CTACGCCAGT
TACTTTAGTT TCTGGACTGT TTCAGAGTGT AGTATTTCAA CTTGGTCGTC AGGTGCATCC
TTGCTTTCGT TGGGCTTAAA CGTTAAGGAA TCTATTGGTG TTATTATTGT TGGTAATACT
ATCATCAGTA CATTGTCTGT GTTGAACGGT GGTCCAGGTT ACTACTTTCA TGTCGGTTAC
ACTGTCTGTC AAAGACTTGT ATTTGGTATC AGAGGCAGTT ACTTTGGAGT AGCAATTAGA
ACAATCTTGA GCATTGTATG GTATGGTTCA CAGGCATGGC TTGGAGGTCA ATGTTTAGGT
ATCATATTCA GTTCCTGGTC CTATTCGTAT TTACATATGG AAAATACGCT CCCGTTATCT
GTGCATTTGA CGACTAGAGA TTTGATTTCA TTCCTCCTTT TCCAACTTAT ATCGATTCCA
ATGCTTCTTA TCAGACCTGA AAAACTTAGT ATGTTTCTTC ACGTCTCTTC GGTGGCAGTA
TTTGTTGCAA TGATCTCGGT TTTTGCATGG TCGATTGGCC ATAATGGAGG TGCTGGGCCA
TTGTTGAATG CACAAAGTAA CTTCTCCTCC AAGTCAGCTC ATGCTTGGGC ATGGATATAT
GGTATCACTT CATGGTATGG ATCTTTATCT TCTGGCATAA CCAACATGTC TGATTTCACT
AGATACTCCA AGAGAAAGTC AAGTTGTGTA CCTGGTACTT TTGGTGCTAT AATGACATTT
GGAACTGTTA TGCCTCTTTT TGGCTTGCTT TCTGCATCAG CTACATCTGA GATATACGGT
CAGGCTCTTT GGATGCCACA TATGATCGTT GAACAATGGA TTATTGCTGA CTACAGTTCT
AGGTCAAGAG TAGCCGCATT CTTTGCGTCT TTATGTTTCC TCTCCTCCCA ATTAGCCCTT
AACTTATTGT CCAATGGTAT TGCAGGTGGT ATGGATATGT CTGGATTGTG CCCTAAATAC
ATAAACATTA AACGAGGAGC TGTGCTTACT TCCCTATTAT CTTGGGTAGT TCAACCATGG
TTATTCTACA ACACGTCTTC GAGATTTGTG GTAGTTATGT CGTCATTCTC AGTATTCATG
TCACCAATTA TTGCTATTAT TATGTCGGAA TTCTGGATAA TCAGGAAGAG AAAACTTAAG
CTAAGCGATT TGTATTCTAA TGAAGTGGAT TCAATCTACT GGTACTGGAA TGGATTCAAC
TTGAAGAGTT TCTTCATATT CATTGTTGTT GCCACACCTG GTCTTCCAGG TTTGATTCAT
ATGGCAAACC CAAATATTTC AATTAACCAG GGAATACTTC ACTACTACTA TGGTAACTGT
ATCTTTGGAT TCTGTATTGC TTTCTTCTTG AATATCGCTT TGAATTACAT TTTTCCTTCC
AAGGCTATAC ATGCACTTGA TTCCGTTGAT TACTTCCACA CCTTTACTAA CGAAGAATGT
CTCAAGATGG GCATTACACC AGCTGAAAAC GAGAGTGACC GCCAGTCCAA TCAATCTAAG
GATGTGGAAT TAATTCAAGA AATTAATGTT GAGAAGAACT CTGTTTAA
 
Protein sequence
MSFLQKLEVK PKDGGQEVDN LQNHDLIPMT PSRRLWNYAS YFSFWTVSEC SISTWSSGAS 
LLSLGLNVKE SIGVIIVGNT IISTLSVLNG GPGYYFHVGY TVCQRLVFGI RGSYFGVAIR
TILSIVWYGS QAWLGGQCLG IIFSSWSYSY LHMENTLPLS VHLTTRDLIS FLLFQLISIP
MLLIRPEKLS MFLHVSSVAV FVAMISVFAW SIGHNGGAGP LLNAQSNFSS KSAHAWAWIY
GITSWYGSLS SGITNMSDFT RYSKRKSSCV PGTFGAIMTF GTVMPLFGLL SASATSEIYG
QALWMPHMIV EQWIIADYSS RSRVAAFFAS LCFLSSQLAL NLLSNGIAGG MDMSGLCPKY
INIKRGAVLT SLLSWVVQPW LFYNTSSRFV VVMSSFSVFM SPIIAIIMSE FWIIRKRKLK
LSDLYSNEVD SIYWYWNGFN LKSFFIFIVV ATPGLPGLIH MANPNISINQ GILHYYYGNC
IFGFCIAFFL NIALNYIFPS KAIHALDSVD YFHTFTNEEC LKMGITPAEN ESDRQSNQSK
DVELIQEINV EKNSV
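
The record lists translation table 12 (the alternative yeast nuclear code), which matters when reproducing the protein sequence from the gene sequence: in that table the codon CTG is read as serine rather than leucine. The sketch below illustrates this on the first 102 bases of the gene sequence, using only the Python standard library. The helper names (`build_codon_table`, `translate`) are hypothetical, and the table is built from the standard code with the single CTG reassignment, which is an assumption that covers sense-codon assignments only, not start-codon rules.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code (NCBI table 1), amino acids listed in TCAG codon order.
STANDARD_AAS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def build_codon_table(aas):
    """Map each codon (TTT, TTC, TTA, ...) to its one-letter amino acid."""
    codons = ("".join(triplet) for triplet in product(BASES, repeat=3))
    return dict(zip(codons, aas))


TABLE_1 = build_codon_table(STANDARD_AAS)
# Assumption: table 12 is modelled as table 1 with CTG reassigned to serine.
TABLE_12 = dict(TABLE_1, CTG="S")


def translate(dna, table):
    """Translate a DNA string, ignoring whitespace and stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = table[dna[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 102 bases of the gene sequence shown above (34 codons).
fragment = (
    "ATGAGTTTCT TGCAAAAACT TGAAGTAAAA CCAAAGGACG GTGGTCAAGA AGTAGACAAT "
    "CTACAAAACC ATGATTTGAT ACCTATGACT CCTCTGAGAA GG"
)

print(translate(fragment, TABLE_12))  # MSFLQKLEVKPKDGGQEVDNLQNHDLIPMTPSRR
print(translate(fragment, TABLE_1))   # MSFLQKLEVKPKDGGQEVDNLQNHDLIPMTPLRR
```

The table 12 result matches the start of the protein sequence in this record (residue 32 is S), whereas the standard code would give L at that position.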