Gene PICST_39517 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_39517 
SymbolXUT7 
ID4851701 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009068 
Strand
Start bp2586586 
End bp2587836 
Gene Length1251 bp 
Protein Length417 aa 
Translation table 
GC content44% 
IMG OID640393409 
Productxylose transporter, high affinity, putative similarity to STL13 
Protein accessionXP_001387067 
Protein GI126275308 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000189877 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.584414 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ACTTTTGCAG TTAACTTGTA TGTGTTTGCA GTTGGTAGAG TGCTTTCTGG GGTGGGTGTA 
GGAGTTCTAT CGACTATGGT GCCGTCCTAT CAATGCGAAA TTAGTCCCAG CGAAGAAAGA
GGCAAGTTGG TGTGTGGAGA GTTCACGGGA AATATCACTG GTTATGCTCT CAGTGTATGG
GCCGATTACT TCTGCTACTT TATTCAAGAT ATAGGTGATG CAAGGGAGAA GCCTCATAGC
TTCTTTGCCC ACTTGTCCTG GCGATTGCCT CTATTCATCC AGGTGGTGAT AGCGGCTGTT
CTCTTTGTTG GGGGATTTTT TATTGTCGAG TCACCTCGTT GGTTATTAGA TGTAGACCAG
GACCAACAAG GATTCCATGT ATTAGCGTTG CTCTATGATT CACATCTAGA TGATAACAAA
CCACGTGAAG AGTTCTTTAT GATCAAAAAC TCCATCTTGT TAGAAAGAGA AACTACACCT
AAGAGCGAAC GAACTTGGAA ACATATGTTC AAGAACTACA TGACCCGAGT GCTTATAGCT
TGTTCAGCAC TTGGCTTTGC ACAGTTCAAC GGCATAAATA TCATTTCGTA CTATGCCCCC
ATGGTATTTG AAGAAGCAGG CTTCAACAAC TCCAAGGCTT TACTTATGAC AGGCATCAAC
TCTATAGTAT ATTGGTTCAG TACGATTCCT CCGTGGTTTC TCGTGGATCA TTGGGGTAGA
AAGCCAATTT TGATATCCGG GGGTTTATCT ATGGGAATAT GTATTGGTTT GATTGCGGTG
GTAATTCTAC TAGACAAGTC GTTCACACCG TCTATGGTTG CGGTATTGGT GATAATCTAC
AATGCATCTT TTGGCTACAG TTGGGGTCCT ATCGGATTCT TGATCCCGCC GGAGGTGATG
CCATTGGCAG TTAGATCGAA AGGTGTTTCT ATTTCTACGG CTACAAACTG GTTTGCCAAT
TTTGTTGTGG GTCAGATGAC GCCAATTCTA CAGCAGAGAT TGGGCTGGGG AACTTATCTA
TTCCCGGCTG GTAGTTGTAT CATCTCGGTG ATAGTGGTGA TTTTCTTCTA TCCAGAGACA
AAGGGTGCAG AGCTAGAGGA TATGGACTCT GTGTTCGAGA GCTTTTACAA CTACAAGTCT
CCGTTCAAGA TTTCACGAAA GAGACACCAG AATGATGGCC AGGCGTACCA AAGGGTAGAG
AACGATATCC GCCACAACGA TGTAGAAATG GACGATTTGG ACGATTTGGA C
 
Protein sequence
TFAVNLYVFA VGRVLSGVGV GVLSTMVPSY QCEISPSEER GKLVCGEFTG NITGYALSVW 
ADYFCYFIQD IGDAREKPHS FFAHLSWRLP LFIQVVIAAV LFVGGFFIVE SPRWLLDVDQ
DQQGFHVLAL LYDSHLDDNK PREEFFMIKN SILLERETTP KSERTWKHMF KNYMTRVLIA
CSALGFAQFN GINIISYYAP MVFEEAGFNN SKALLMTGIN SIVYWFSTIP PWFLVDHWGR
KPILISGGLS MGICIGLIAV VILLDKSFTP SMVAVLVIIY NASFGYSWGP IGFLIPPEVM
PLAVRSKGVS ISTATNWFAN FVVGQMTPIL QQRLGWGTYL FPAGSCIISV IVVIFFYPET
KGAELEDMDS VFESFYNYKS PFKISRKRHQ NDGQAYQRVE NDIRHNDVEM DDLDDLD