Gene PICST_30034 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_30034 
SymbolMCH4.2 
ID4836689 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009042 
Strand
Start bp1779716 
End bp1781131 
Gene Length1416 bp 
Protein Length471 aa 
Translation table12 
GC content40% 
IMG OID640388004 
Productmonocarboxylate permease homolog 
Protein accessionXP_001382568 
Protein GI150863922 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2271] Sugar phosphate permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGTCTA CGGAATCTAT AGAGCTCACT CATATTAGAC ATGCCAATTC GCAAGCTTCT 
TTTACGGTAA TAAACTCGTC TTCAGGAGCA GTTGACATTT CTGAAGAAAG CATAGCAATA
CAACGACAAG AGGTTGACGG AGAAGAAGAA TCAGAAGATG AATATTATCC TGAAGGAGGA
TGGAGAGCAT ACTTGGTGAT TTTCGGCTCG TTTTTGGGGT TGATAGTCAA TTTGGGTGTC
ATCAACTCCA TTGGTGCCGT TCAAGCATAT GTCTCTACCC ACCAGTTAGC ACTCGTAGAT
GCAACTTCTA TCTCGTGGAT TTTTTCCATC TACTTGGCTT TGGCTTATGC CTTGGGAAGT
ATTGTTGGAC CTATCTTTGA CAGACGAGGA CCCCTAGAGC TTCTTATTGC TTCAACATTA
TTGATTTTTG TTGGTTTAAT GGCATCTGCC AACAGTACGA AAGTGTACCA TTTTATTCTC
AGTTTCGTAG CTTTAGGAGT TGGTAATGGT ATAGGTATGA CTCCTTTGAT AGGGGTCATC
AGCCACTGGT TCAATAAGAA GAGAGGAAAC TTTACCGGTA TTGCCACCAG CGGGGGCTCT
GTAGGTGGAT TGGTGTTTCC GTTGATGCTT AGACACACTT ATGCTCAATA TGGATTCGTC
TGGGCCATGA GAATATTTGC TTTCGTATGT TTGGGATGTA TGGTGTTATC TATATTCTTG
GTGAAGGGCA GATTCAAGCG TGAATCCAAA AAACACGATG TTCAATACTT CAAATCCAAG
TGGAATAATG GTATAGAGAA TTTCCTGATT GCAAATGTGT GGGCAGCAAG CACATCTAAG
TTTGCTTTTC TAATTGCTGG AGCTTTTTTT GGCGAATTAT CCTTGGTATT GCTTCTTACG
TATTTTGCAA CTTATGCTAT GGCTCAAGGT GTATCCGAAG CCAACTCTTT ATTGCTACTT
ACCGTGTGGA ATGCAACAGG TATTCTTGGA AGGTGGATAC CAGGTTATCT TTCGGATTAT
GTTGGTCGCT TCAACATCAA TATTTTAATG CTTTTCAGCT TCAATGTTAG CATATTTGTC
TTGTGGCTAC CATTTGGGTC TAGTTTGAAA GTTCTATTTG CATTTGCCGC TATCGGTGGC
TACTGTCTGG GGTCTATTCT CAGTCTTCTT CCTGCTTGTT TGTCTCAGAT TACTCCAGTC
AATGAAATAG GCACCAAGTA TGGTTTTTTG AATACGATTT TGAGTATCGG TAACTTAGTT
GGTCTTCCAA TAGCGAGTGC AGTGATACAG AATGGAAGCA CCCATAACTA CAATAACTTT
GTGGTATTTG TAGGTTTGCT ATCTGTATTG GGTACAGTCT TTTGGTATAC TAGTCGTTTC
ATGATCGTAG GAAACAGACT AAATGTTAAA GTATAA
 
Protein sequence
MSSTESIELT HIRHANSQAS FTVINSSSGA VDISEESIAI QRQEVDGEEE SEDEYYPEGG 
WRAYLVIFGS FLGLIVNLGV INSIGAVQAY VSTHQLALVD ATSISWIFSI YLALAYALGS
IVGPIFDRRG PLELLIASTL LIFVGLMASA NSTKVYHFIL SFVALGVGNG IGMTPLIGVI
SHWFNKKRGN FTGIATSGGS VGGLVFPLML RHTYAQYGFV WAMRIFAFVC LGCMVLSIFL
VKGRFKRESK KHDVQYFKSK WNNGIENFSI ANVWAASTSK FAFLIAGAFF GELSLVLLLT
YFATYAMAQG VSEANSLLLL TVWNATGILG RWIPGYLSDY VGRFNINILM LFSFNVSIFV
LWLPFGSSLK VLFAFAAIGG YCSGSILSLL PACLSQITPV NEIGTKYGFL NTILSIGNLV
GLPIASAVIQ NGSTHNYNNF VVFVGLLSVL GTVFWYTSRF MIVGNRLNVK V