Gene PICST_30340 details


Gene Information

Locus tag: PICST_30340
Symbol: MCH4.1
ID: 4837033
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009042
Strand:
Start bp: 2558778
End bp: 2560142
Gene Length: 1365 bp
Protein Length: 454 aa
Translation table: 12
GC content: 42%
IMG OID: 640388348
Product: monocarboxylate permease homolog
Protein accession: XP_001383244
Protein GI: 150864432
COG category:
COG ID:
TIGRFAM ID:
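
The coordinate and length fields above can be cross-checked against each other. The sketch below is only an illustration: it assumes the start and end coordinates are 1-based and inclusive, and that the protein length excludes a single terminal stop codon. The variable names are hypothetical and not part of the record.

```python
# Values copied from the gene record above.
start_bp = 2558778
end_bp = 2560142
gene_length_bp = 1365
protein_length_aa = 454

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not counted in the protein length.
coding_codons = span_bp // 3 - 1

print(span_bp, coding_codons)  # 1365 454
```

Under these assumptions the span matches the listed gene length, and the codon count matches the listed protein length.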


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.22847
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.486952
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGTTGTA TCGAGGAGAG AATTGAGCTT GAAGAGATTG TTGAAATTGA ACTAGAAACT 
ATCAATTCAG GATCAAGTTC TACATCAATT TTGCCTGACA GTGGGGAAAT AGTATACCCA
GATGGAGGAT GGAGAGCTTA TGGAGTTGTT CTAGGGTCTT TTCTCGGTTT GACTGTGAGT
TTTGGACTCA TTAATTCGGT CGGTGCTATC CAGGCTTATA TTGCAACGCA TCAGCTTGTC
AATGAAGCTA CGTCTACAAT CTCATGGATT TTTTCCATCT ACTTGACAAT TGCTTTCGGT
GTGGGAATCT TGGTGGGTCC AGTATTTGAT ACCAACGGAG CACTTCCACT ATTATTATGT
GGTTTGGTAC TTCAATTTGT AGGTTTAATG GCTACGGCTG TTTGTAAATC TGTGGTTGAG
TTCGTATTTG CTTTCTCTGT TTGCGTTGGT GTTGGGAATG CTTTCTGTAT TACACCATTG
ATCGGTTCGG TAAGTCACTG GTTTTTATGC AAAAGAGGTC AGGCCATTGG ATTGGCTACC
GTTGGGGGCT CTATTGGAGG TGTTGTGATA CCATTGATGC TACATGCACT TTATAGAAAT
GTGGGCTTCG TATGGGCTAT AAGAGTATTG GCATTTTTCT GTCTTGGATG TCAGGCTCTC
TCTGTATTAC TTGTTAAAGA AAGGGTAAGA AGAAAGTTGG CTAATTCAGA CGACAACCAA
AGAAAATTGC AACAAATAGT ACATGCTTGC AACGAGCTTA TTGACTTACT GTCTTTAAAG
GACATGAAGT TCGTCTTTCT CACTCTTGGT GTATTCTTTG AAGAAGTGTC ATTGATGTGT
ACTTCGACAT ATTTGTCCAC CTATGCTATT ACTCAGGGAG CTAGTGAGTC TACGGCCTAC
ATCTTAGTAA CTGTATTCAA TGCTAGTGGG ATTGTTGGAA GAGTTGTTCC AGCGTATGCT
GCTGACTACA TAGGTTACTT CAATGTGAAT GCATTGATGT TGACGGGTAT GGTATTAACG
ATGTTTGTAA TGTGGTTTCC TTTTGGATCT CATATAGGTG TTCTCTACGC TTTTTCCATC
TTGTGTGGCT TCTTCGTCTC CTCTGTATTG AGCATTACCC CAGCATGCCT AGGAGCCATT
ACGCCAGTTC ACAACTTTGG CCAAAGGTAC GGAATGTGTT TCTGTCTAGC ATCGTTAGGA
TACTTGATAG GTATACCTGT TGGTGCAGCA ATAATTGGTG ATGGAAGTCG TCATCGATAC
GACATCTTCG CCTTGTATTG TAGTCTTTTG GCTCTTGCAT CCATGTTATG TTGGATGGTG
AGCAGGTACT ACATCGTGGG GCTGAAGATT AACGTACGCA TTTGA
 
Protein sequence
MSCIEERIEL EEIVEIELET INSGSSSTSI LPDSGEIVYP DGGWRAYGVV LGSFLGLTVS 
FGLINSVGAI QAYIATHQLV NEATSTISWI FSIYLTIAFG VGILVGPVFD TNGALPLLLC
GLVLQFVGLM ATAVCKSVVE FVFAFSVCVG VGNAFCITPL IGSVSHWFLC KRGQAIGLAT
VGGSIGGVVI PLMLHALYRN VGFVWAIRVL AFFCLGCQAL SVLLVKERVR RKLANSDDNQ
RKLQQIVHAC NELIDLSSLK DMKFVFLTLG VFFEEVSLMC TSTYLSTYAI TQGASESTAY
ILVTVFNASG IVGRVVPAYA ADYIGYFNVN ALMLTGMVLT MFVMWFPFGS HIGVLYAFSI
LCGFFVSSVL SITPACLGAI TPVHNFGQRY GMCFCLASLG YLIGIPVGAA IIGDGSRHRY
DIFALYCSLL ALASMLCWMV SRYYIVGSKI NVRI
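
The record lists translation table 12, NCBI's alternative yeast nuclear code, in which the codon CTG is read as serine rather than leucine. The sketch below shows the effect on one 30-base stretch copied from the gene sequence above. It is a minimal illustration, not the database's own pipeline: `build_table` and `translate` are hypothetical helper names, and the table is derived from the standard code with only the CTG reassignment applied.

```python
from itertools import product

# Standard genetic code in NCBI order (bases T, C, A, G; first base varies slowest).
BASES = "TCAG"
STANDARD_AAS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def build_table(reassign=None):
    """Return a codon -> amino acid dict, with optional codon reassignments."""
    codons = ("".join(c) for c in product(BASES, repeat=3))
    table = dict(zip(codons, STANDARD_AAS))
    table.update(reassign or {})
    return table


# Translation table 12 differs from the standard code at CTG (Ser instead of Leu).
TABLE_12 = build_table({"CTG": "S"})


def translate(dna, table=TABLE_12):
    """Translate DNA in frame 0, ignoring whitespace and stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = table[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# Fragment copied from the gene sequence above; it contains one CTG codon.
fragment = "AACGAGCTTA TTGACTTACT GTCTTTAAAG"
print(translate(fragment))                 # NELIDLSSLK (table 12)
print(translate(fragment, build_table()))  # NELIDLLSLK (standard code)
```

The table 12 result agrees with the corresponding residues of the protein sequence in the record, while the standard code would place a leucine at the CTG position.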