Gene PICST_33613 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_33613 
SymbolMCH4.5 
ID4841134 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009048 
Strand
Start bp2573 
End bp3934 
Gene Length1362 bp 
Protein Length453 aa 
Translation table12 
GC content40% 
IMG OID640392449 
ProductMonocarboxylate transporter 
Protein accessionXP_001386604 
Protein GI150866867 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTTGTA TTGATGAGAT AATTGAGCTT GAAGAGAATG ACGAAATTGC TGTAGATACT 
AATTCTATAT CCAGTGCAAT TTCAAACTTG CCCCAAGATG AGGAATTAGA ATATCCAGAT
GGAGGATGGA GAGCGTATGG AGTTGTTCTA GGATCTTTTC TTGGTTTGAC AGTAAGTTTT
GGACTCATCA ACTCAGTCGG TGCTATTCAG GCCTATATTG CTTTACATCA ACTTGTCGAT
GAAGCTACGT CTACAATCTC TTGGATTTTC TCCATCTACT TGACAATTGT GTTCGGCATA
GGCATATTGG TAGGACCAGT ATTTGATACT AATGGAGCAC TTCCACTTTT GTTGTGTGGA
ATGGTACTTC AATTTATAGG TCTAATGGCT ACAGCGGTCT GTAAATCTGT TGTTGAGTTC
ATATTTGCCT TCTCTGTTTG TGTTGGTGTT GGAAATGCTT TCTGCATACC GCCATTGATT
GGTTCAGTAA GCCACTGGTT TTTAAGCAAA AGAGGACAAG CTATTGGATT GGCTACTGTT
GGAGGTTCAA TTGGAGGAGT TGTGATTCCT TTGATGTTAC ATGTCCTTTA CAGCAATGTG
GGCTTCGTAT GGGCTATCAG AATATTGGCA TTTTTCTGCC TTGGTTGTCA GGCTCTCTCC
CTTATACTAG TTAAGGAAAG AGTCAGAAGA AAGTTGGTCT ATATGGATGA TAATCAGAGA
AAATTTCAAC AGATAGTACA AGCTTGTAAC AATCTTGTGG ATGTGCTGTC TTTGTCGGAT
ATGAAGTTCG CATTTCTCAC AGCTGGTGTA TTCTTCGAAG ATGTGACTTT GATGTGTACT
TCCACATACT TGCCTACCTA TGCTATTGCA CAGGGAGCTA GTGAATCTAC CGCTTACATC
TTAGTGACAG TTTTCAATGC TAGTGGGATT GTTGGAAGGG TACTTCCTGC GTATGTCGCT
GACTTCATTG GCTACTTCAA TGTGAATGTA TTGATGCTAA TGGGTATGGT ATTAACTATG
CTTGTATTAT GGTTTCCTTT TGGATCGCAT ATAGGTATTC TCTATGCCTT CTCCATCTTG
TGTGGGTTCT TTGTTTCTTC TGTGTTAAGC CTTTCAACAG CATGTCTAGG TGCAATTACA
CCCGTTCACA ACTTTGGACA AAGATACGGA ATGTGTTTCT GTCTAGCTTC GTTAGGTTAC
TTGATAGGTA TACCTGTTGG AGCAGCAATA ATCGGCGATG GTAGTACTCA TCGTTACGAC
ATTTTCGCAT TATATTGTAG TATTTCGGCT GTTGCTTCGA TGTTTTGTTG GATGGTGAGC
AGATACTACA TTGTGGGATT TAAACTTAAT GTACGCATAT AG
 
Protein sequence
MSCIDEIIEL EENDEIAVDT NSISSAISNL PQDEELEYPD GGWRAYGVVL GSFLGLTVSF 
GLINSVGAIQ AYIALHQLVD EATSTISWIF SIYLTIVFGI GILVGPVFDT NGALPLLLCG
MVLQFIGLMA TAVCKSVVEF IFAFSVCVGV GNAFCIPPLI GSVSHWFLSK RGQAIGLATV
GGSIGGVVIP LMLHVLYSNV GFVWAIRILA FFCLGCQALS LILVKERVRR KLVYMDDNQR
KFQQIVQACN NLVDVSSLSD MKFAFLTAGV FFEDVTLMCT STYLPTYAIA QGASESTAYI
LVTVFNASGI VGRVLPAYVA DFIGYFNVNV LMLMGMVLTM LVLWFPFGSH IGILYAFSIL
CGFFVSSVLS LSTACLGAIT PVHNFGQRYG MCFCLASLGY LIGIPVGAAI IGDGSTHRYD
IFALYCSISA VASMFCWMVS RYYIVGFKLN VRI