Gene PICST_80454 details


Gene Information

Locus tag: PICST_80454
Symbol: ITR2
ID: 4851257
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009068
Strand:
Start bp: 1328326
End bp: 1330186
Gene Length: 1861 bp
Protein Length: 522 aa
Translation table:
GC content: 43%
IMG OID: 640392965
Product: myo-inositol transporter
Protein accession: XP_001387902
Protein GI: 126274243
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID: [TIGR00879] MFS transporter, sugar porter (SP) family
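
The start, end, and gene length values above can be cross-checked against each other. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the record does not state the convention, but the reported length is consistent with it). The function name `span_length` is hypothetical and used only for illustration.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming inclusive 1-based coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


# Values copied from the record above.
gene_length = span_length(1328326, 1330186)
print(gene_length)  # 1861, matching the reported gene length
```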


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.806803
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.231524
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCAAATC CTAAATTAAA AAACAACTCC AGTTCCCAGT TGTTGAACTC AGAGGTTAAC 
CTCTCCACCT CTTCAGTCTC GTCCAGCTCC TCTGCCAACG CGTTGTTGAA TACAAACAAA
GTCAGACAAT ATGAATCTTA TTCAAACGGT TCTCCTGCTA TAAACAAAAC TAATACAAGT
GATTTAGACC TCGATCTCCA GTCTTATGGC GGTTCTGAAT CTGAGTCTAG ACCTTCCAGA
CTCGTCATCA TCTTGACGCT TGCCTCTTCT ATATCTGGTT TCATGTTTGG TTACGATACC
GGTTATATCT CTTCGGCGTT AGTGCAAATA GGTACTGATT TGAGTGACAA AATTCTTACC
AACGGTGAAA AAGAGTTCAT CACTTCAGCA ACATCGTTGG GTGCTTTGAT TGGTGCTGTT
ATATCGGGTA TCTTGGTCAA CTTGATTGGT CGTAAGACGG TATTGTTAGG TTCTAATGTT
GTATTTGTAA TAGGTACTAT CATCCAGTTG GCTTCGAAAA CGGTATGGAC TATGATCGCA
GGAAGATTCG TGTTGGGTCT TGGCGTTGGT ATTGCCTCTT TGATCGCTCC CTTAATGTTG
AGTGAATTGG CTCCGGCCAA GTACAGAGGT AGATTAATTG TCACTAACGT CATGTTCATC
ACAGGTGGTC AACTTGTAGC ATACTTCATC AACTGGGGCC TCACACGGGT CAGCCATGGC
TGGAGAATTT CTGTTGGCTT GTGCATGGTG CCTCCTGTAG TTCAGTTCGT ATTGTTTTGG
TTCTTGCCTG ATACTCCTAG ATACTATGTT ATCAAAGGTG ACATAGAAAC AGCTAAAGAA
GTCGTCAGAA GAACCCATAA TCATCCCTCA GAAGAATTTG TCAACGCTAC TATTGAAGAA
ATGATCGCTT CAAACTCAAC CGTGTCTGGT TCTAGCCAGT TGAGACGTGT ATGGAACTCC
ATCAAATTGA TCCATACCAA CCCTGCTAAT TTCAGAGCCT TGATCTTAGC CACTGGCTTG
CAGGGTATTC AGCAGTTCAC TGGTTTCAAC TCATTGATGT ACTTCTCTGC TACCATTTTC
GAAACCATCG GCTTCAAGAA TGCAACTGCT GTGTCTATCA TCGTTGCTGC CACCAACTTC
GTTTTCACCG CTATTGCCTT GTGTATTGTC GATAAGGTAG GTAGAAGAAG AATCTTGCTT
TGGGCGATTC CCTGTATGGC TGGCTCCTTG GTAATCTGTG CTATTGCTTT CCACTTCTTG
GGTGTAGTTT TCACTTCGGG CTCCAACGTC GAAGTCCGTA GTTCAGGAAT TTCCGGTTGG
GGTATTGTAG TCATTATTGG CATGGTCTTG TATGTCGCTT CCTACGCCAT TGGTATTGGA
AACAGTGCCT GGATCGGTGT GGAATTGTTC TCTGATGTGA ACGTCAGATC TGTCGGAGCC
ATGTATGCTG CTGCTACTAA CTGGGCTGGG TCATTGGTCA TTGCCTCTAC TTTCTTGACT
ATGTTGGAAA ACATTACCCC AACTGGTACC TTCTCATTCT TTGCTGGCTT GTGTGCTGTT
TCGTTCTTGT TTGTCTATTT ATTGTTGCCC GAAGTTGCCG GATTGGAATT GGAAGAGACA
ACTGCCTTCT TAGCCGACGG ATTCAATGTC AAACAAGCTT CCAAGTTGTC CAAAGAAAGA
AAGAAGCACT CAAAATTCAT CAACCAGCAT TCTGCTTAAG AAAGCGGACC CATCAAAAAC
AAAAGGTTGC ATCCACTTTT CAATTCTTGT ATGATTGAAT CTCAAGGAGA ATTCGGTTTC
GGTTTTGCTT GTATGTATCC AATCTTTAAT ATTTATATGC CTTTATTTAC ATTTAATCGT
T
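
A GC content figure for a sequence printed in this grouped layout can be estimated with a short script. This is a sketch under assumptions: GC content is taken as the fraction of G and C among all bases, and the blocks of ten are joined by stripping whitespace. The record does not say how its own GC value was computed, so the result may differ slightly from the reported figure. The name `gc_content` is hypothetical.

```python
def gc_content(sequence: str) -> float:
    """Fraction of G and C bases in a sequence; whitespace is ignored."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# First three blocks of the gene sequence, used only as a short demonstration.
fragment = "ATGTCAAATC CTAAATTAAA AAACAACTCC"
print(f"{gc_content(fragment):.1%}")
```

To check the whole gene, paste the full sequence block into a string and pass it to the same function.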
 
Protein sequence
MSNPKLKNNS SSQLLNSEVN LSTSSSRPSR LVIILTLASS ISGFMFGYDT GYISSALVQI 
GTDLSDKILT NGEKEFITSA TSLGALIGAV ISGILVNLIG RKTVLLGSNV VFVIGTIIQL
ASKTVWTMIA GRFVLGLGVG IASLIAPLML SELAPAKYRG RLIVTNVMFI TGGQLVAYFI
NWGLTRVSHG WRISVGLCMV PPVVQFVLFW FLPDTPRYYV IKGDIETAKE VVRRTHNHPS
EEFVNATIEE MIASNSTVSG SSQLRRVWNS IKLIHTNPAN FRALILATGL QGIQQFTGFN
SLMYFSATIF ETIGFKNATA VSIIVAATNF VFTAIALCIV DKVGRRRILL WAIPCMAGSL
VICAIAFHFL GVVFTSGSNV EVRSSGISGW GIVVIIGMVL YVASYAIGIG NSAWIGVELF
SDVNVRSVGA MYAAATNWAG SLVIASTFLT MLENITPTGT FSFFAGLCAV SFLFVYLLLP
EVAGLELEET TAFLADGFNV KQASKLSKER KKHSKFINQH SA