Gene PICST_33428 details


Gene Information

Locus tag: PICST_33428
Symbol: AZR1
ID: 4840711
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009047
Strand:
Start bp: 628798
End bp: 630627
Gene Length: 1830 bp
Protein Length: 609 aa
Translation table: 12
GC content: 44%
IMG OID: 640392026
Product: MDR transporter
Protein accession: XP_001386129
Protein GI: 150866501
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID: [TIGR00711] drug resistance transporter, EmrB/QacA subfamily
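
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive (an assumption, though it reproduces the listed 1830 bp) and that the coding sequence ends in a single stop codon that is not counted in the protein length. The function name `cds_lengths` is hypothetical, not part of any database API.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and one terminal stop codon.
    """
    gene_length = end_bp - start_bp + 1   # inclusive span
    codons = gene_length // 3             # complete codons in the CDS
    protein_length = codons - 1           # stop codon encodes no residue
    return gene_length, protein_length


# Values taken from the record for PICST_33428
gene_len, protein_len = cds_lengths(628798, 630627)
print(gene_len, protein_len)
```

With the values from this record, the result matches the listed Gene Length (1830 bp) and Protein Length (609 aa).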


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.153237
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.442314
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCCTAG CTTCTGATGA TAAAGAGTCC ATTTTGCCGA CTTCCAAAGA CTATGGATCT 
ACCCCACTAA GCCAAGATAA TAGCGAAGAC CTGGTGGCTG TGAACGAAAA GGGCGACATA
GATACTTCCA ACAAGAAAAA GTACTACGAA TCCAATGCTG GAGATGGTTC CAGAGCTGAA
GATCAACATC TTACAGGCTT GACTTTGGCA GTGACGTTAG TGTCTTGTGT TTCGTGTTTG
TTTCTAGTTG CATTGGACCA GACCATCGTG TCCACAATCT TAACGCAGGT TGGAGATAAG
TTTAAAGAGT TCGAGAAAAT CGGATGGTTA ACATCAGGTT ACCTTTTGCC TATGGCTACA
TTGGCTCCTT CTTATGGTAA GGTGGGGATT GCCTTTGGTA GAAAATATAC CCTTCTTGTG
GCCGTAATAG TGTTTGAGAT CGGCTCCTTG ATCTCAGGAT TGTCAAACTC CATGAGCATG
TTGATTGGTG GAAGAGTTAT CCAAGGTATT GGTGGAGGTT GTGTCCAAGC AATGGTAGTT
GTCATTTTGA CAGAGTCTGT GCCTATTTCC AGACGTCCTT TATCGTATAT GTTACTTGGG
GTCACCTTCT CTCTTTCCTC TGTATTGGGA CCTTTCATTG GTGGAGCTTT CGCTACACAT
GTTTCGTGGA GATGGTGCTT CTACATTAAC ATTCCAATTG GTGGTATGGC TGCAGCCTTG
TTGATTTTCG GCTTCCATCC TCCAAAACCC GAAGGAAATA TTAGACAGAA ATTGGCCAAG
ATTGACTATT CGGGAACCTT CTTGCTCACT GTGGGATTGG TGTTGGTCTT GTTAGGTCTT
ACCTTTGGTG GTATCGATTT CCCTTGGAAG TCTGCTGCCG TTATTTGTTC TTTTGTAATT
GGAGGTTTGT TCCTCATAGC ATTCTGTGTC TGGAACTTCA AGTTTTCGAA GAATCCCATC
ATCATCCCGC AGATCGTGAG GATTCCTCCC CTTATGGCTG CATGCATCTC TGGATCTTTC
AACTTTGGCT TTTTCTTGGC CAACTTTACG TATTTGGCTG TCTATTTCCA AGTCATATTC
GGTGCCACTG CTTGGAAATC TGGTGTGGAT TTGCTTCCCA TGATTATTGC CGTTACCATG
ACTTCAATCT TGAACGGAGT GTTCATCAAG TTCACCAGAT ACGTCAAAAT TACCATGTTG
ATTTCTGGTG TCTTGGGCCC AGTCGGTACA GGGTTGCTTT TGCTCCTCAG AAAAGATTCA
CCATTGGCCG ACAGAATCGG TTTATTGATT CTCTGTGGTA TCTCGATTGG TTTGCAGTTC
CAAAGTTCCA TGTTGGCAAC CCAATTGTGT GCTCCTCCTG ATATCACCGG TTCTTTGATT
TTGAGTACCA TCTTCCTCAA CTTCCTTAAA TCCACTGCAG CAGCCATTTC TGTTTCTTTG
GCTCAACTTA TTTTCCAAGC AAGTGGTACT TCATACATAA AAAGTTTGGT AAACAGTTTA
CCTCGTGATT CCGCTGAGTA TCAAGCATTG TATGGTATTC CACCAAAGGA CCTTATTTCA
ACCCCTAGAA TCATTAACAC ATTGCCTGAA TCGGCCAGGC AAATGGTATT GGACCAGTTC
ATGAGAGCAT TGCACAATGT CTTCTACTTG GGACTAGCAT TTTCGATCGT GGCTCTCATC
GGTGCTGTCT TCACCACAAA CAAGAAGATT CCAAAGGCTT CAGATATTGC CAAGAACAGC
GATGTAGAAA AGGGCAACAA GGATGCTGCT ACCACCGAAG AGTCGGAAAT CATCAGTCAG
ATCGAATCCA CTGCAAAGGA TCAAGAATAA
 
Protein sequence
MSLASDDKES ILPTSKDYGS TPLSQDNSED SVAVNEKGDI DTSNKKKYYE SNAGDGSRAE 
DQHLTGLTLA VTLVSCVSCL FLVALDQTIV STILTQVGDK FKEFEKIGWL TSGYLLPMAT
LAPSYGKVGI AFGRKYTLLV AVIVFEIGSL ISGLSNSMSM LIGGRVIQGI GGGCVQAMVV
VILTESVPIS RRPLSYMLLG VTFSLSSVLG PFIGGAFATH VSWRWCFYIN IPIGGMAAAL
LIFGFHPPKP EGNIRQKLAK IDYSGTFLLT VGLVLVLLGL TFGGIDFPWK SAAVICSFVI
GGLFLIAFCV WNFKFSKNPI IIPQIVRIPP LMAACISGSF NFGFFLANFT YLAVYFQVIF
GATAWKSGVD LLPMIIAVTM TSILNGVFIK FTRYVKITML ISGVLGPVGT GLLLLLRKDS
PLADRIGLLI LCGISIGLQF QSSMLATQLC APPDITGSLI LSTIFLNFLK STAAAISVSL
AQLIFQASGT SYIKSLVNSL PRDSAEYQAL YGIPPKDLIS TPRIINTLPE SARQMVLDQF
MRALHNVFYL GLAFSIVALI GAVFTTNKKI PKASDIAKNS DVEKGNKDAA TTEESEIISQ
IESTAKDQE
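
The record lists translation table 12 (the alternative yeast nuclear code), in which the codon CTG is read as serine rather than leucine. The effect is visible at residue 31 of this protein, where the gene has CTG and the protein sequence has S. The following is a minimal sketch, not the database's own translation routine: it builds the standard codon table from the usual TCAG-ordered amino acid string, optionally overrides CTG, and translates only the first 120 bases of the gene sequence shown above. The helper names `codon_table` and `translate` are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ...
STANDARD_AAS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def codon_table(ctg_as_serine: bool = True) -> dict[str, str]:
    """Build a codon -> amino acid map; table 12 differs only at CTG."""
    table = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), STANDARD_AAS)}
    if ctg_as_serine:
        table["CTG"] = "S"  # translation table 12: CTG encodes Ser
    return table


def translate(dna: str, table: dict[str, str]) -> str:
    """Translate in frame 1, ignoring whitespace, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = table[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First two lines (120 bases) of the gene sequence from this record
prefix = """
ATGAGCCTAG CTTCTGATGA TAAAGAGTCC ATTTTGCCGA CTTCCAAAGA CTATGGATCT
ACCCCACTAA GCCAAGATAA TAGCGAAGAC CTGGTGGCTG TGAACGAAAA GGGCGACATA
"""

table12 = translate(prefix, codon_table(ctg_as_serine=True))
standard = translate(prefix, codon_table(ctg_as_serine=False))
print(table12)   # agrees with the first 40 residues of the protein sequence
print(standard)  # differs at residue 31 (L instead of S)
```

Translating with the standard code instead would give leucine at that position, so any downstream tool applied to this gene needs to be told to use table 12.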