Gene PICST_33429 details


Gene Information

Locus tag: PICST_33429
Symbol: AZR2
ID: 4840712
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009047
Strand:
Start bp: 631297
End bp: 633180
Gene Length: 1884 bp
Protein Length: 627 aa
Translation table: 12
GC content: 42%
IMG OID: 640392027
Product: MDR transporter
Protein accession: XP_001386130
Protein GI: 150866502
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID:
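
The coordinates and lengths listed in this record can be cross-checked with a few lines of arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, that the CDS is unspliced (as stated above), and that the final codon is a stop codon. The function name `lengths_from_coordinates` is hypothetical and not part of any database API.

```python
def lengths_from_coordinates(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and a terminal stop codon.
    """
    gene_length = end_bp - start_bp + 1        # inclusive coordinates
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_length = gene_length // 3 - 1      # the stop codon encodes no residue
    return gene_length, protein_length


print(lengths_from_coordinates(631297, 633180))  # (1884, 627)
```

Under these assumptions the result agrees with the Gene Length (1884 bp) and Protein Length (627 aa) fields.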


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0921867
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.433771
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCTGAGC GACAGAGCAT TCCAGTGTTA GCAAGTACAA TTGAATCATC CAGACGGTCC 
CCAACCGATC TACACTCTCA GGAGCTCTCT CCACAAGACT TGAACTCCTG CTCTCCTAAA
ACAACACGCA AATCAGGAAA GTACTACGAA AGTAATGCAG GTGACGGTAC AAAGAATTCA
GAACAGTATA TCAAAGGAAT CAAGTTGGCT GTGACCCTCG TTAGTTGTGT AGCCTGCTTG
TTCCTTGTAG CATTGGACCA GACTATTATA GCCACCATTT TAACACAAGT AGGTGACAAG
TTCCAGGAAT TCGAGAAGGT GGGATGGTTA ACCACAGGAT TCTTTCTACC TATAGCCTGT
TTGTCGCCGT CCTATGGGAA AATAGGCATA GCCTTTGGAA GAAAATATAC TCTCCTAGTA
GGAGTGATTG TCTTTGAAAT TGGCTCGTTG ATCTCAGCAT TAGCCAACTC TATGGGTATG
TTGATTTCAG GAAGGGTCGT TCTGGGGCTT GGAGGAGGTT GTGTACAAGC CATGGTAGTT
TTGATCTTGA CAGAATCTGT ACCCATAAAC CTAAGACCAC TTTCATATAC TTTGCTTGGA
GTTACATTTT CAGTTGCAAG TGTTCTTGGC CCTTTTGTTG GCGGAGCCCT AGCCACACAC
GTTTCGTGGA GATGGTGTTT CTATATCAAT CTCCCCATTG GAGCTATGGC TCTGCTACTT
CTTATGCTCG GATTCCATCC TCCAAAACCT CATGGAAGCA TCAGACAGAA AATGGCCAGA
ATCGACTACT TGGGAACATT CCTACTTACA ACTGGGTTGG TTCTCGTATT GCTTGCCTTG
ACCTTTGGTG GTATAAGCAA TCCCTGGAAC TCGGCCTTGG TGATATCTTT CTTTGTCGTT
GGTGGAGTAC TTTGTATCGT GTTCAACATC TGGAATTTCG GCTTCTCCAC ACATCCCATG
ATCATTAAGG AGGTAATCAT GGTTCCTCAG GCTGCTGCTG CTTGCATCAA TGCCTGTTTT
AATTTTGGAT TCTTTATTCC ACTAATGACA TATTTGGCCG TCTATTTCCA AGTAATCTTT
GGACATTCAG CATGGAAATC GGGTGTTGAT CTCATTCCCA TGATCATATC AGTAACATTT
TCCTCGATAT CTAATGGTGT CTTTATTAGA TTTTCCAGAA ACGTCAAGCT TACCATGGTG
ATCTCAGGAG TATTGGGTCC GGTCGGTACC GGATTGCTTC TTCTATTGAA CAAACATTCT
TCTGCCAAAG ATAGGATTGG CTTGTTGATC GTCTCTGGTG TTTCTATCGG GTTATCTTTT
CAAAGTTCGA TGCTAGCAGC TCAATTGAAA GCACCACCGG ACATTGAAGG CTCATTGATC
TCAGTCACCA TCTTTGTCAA CTTTGTTAAG AACCTAGGAG CAGCGGTCTC GGTGACTATA
GCACAACTTA TTTTCCAAAC AACAGGTCAG AGATATCTTA ATGACCTTAT TCGTAGTCTT
CCACCAGACT CGAGTGAATA CAGAGAATTA ATTCGATACC CTCCCAAACA GGTCATTTCA
AGTCCGGAAA TCATCAAATC GTTTCTGGAG TCTACAAGAC AATTGGTTTT GGACCAGTTC
ATGAAATGTA TTAAGAATGT CTTTTATCTT GCCTTTGCAT TGTCTGTCAT TGCTATGATT
GCTTCGTTTT TTACAACGAA TAAGAGAATC CCCAAGCATT CGGATATAGA AAGAGATGAT
GATGGAGATG ATGAAGAAGC TAATGAGATT CAAAATGAAG ATACTGAAAG CGATCAACTA
AACGAGAACA ATAAGATATC CCCAGATTCT TCGAATGGAG AAAGCTCTTC TGGAAGCGAA
ACCAAGCAGG AATTTACAGC ATAA
 
Protein sequence
MSERQSIPVL ASTIESSRRS PTDLHSQELS PQDLNSCSPK TTRKSGKYYE SNAGDGTKNS 
EQYIKGIKLA VTLVSCVACL FLVALDQTII ATILTQVGDK FQEFEKVGWL TTGFFLPIAC
LSPSYGKIGI AFGRKYTLLV GVIVFEIGSL ISALANSMGM LISGRVVSGL GGGCVQAMVV
LILTESVPIN LRPLSYTLLG VTFSVASVLG PFVGGALATH VSWRWCFYIN LPIGAMASLL
LMLGFHPPKP HGSIRQKMAR IDYLGTFLLT TGLVLVLLAL TFGGISNPWN SALVISFFVV
GGVLCIVFNI WNFGFSTHPM IIKEVIMVPQ AAAACINACF NFGFFIPLMT YLAVYFQVIF
GHSAWKSGVD LIPMIISVTF SSISNGVFIR FSRNVKLTMV ISGVLGPVGT GLLLLLNKHS
SAKDRIGLLI VSGVSIGLSF QSSMLAAQLK APPDIEGSLI SVTIFVNFVK NLGAAVSVTI
AQLIFQTTGQ RYLNDLIRSL PPDSSEYREL IRYPPKQVIS SPEIIKSFSE STRQLVLDQF
MKCIKNVFYL AFALSVIAMI ASFFTTNKRI PKHSDIERDD DGDDEEANEI QNEDTESDQL
NENNKISPDS SNGESSSGSE TKQEFTA
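
The record specifies translation table 12 (the alternative yeast nuclear code), in which the codon CTG (CUG in mRNA) is read as serine rather than leucine. The following is a minimal, self-contained sketch of how the gene sequence could be translated under that table. The helper names `codon_table` and `translate` are hypothetical, and only the single CTG difference from the standard code is modelled; alternative start codons are not handled.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
STANDARD_AA = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def codon_table(table_id: int = 12) -> dict[str, str]:
    """Build a codon -> amino acid map for table 1 (standard) or table 12."""
    codons = ["".join(c) for c in product(BASES, repeat=3)]
    table = dict(zip(codons, STANDARD_AA))
    if table_id == 12:
        table["CTG"] = "S"  # alternative yeast nuclear code: CTG encodes Ser
    return table


def translate(dna: str, table_id: int = 12) -> str:
    """Translate a DNA string, ignoring whitespace, up to the first stop codon."""
    dna = "".join(dna.split()).upper()
    table = codon_table(table_id)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = table[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGTCTGAGC GACAGAGCAT TCCAGTGTTA"))  # MSERQSIPVL
```

The choice of table matters for this gene: the codons GTT CTG GGG in the third block of the gene sequence translate to VSG under table 12, matching the protein sequence, but would give VLG under the standard code.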