Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_33429 |
Symbol | AZR2 |
ID | 4840712 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009047 |
Strand | + |
Start bp | 631297 |
End bp | 633180 |
Gene Length | 1884 bp |
Protein Length | 627 aa |
Translation table | 12 |
GC content | 42% |
IMG OID | 640392027 |
Product | MDR transporter |
Protein accession | XP_001386130 |
Protein GI | 150866502 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.0921867 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.433771 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCTGAGC GACAGAGCAT TCCAGTGTTA GCAAGTACAA TTGAATCATC CAGACGGTCC CCAACCGATC TACACTCTCA GGAGCTCTCT CCACAAGACT TGAACTCCTG CTCTCCTAAA ACAACACGCA AATCAGGAAA GTACTACGAA AGTAATGCAG GTGACGGTAC AAAGAATTCA GAACAGTATA TCAAAGGAAT CAAGTTGGCT GTGACCCTCG TTAGTTGTGT AGCCTGCTTG TTCCTTGTAG CATTGGACCA GACTATTATA GCCACCATTT TAACACAAGT AGGTGACAAG TTCCAGGAAT TCGAGAAGGT GGGATGGTTA ACCACAGGAT TCTTTCTACC TATAGCCTGT TTGTCGCCGT CCTATGGGAA AATAGGCATA GCCTTTGGAA GAAAATATAC TCTCCTAGTA GGAGTGATTG TCTTTGAAAT TGGCTCGTTG ATCTCAGCAT TAGCCAACTC TATGGGTATG TTGATTTCAG GAAGGGTCGT TCTGGGGCTT GGAGGAGGTT GTGTACAAGC CATGGTAGTT TTGATCTTGA CAGAATCTGT ACCCATAAAC CTAAGACCAC TTTCATATAC TTTGCTTGGA GTTACATTTT CAGTTGCAAG TGTTCTTGGC CCTTTTGTTG GCGGAGCCCT AGCCACACAC GTTTCGTGGA GATGGTGTTT CTATATCAAT CTCCCCATTG GAGCTATGGC TCTGCTACTT CTTATGCTCG GATTCCATCC TCCAAAACCT CATGGAAGCA TCAGACAGAA AATGGCCAGA ATCGACTACT TGGGAACATT CCTACTTACA ACTGGGTTGG TTCTCGTATT GCTTGCCTTG ACCTTTGGTG GTATAAGCAA TCCCTGGAAC TCGGCCTTGG TGATATCTTT CTTTGTCGTT GGTGGAGTAC TTTGTATCGT GTTCAACATC TGGAATTTCG GCTTCTCCAC ACATCCCATG ATCATTAAGG AGGTAATCAT GGTTCCTCAG GCTGCTGCTG CTTGCATCAA TGCCTGTTTT AATTTTGGAT TCTTTATTCC ACTAATGACA TATTTGGCCG TCTATTTCCA AGTAATCTTT GGACATTCAG CATGGAAATC GGGTGTTGAT CTCATTCCCA TGATCATATC AGTAACATTT TCCTCGATAT CTAATGGTGT CTTTATTAGA TTTTCCAGAA ACGTCAAGCT TACCATGGTG ATCTCAGGAG TATTGGGTCC GGTCGGTACC GGATTGCTTC TTCTATTGAA CAAACATTCT TCTGCCAAAG ATAGGATTGG CTTGTTGATC GTCTCTGGTG TTTCTATCGG GTTATCTTTT CAAAGTTCGA TGCTAGCAGC TCAATTGAAA GCACCACCGG ACATTGAAGG CTCATTGATC TCAGTCACCA TCTTTGTCAA CTTTGTTAAG AACCTAGGAG CAGCGGTCTC GGTGACTATA GCACAACTTA TTTTCCAAAC AACAGGTCAG AGATATCTTA ATGACCTTAT TCGTAGTCTT CCACCAGACT CGAGTGAATA CAGAGAATTA ATTCGATACC CTCCCAAACA GGTCATTTCA AGTCCGGAAA TCATCAAATC GTTTCTGGAG TCTACAAGAC AATTGGTTTT GGACCAGTTC ATGAAATGTA TTAAGAATGT CTTTTATCTT GCCTTTGCAT TGTCTGTCAT TGCTATGATT GCTTCGTTTT TTACAACGAA TAAGAGAATC CCCAAGCATT CGGATATAGA AAGAGATGAT GATGGAGATG ATGAAGAAGC TAATGAGATT CAAAATGAAG ATACTGAAAG CGATCAACTA AACGAGAACA ATAAGATATC CCCAGATTCT TCGAATGGAG AAAGCTCTTC TGGAAGCGAA ACCAAGCAGG AATTTACAGC ATAA
|
Protein sequence | MSERQSIPVL ASTIESSRRS PTDLHSQELS PQDLNSCSPK TTRKSGKYYE SNAGDGTKNS EQYIKGIKLA VTLVSCVACL FLVALDQTII ATILTQVGDK FQEFEKVGWL TTGFFLPIAC LSPSYGKIGI AFGRKYTLLV GVIVFEIGSL ISALANSMGM LISGRVVSGL GGGCVQAMVV LILTESVPIN LRPLSYTLLG VTFSVASVLG PFVGGALATH VSWRWCFYIN LPIGAMASLL LMLGFHPPKP HGSIRQKMAR IDYLGTFLLT TGLVLVLLAL TFGGISNPWN SALVISFFVV GGVLCIVFNI WNFGFSTHPM IIKEVIMVPQ AAAACINACF NFGFFIPLMT YLAVYFQVIF GHSAWKSGVD LIPMIISVTF SSISNGVFIR FSRNVKLTMV ISGVLGPVGT GLLLLLNKHS SAKDRIGLLI VSGVSIGLSF QSSMLAAQLK APPDIEGSLI SVTIFVNFVK NLGAAVSVTI AQLIFQTTGQ RYLNDLIRSL PPDSSEYREL IRYPPKQVIS SPEIIKSFSE STRQLVLDQF MKCIKNVFYL AFALSVIAMI ASFFTTNKRI PKHSDIERDD DGDDEEANEI QNEDTESDQL NENNKISPDS SNGESSSGSE TKQEFTA
|
| |