Gene PICST_87108 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_87108 
SymbolAUT1 
ID4836720 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009042 
Strand
Start bp598958 
End bp601024 
Gene Length2067 bp 
Protein Length635 aa 
Translation table12 
GC content46% 
IMG OID640388035 
ProductArabinose-proton symporter (Arabinose transporter) 
Protein accessionXP_001382352 
Protein GI126131654 
COG category 
COG ID 
TIGRFAM ID[TIGR00879] MFS transporter, sugar porter (SP) family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.132654 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
CTACAAGTGA CAGTCAGTCG ATTAGACTTT GCATCCACTT GAGTTTGACA ATTGATATAT 
TCCACTAGAG ACAATGAGTG CTGACGAAAA AGTCGCTGCT GCCGGCCAGG ACGGCTTGTT
TGAACACAAC AGTTCCACTT CGAGCATCGA GGACAAGAAG CCCTCCAAGA GCTCCGATGT
CGATTCCGTG AACTCGCAAT TAGTAGACAA CTCGGTAGAG GGCAACATCT TGTCCCAGTA
CACCGAAAGT CAGGTGATGC AGATGGGTAG AAGCTATGCC ACCAAGCACG GCTTGGACCC
AGAATTGTTC GCCAAGGCAG CTGCTGTTGC CAGAACTCCT CTTGGTTTCA ACTCCATGCC
CTTCTTGACA GAGGAAGAGA AGGTTGGTTT GAATGCCGAA GCCACTAATA AGTGGCACAT
TCCACCCAGA TTGATCGGGG TTATTGCCTT GGGTTCTATG GCCGCTGCTG TGCAGGGTAT
GGACGAATCG GTCATTAACG GTGCCAACTT GTTCTACCCC AAGGCTTTCG GAGTCGACAC
CATGCACAAT TCGGACTTGA TTGAAGGTTT GATCAATGGT GCTCCTTACC TTTGCTGTGG
TATTCTTTCC TGTTGGTTGT CTGACGCTTG TAACCGTCGT CTTGGTAGAA AATGGACCAT
TTTCTGGTGT TGTGTCATTT CTGCCATCAC CTGTGTCTGG CAAGGTCTTG TCAACAACTG
GTACCATTTG TTCATTGCTC GTTTCTTCCT TGGATTTGGT GTTGGTATCA AGTCCGCCAC
TGTTCCTGCC TACTCTGCCG AATGTACTCC TAAACACATC AGAGGTTCGT TAGTCATGTT
GTGGCAATTC TTCACAGCTG TTGGTATTAT GTTTGGTTAT GTTGCTTCCT TGGCTTTCTA
CAATGTCGGA GATAGAGGAA TCCATTACGG GTTGAACTGG AGATTGATGC TTGGTTCGGC
CGCTATTCCT GCTGTCATCA TCTTGTTCCA AATTCCTTTC GCTCCTGAAT CTCCACGTTG
GTTAATGGGT AAGGACAGAC ACCTTGAAGC CTTTGAGTCC TTGAAGCAAT TGAGATACGA
AGAACTTGCT GCTGCTCGTG ACTGTTTCTA CCAGTACGTC TTGTTAGCTG AAGAAGGTTC
TTACAAGATC CCAACCCTCA CCAGATTTAA GGAAATGTTC ACCAAGAGAA GAAACAGAAA
CGGTGCCATC GGTGCATTTA TTGTCATGTT CATGCAACAG TTCTGTGGTA TCAACGTCAT
TGCTTACTAC TCTTCGTCTA TCTTTGTCCA ATCTGGTTTC TCTCAAACTT CTGCTTTGAT
CGCTTCTTGG GGTTTCGGTA TGCTTAACTT CACCTTTGCC ATTCCTGCCT TCTTCACAAT
CGATCGTTTC GGTAGAAGAT CCTTATTGTT GGTTACCTTC CCCTTGATGG CTATTTTCTT
ATTGATTGCC GGTTTCGGTT TCTTGATAAA CGAAGAAACA AACTCCAAGG GAAGATTGGG
AATGATCATC ATCGGTATCT ATATGTTCAC CATCTGTTAC TCTTCCGGTG AAGGTCCAGT
TCCTTTCACC TACTCTGCCG AAGCCTTCCC ATTGTACATC AGAGACTTGG GTATGTCTTT
TGCTACTGCC ACCTGTTGGA CTTTCAACTT CATCTTGGCC TTCACCTGGA ACAGATTGGT
CAATGCATTC ACATCTACTG GTGCCTTCGG CTTCTACGCT GCTTGGAACA TCATTGGTTT
CTTCTTGGTC TTATGGTTCT TGCCAGAAAC CAAGGGCTTG ACCTTGGAAG AATTGGACGA
AGTCTTCGCC GTTTCCGCCG TCCAACACGC CAAGTACCAA ACCAAGAGTT TGATCAACTT
CATCCAAAGA TACGTTTTAC GTTCCAAGGT GGCTCCATTG CCTCCATTGT ACGACCACCA
GAGATTGGCT GTCACCAACC CAGAATGGAA CGACAAGCCA GAAGTCTCTT ATGTCGAGTA
GGCTCCTTGA TAACACATTC ATTTATTTCC TCTTTATAAT TAATAGTTAA CTTAGTTGTT
CAATTCTTCA CATCGCCTAG ATAGTAA
 
Protein sequence
MSADEKVAAA GQDGLFEHNS STSSIEDKKP SKSSDVDSVN SQLVDNSVEG NILSQYTESQ 
VMQMGRSYAT KHGLDPELFA KAAAVARTPL GFNSMPFLTE EEKVGLNAEA TNKWHIPPRL
IGVIALGSMA AAVQGMDESV INGANLFYPK AFGVDTMHNS DLIEGLINGA PYLCCGILSC
WLSDACNRRL GRKWTIFWCC VISAITCVWQ GLVNNWYHLF IARFFLGFGV GIKSATVPAY
SAECTPKHIR GSLVMLWQFF TAVGIMFGYV ASLAFYNVGD RGIHYGLNWR LMLGSAAIPA
VIILFQIPFA PESPRWLMGK DRHLEAFESL KQLRYEELAA ARDCFYQYVL LAEEGSYKIP
TLTRFKEMFT KRRNRNGAIG AFIVMFMQQF CGINVIAYYS SSIFVQSGFS QTSALIASWG
FGMLNFTFAI PAFFTIDRFG RRSLLLVTFP LMAIFLLIAG FGFLINEETN SKGRLGMIII
GIYMFTICYS SGEGPVPFTY SAEAFPLYIR DLGMSFATAT CWTFNFILAF TWNRLVNAFT
STGAFGFYAA WNIIGFFLVL WFLPETKGLT LEELDEVFAV SAVQHAKYQT KSLINFIQRY
VLRSKVAPLP PLYDHQRLAV TNPEWNDKPE VSYVE