Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_63832 |
Symbol | HOL31 |
ID | 4840970 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009048 |
Strand | + |
Start bp | 750355 |
End bp | 752123 |
Gene Length | 1769 bp |
Protein Length | 570 aa |
Translation table | 12 |
GC content | 41% |
IMG OID | 640392285 |
Product | putative ion transporter |
Protein accession | XP_001386551 |
Protein GI | 150866826 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.0755002 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.0915689 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACTCTGA CAATTGATAT AGAAAAGACA ACAACAGCCA AGTCTCTCAA CTACGACTTT GTACCTGGGA CCGTCCACCT AGTGGATGTT ATTGGGAACT TGAGCGTCAA GAAAGATGAT GGTGGCGAAG ATATCATACT TCAGCCCCAA CCCACTTCAA ACATCAATGA TCCCCTTAGA TGGTCCAAGG GGAAGAAGAG GTTTCAATTT TTTCTATTAT GGTTATGGTA TGTCTCGATA CATTCTTTCT TACAACTAAT CAATACTAAC TTTTTGTTTT TAGGAGTTTT CTTCTAGCAG TTTCTCTTAA TTTCTCTGGT CCCTTGTTTG TTATTTGGCT GGTGGAATTG AAAACGACTT TCTTCAAATT GAATGTCCAC ATGGCCCTTG GTTTCTTATT CCTTGGTCTT GGGTGTCTCT TCTTACAACC CACTGCTCTC AAGTTGAGTA GAAGATTTAT TTATTTGTGC TGCACTATCA TAGCAATTGT AGGTAATGCA GTTGGTTCTC AAGCTACCAG TATTAACTTC TTGTATGTGG TGAAAATTTT AGTTGGTTTG GCAGCTGCTC CAGTTGACTC GTTGGTTGAG ATTTCTTCCA CGGATGTTTT CTTCTTACAC GAAAGATCGA GAGCATTCAG TTTGATATTA TTGGCATTGT ATGGTGGAAG TAATTTGGGT CCTGTTGCCT GTGGGTACAT TGTGCAAACT TTGAGCTGGA GATGGTGTTT CTATATCCAA ATCATAATTT TAGGAGTTAT GTTTTTCATT CTTCTCTTTT TGTTGGAAGA CACTTCTTTC AGAAGGGACT TCGACGAAGG TGAGTTAGAG AACAAGATCT TGGAGCAGAT TAAGTCTAAT GATTCAATGG CTCAAAGAGA TAAGCCTCAG ACTGAAAAGG GCCATAAATC AGGTGGAATC ATAACCAACT TGAATGAAGT TGTTGATTCT TCTAGTGATG ATGTCTCTCT AGACAATACC ATTCCTCCTA GAACATACTG GCAAAGAATG CAACTCATTC AGACGCATTA TAACGATACA AGATCATGGA TCACTATTTT CTATAGACCT TTCTTGTTAG CCAGCTTTCC AGCTATTATC TGGGGTAGTG TTCTCTATGG TGCACAAATG ATGTGGTTGT CCTTCTTAGG AGTCACACAA GCACAGATTT ATTCAGCCCC TCCTTACAAC TTCAGTCCAT CTCTGACTGG ATTGACAAGT GTAGGTGCCT TTGTTGGAAA TTTAATTGGT ATGGTTTACG GTGGCAACTT CGTCGATTGG ATGACTTTGA AGATGGCCAA GAGGAACCAT GGTATCTTGG AGCCGGAATT CAGATTGTAC GCTATGATTC TTCCAACCAT CTGCAATGCG GCTGGCCTCT TGGCCTACGG GTTAGGTTCA TACTATGGTG CACACTGGGC TATTTCAGTT GTCATCGGGC AAGTCTTTCT TGGATTTGCC ATGAGTGCAG CCGGATCCAT CTGTTTGACA TATGCTGTTG ATTCATACCA TAATGTTGCA AGTGAAAGTC TTGTGTTGAT GCTTTTCATC AGGAACATGA TTGGTATGGG ATTCACCTTT GCTATTCAGC CTTGGTTGGT GAGTAACGGA TTGAAGACTG TGACATGGTT GATGTTCATG CTCTCAATTG TCATCAATGG TTCTTTCATC TTTATGATTA AGTACGGAAA GAGCATGAGA AGGTGGACAG CCACAAGATA CGAGAAGTAT TCTGATCTTA ACTACGGAGA ACTCTTCCCT AGAAAGTAA
|
Protein sequence | MTSTIDIEKT TTAKSLNYDF VPGTVHLVDV IGNLSVKKDD GGEDIILQPQ PTSNINDPLR WSKGKKRFQF FLLWLWSFLL AVSLNFSGPL FVIWSVELKT TFFKLNVHMA LGFLFLGLGC LFLQPTALKL SRRFIYLCCT IIAIVGNAVG SQATSINFLY VVKILVGLAA APVDSLVEIS STDVFFLHER SRAFSLILLA LYGGSNLGPV ACGYIVQTLS WRWCFYIQII ILGVMFFILL FLLEDTSFRR DFDEGELENK ILEQIKSNDS MAQRDKPQTE KGHKSGGIIT NLNEVVDSSS DDVSLDNTIP PRTYWQRMQL IQTHYNDTRS WITIFYRPFL LASFPAIIWG SVLYGAQMMW LSFLGVTQAQ IYSAPPYNFS PSSTGLTSVG AFVGNLIGMV YGGNFVDWMT LKMAKRNHGI LEPEFRLYAM ILPTICNAAG LLAYGLGSYY GAHWAISVVI GQVFLGFAMS AAGSICLTYA VDSYHNVASE SLVLMLFIRN MIGMGFTFAI QPWLVSNGLK TVTWLMFMLS IVINGSFIFM IKYGKSMRRW TATRYEKYSD LNYGELFPRK
|
| |