Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_44978 |
Symbol | |
ID | 4838480 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009044 |
Strand | + |
Start bp | 1382031 |
End bp | 1383629 |
Gene Length | 1599 bp |
Protein Length | 532 aa |
Translation table | 12 |
GC content | 42% |
IMG OID | 640389795 |
Product | predicted protein |
Protein accession | XP_001384222 |
Protein GI | 150865131 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.800208 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCACCA ACAAATTCAC CATCCAAGAG CAGCTAAGTG GTTTCCCTGT ATTCCAGATG GTTATCATAG GATTTCTCCG ACTAAGCGAG CCCATCGCCT TTACTTCGAT GTTCCCCTAC ATCTACTTCA TGATCAAGCA TTTTGATATC GCAGAGGACG ATGCCCAAAT TTCTACTTAC AGTGGATATT TAGCAGCTTC CTTCTCATTC AGCCAGTTTC TAAGCACAGT ACACTGGGCA AAGGCTTCAA ACAAATATGG GAGGAAAACT ATACTATTGT GTGGTTGTGC TGGAACAGCA TTTTCCATGA TAATCTTTGG TTTGAGCAAA AACTTTTACA TGGCTCTATT TGCTCGACTG TTGATGGGAT TGCTCAATGG TAACGTCTCT ATCATGAGAA CAACAGTGGG AGAAATCGCT CACGAAAATA GACATCAAGG TCTTGCTTTC AGTAACCTTT CCCTCCTTTG GAGTTTTGGT AAATGTATAG GTGGATGGTT GGGAGGAGTG CTTACGAGTA CAAAAGTATC AAAGTCTTCT ACTACAAAGC TACTGAGAGC AGATGAAGGG TTGTTTTCGC GTTACCCATT CCTCCTTTCA AACGTGGTGG TTGCAGTATT GATATTGATA TTCATCGTTA TAGGCTGGCT TTTCTTGGAA GAAACTCATG AGGAGAAAAA GTATTCCAGA GATATTGGAC TAGAAGTAGG AGATGCGCTT AGACGTCTGT TGGGATTCCA AGTACCCGAA AGACCCTGGA AACTGAGAGG ACAATACTTT GAAGTGGGAC AACCCCTATT GGATGAAGAG ATGGAAGACA ACTCGGTCAT AGAAATGCAC AACTATCCCC ACAAGGGTAA ACTGAGTCGA GTAGACATCT CAGATGCTGA CGCCTCACAA AGTGAAACTG ATACAGAAGC AGAACATGAG CTGATAATAC CATTGGCCGT AAGGAAGTGC ATAATAAGCA ATTTTATGTG TTCCTTCCAG AATTTGATCT ACGTCGAGTT TTATCCAGTC TTACTAGCTA AAGCACTTCG AGTAGAAGAT TTAAAATTCC CGTTTCATAT TAAAGGAGGG TACGGCTTCA GTGCTGCAGA GATCGGAAAA CTTTTATCTA TAACAGGGTT AATTGGGGTA GTTCTTGTTT CATTACTTTT TCCTGTAATT ACCAAATATT GCAGAACTGA TCTTGGGTTC AGGATAGGAT TGTCCATAAA TCCCATTATC TACTTCTTTT TACCGTTATA TGTGTTCACA CTGCACAAGT ACAATGAGGC AATGCCCAAA TATGTCACCG GGTTATTACT ATACTTGAAT AGTAGCGTAG TTTCCTTCGC TAACGGGATT ACCTTCGCCC AAAATCTAAT TTTGATTCAC AGGGCATCTC CAAAGAAGCA AAGAGCGTTA ATAAACAGCT ATGCCATGAC AGTCACCGCC TTAGCTCGGT GTGCTGCCCC AATAATTTGG GGGTGGATTA TTTCTAAGTT TGATGCGCAG GGTTATGGTG GGATGTCGTG GTGGGTTCTC TCAGTGTGGT CAATAATGAC ATTTTCACAT TCGCTTTTCA TCCATGAGAC TGATGCGGAG GAGACCTAA
|
Protein sequence | MTTNKFTIQE QLSGFPVFQM VIIGFLRLSE PIAFTSMFPY IYFMIKHFDI AEDDAQISTY SGYLAASFSF SQFLSTVHWA KASNKYGRKT ILLCGCAGTA FSMIIFGLSK NFYMALFARS LMGLLNGNVS IMRTTVGEIA HENRHQGLAF SNLSLLWSFG KCIGGWLGGV LTSTKVSKSS TTKLSRADEG LFSRYPFLLS NVVVAVLILI FIVIGWLFLE ETHEEKKYSR DIGLEVGDAL RRSLGFQVPE RPWKSRGQYF EVGQPLLDEE MEDNSVIEMH NYPHKGKSSR VDISDADASQ SETDTEAEHE SIIPLAVRKC IISNFMCSFQ NLIYVEFYPV LLAKALRVED LKFPFHIKGG YGFSAAEIGK LLSITGLIGV VLVSLLFPVI TKYCRTDLGF RIGLSINPII YFFLPLYVFT SHKYNEAMPK YVTGLLLYLN SSVVSFANGI TFAQNLILIH RASPKKQRAL INSYAMTVTA LARCAAPIIW GWIISKFDAQ GYGGMSWWVL SVWSIMTFSH SLFIHETDAE ET
|
| |