Gene Information |
Locus tag | PICST_57811 |
Symbol | |
ID | 4837948 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009043 |
Strand | - |
Start bp | 375362 |
End bp | 376636 |
Gene Length | 1275 bp |
Protein Length | 424 aa |
Translation table | 12 |
GC content | 41% |
IMG OID | 640389263 |
Product | predicted protein |
Protein accession | XP_001383700 |
Protein GI | 126134351 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0814] Amino acid permeases |
TIGRFAM ID | |
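
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the protein length excludes the terminal stop codon (reasonable here because the gene is listed as unspliced). The function names `gene_length` and `protein_length` are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count of an unspliced CDS, not counting the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(375362, 376636)
print(length)                  # 1275, matching the Gene Length field
print(protein_length(length))  # 424, matching the Protein Length field
```

Both values agree with the record, which suggests the coordinates, gene length, and protein length are internally consistent.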
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.00397212 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.0130969 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGGCCAATT CCATTTTGGG AGCTGGGATC ATTGGCCAGC CATTTGCTTT CAGAAACGCT GGACTTGTTG GCGGAATCTT GGTAATGATT GGTTTGACTT TTGTCATTGA CTGGACTCTC CGATTGATTG TCATCAACTC CCACTTGTCG CAGACTAGAT CGTACCAAGA CACCGTAAAC TATTGCTATG GAACTTACGG CAAGATCTTA CTTTTGTTTG CTATTAGTTC CTTTGCGTAT GGAGGTTGTA TGGCTTTCTG TGTAATCATC GGTGATACCA TACCTCATGT ATTGAAGGCA TTCATCCCCA AATCGGTCAC TAGTTCTGAC TCCGTTTTTG GATGGTTGTT CAAGCGTAAT TCAATCATTG TATTGTTCAC CACTTGCATT TCATATCCCT TGTCGCTTAA TAGAGACATC TCTAAATTGG CAAAGGCTTC AGGCTTTGCA TTAGTAGGGA TGTTGATCAT CGTAGTTCTC ACTGTAGCCA GAGGGCCATT TGTGGATCCT TCGTTGAGAT CTGATTTGAC AGCCTTGGAA TGGACAGTTA ACTTTAACAT TTTCCAAGGT ATTTCAGTTA TTTCTTTCGC ATTGGTCTGC CACCATAACA CCATTCTCAT CTACCAATCA ATGAAGAATG CTACCCTCTC CAAATTTGCC AAATTGACCC ACATCTCATG TGGTGTTTCT ATGGTCTGTT GTTTGGTTAT GGGAATCAGC GGTTTGTTGA ACTTTGGTGA TGCCACAAAG GGAAACATCT TGAACAACTT CAAGAGCAAC GATAACTGGA TTAACGTAGC CAGGTTTTGC TTTGGATTGA ATATGTTGAC TACCTTTCCG TTAGAGATTT TTGTCGTCAG AGATGTCCTA AAAGACATTG TCCTAGCAAA CAGTTCCGAC GCTCAGAATG GAAGCACAGC CCATCTTGAG TTGAGCTCCA AACAGCATTT TGTTATAACC ACCGTGCTCG TGTTTTCTTC TATGTCTGTT TCATTATTCA CCTGTAATCT CGGTATTATA TTAGAGTTGA TAGGTGCAAC TTCGGCATCA TTGATGGCAT ACATCATCCC TCCATTGTGC TACTTCAAGC TTTCATGGGA TCAAATCGAC TACAAGAACG CAGGAAAAAA AGACAAGAGA GACTTTATCA TATGGAAGGC ATTACCCAGC ATCTCCTGTG TTCTCTTTGG CTTTGCTGTT ATGTTTATTT CGTCATTCAT GAGCATTCGT ACCAGTCTTA AAGACACTGA AGGTGGTCAT TGTGTAGAAG ATTAA
|
Protein sequence | MANSILGAGI IGQPFAFRNA GLVGGILVMI GLTFVIDWTL RLIVINSHLS QTRSYQDTVN YCYGTYGKIL LLFAISSFAY GGCMAFCVII GDTIPHVLKA FIPKSVTSSD SVFGWLFKRN SIIVLFTTCI SYPLSLNRDI SKLAKASGFA LVGMLIIVVL TVARGPFVDP SLRSDLTALE WTVNFNIFQG ISVISFALVC HHNTILIYQS MKNATLSKFA KLTHISCGVS MVCCLVMGIS GLLNFGDATK GNILNNFKSN DNWINVARFC FGLNMLTTFP LEIFVVRDVL KDIVLANSSD AQNGSTAHLE LSSKQHFVIT TVLVFSSMSV SLFTCNLGII LELIGATSAS LMAYIIPPLC YFKLSWDQID YKNAGKKDKR DFIIWKALPS ISCVLFGFAV MFISSFMSIR TSLKDTEGGH CVED
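
The sequences above are printed in space-separated blocks of 10 characters, so they need to be normalized before any computation. The following is a minimal sketch of how the blocks could be joined and a GC fraction computed for comparison with the GC content field. The helper names `normalize` and `gc_content` are hypothetical, and only the first three blocks of the gene sequence are used as sample input.

```python
def normalize(seq: str) -> str:
    """Remove all whitespace between the 10-character blocks and upper-case."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence (0.0 to 1.0)."""
    cleaned = normalize(seq)
    if not cleaned:
        raise ValueError("empty sequence")
    gc = sum(1 for base in cleaned if base in "GC")
    return gc / len(cleaned)


# First three blocks of the gene sequence, copied from the record above.
prefix = "ATGGCCAATT CCATTTTGGG AGCTGGGATC"
print(len(normalize(prefix)))  # 30
print(gc_content(prefix))      # GC fraction of this 30-base prefix only
```

Passing the full Gene sequence field to `gc_content` should give a value close to the listed 41%, assuming that figure was computed over the same 1275 bases.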