Gene Information |
Locus tag | PICST_30939 |
Symbol | CAN1 |
ID | 4837878 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009043 |
Strand | + |
Start bp | 1321137 |
End bp | 1324942 |
Gene Length | 3806 bp |
Protein Length | 552 aa |
Translation table | 12 |
GC content | 41% |
IMG OID | 640389193 |
Product | arginine permease |
Protein accession | XP_001383542 |
Protein GI | 126134035 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0833] Amino acid transporters |
TIGRFAM ID | [TIGR00913] amino acid permease (yeast); [TIGR01131] ATP synthase subunit 6 (eukaryotes), also subunit A (prokaryotes) |
| |
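The coordinates and lengths in the Gene Information table can be cross-checked with a little arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the gene span includes the stop codon; neither convention is stated in the record itself. The function names are illustrative only.

```python
# Values copied from the Gene Information table.
START_BP = 1321137
END_BP = 1324942
PROTEIN_LENGTH_AA = 552


def span_length(start: int, end: int) -> int:
    """Length of a coordinate span, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_length(protein_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode a protein, optionally counting the stop codon."""
    return protein_aa * 3 + (3 if include_stop else 0)


gene_length = span_length(START_BP, END_BP)     # should match "Gene Length"
cds_length = coding_length(PROTEIN_LENGTH_AA)   # coding bases plus stop codon
non_coding = gene_length - cds_length           # bases left over within the gene span

print(f"gene span:  {gene_length} bp")
print(f"coding:     {cds_length} bp")
print(f"non-coding: {non_coding} bp")
```

Under these assumptions the span works out to the listed 3806 bp, while the 552 aa protein accounts for only 1659 bp including the stop codon. The remaining 2147 bp fits the record's "Is gene spliced | Yes" entry.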
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.0328856 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.365315 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGACAATG ACTACGAGGA CGCCGAAGAT GCCGAAGACG AAGGTGAGGA CGAAGAGGAG GACGAAGATG AAGTCGAAGA TGAAGATTCT GGTAGATTAC ATGTTCCATC CGTCAAACCC AGAAGTAGAT CACCCAGAAT CTCTGCTTTC AACTTATTGA AAAACGACAG CGATGACGAG AACGAAGATT TATCGCCAGA GCAGTTGGAG GAAATCAAGA GAATGAAGTC GGTGTACAAA GTCTACTTTG ATAAGGACAA TGAAAACAAT GGCCAGTTCA AACTGGAAGA TCATCCATTG CCTAATATAG ATTTAAAGGG TCAGCGTATT AACAAACATT TAAATATGGA TACCGACTAC TCTAAAAGAT TGACTACAAC ATCATCCATA TATAATGAAA ACCCCATCTA TCCACACGAA GAGCAGTATT ACTATCAGCC AGAGGAAACC CAGTTTCAAG AACAGTACTA CCAAGATCAA GGGTATCCTG AACCAGAGGC TAGACCATTG CCTCCGTTGC AGAAGTTGCG TAATGCCTCG GATATTCGTC ATTCTACCTT GCAAACATAT ACTGATTTCC AACCTAGATC TAAAAACGCT GCGCAAGCAA ATAGCAAGCT TCCATTTAAC CCCATCGAAA ACGATGGTGT TTGGTCGCCT CCTATGAACT CACCAAGTTT ATCGCAATCG TCATTTTCGT TACAACAACA GCAAGGTGGC CCATCCAGTA ATTATGCCGC TAGTGTAGAC CAAGTTCTTC CATCCAACGC AATTTCACCC AACGGTGGAA TACCGTCTGC CACACAATTG GCCAGATCTT CTGTAGTCAT GTTGAACCCG GTAACAGAAA TCACACACAA GAAGGCGTAC AAACCTGCTG GCTCCTTGCC TTCTAGTGGC TCGAACAGTA ACTTCAACCA GGGCGGTCCT ATGGCATACG AAAATAGTCA GTCGATGCTC AACGGACCTG AAAATGACTT GATACCAGGC AATAGAAAGA GTGATGTTAG ACGTATGATG AACACCAACT TCTGACTCAA GAGTTACGTT ACTATGGGAT TTCTATATAT TTATTCACTT CGAGGGGTCA TTATTAGCTT CATTGCGTAT ACATTCAATA AATATTGTAC GAACTAACAG AACTAGCGTA GGAAAAACAG AAGAAGTTTT AGTACTAAAC AGGCACCCGC ATAACTCAAT TTCGTACCGA ATTAAGGGCG GAATCGATTA TTAATTTTAT TTATTGCAAA TTCGAAAAGG AAAACTTTGC CGAGCAGTAA AACATCAATA TCGTTCAGAA AAAAATCGTA AAAAACAGTA AAAAATATTT AAAAATGAAA TAAAGGAATT AATGATTATA TTAAGGCGGG GGGTTTTCTG CCCCCAAATG CCGTTACAAA CATAATAAGA ATCAAAAAAT TTGCACCGCC GTTTCCTTCA GTTGCAGGTA GCTCATCTCG CTTGTCTTCA ACATTCACAA TAAAATAATT GAAAAAATAA GCAATAAGGA AAGGTTATGG AAAATCGGAG TTCACCTAAA AGCTGTACAT CAAATTTTTT GCACCTATTT GGATTGTCGA GATAAGATCT GTGAATCTGT CCATAGCCAC CTTACCCCAC AAGTCGATAA CGCTGAGAAA GGATATGCAA ATGGTTAAAA CTTAGAGGGG TTTATCACAA TCTAAGCGCT AACAGAAGGT GTGAACGAAG GTTGTGATGG TTTTGGGCTT TGGCAGAATT GGTAAGTTTG CTAACAGCAA CCTTCAACGA AGAGGTCTAA ATTTTTGTTC GTGGCATTTT 
TCTGTGAGCG GCTTTTTTGG AACCAAATGC CTTATGCGCA CCGCCCAAAT AGGCAACTTA GTCGCAAAAT TGCACAAGGC TTCATGTCTC AGTTGCAGTT GCCTTTCAAC GATACTCTAT GCTATCTGAA GCTCTTGCAT ATCATATACT GCAGCTTCTA AAGTGTAATC ACAGATTGCT TAAGACTTTG CTCGGGCTCT TGCGACAGAT GGCCCTATAT AAAACGCCAA TCCCTCATGA ATTTAATCAG AAGACATAGC TGTAGAATCT TAAGAATTGG CTTATCTTTG TAACATAAAG AACACCAGGT CTCGACTACA ATGACTGCAA CTGAAAAAGA AGTCATACCC ATAATCTCCT TTACCGAAAA CATTTCCACC TCGGAAGGAA AACCCTCTGA CATTCTTGAA TATGCTGGAA GTGAGTCTTC CTTGGAAGAA GTAATTCCTA AGGATGAAAT CAAGAGAAGC TTGAAGGCTA GACAAATTTC CATGATTGCA ATAGGTGGGA GTATTGGTAC TGGTTTATTC ATCTCTACAG GTTCAACTCT TGCTGATGCG GGTCCAGTAT CAGCCCTTAT ATCTTTCTTG TTCATGACAA CGTTAGCATT TTCAGTTACC CAATCTTTGG GAGAAATGGC AACTTTAATC CCCGTTGCTG GATCTTTCAC CCAATTTGTT ACCAGATGGT GCTCCCCTGC TCTAGGTGCT GCCAATGGTT GGAACTATTG GTTCTCGTGG GCCATTACTT TTGCCTTGGA ATTGTCTGTT GTTGGCCAGA TTATTCAGAA GTGGACCGAT GCTGTTCCAC TCGAAGCTTG GATCAGTATC TTCTTCGTTT TGTTGACAGT GTTCAACCTT TTCCCAGTCA AATACTATGG TGAAGTTGAG TTTTGGATTG CTTTTATCAA AGTGATAGCA GTATTTGGTT GGATAATTTA TGCTTTCGTT ATGGTCTGCG GAGGTGGTGA AACGGGTCCA GTTGGTTTCA GATACTGGAG GAATCCCGGC CCTTGGGGAC CTGGTATTTT GGTTGAAAAC AAAAACACAG GCAGATTTTT GGGTTGGTTA GCTTCTTTGA TCAGTGCTGC ATTTACATTC CAAGGTACCG AGTTGGTTGG AATTTCAGCT GGAGAAAGTA GCAACCCTAG GAAAACTGTA CCATCTGCTA TCAGAAAGGT CTTATTCAGA ATTTTGGTAT TCTACGTTTT GTGTATGCTC TTTATTGGAT TATTAGTACC ATACGATGAT CCAAAGCTTT CTGCCAACGG AAGCTACACT GCCTCTTCCC CATTCATTGT TGCTATGCAA AATTCAGGAA CTAAGGTTTT GCCAGATGTC TTCAATGCTG TAGTATTGAT GACAATTGTC TCAGCTGGAA ATTCTAACGT TTACTGCGGA TCACGTATTT TGTATGGTTT AGCACAATCT GGTGTTGCAC CCAAAGTTTT TAAGAAGACC ACAAAGCAAG GTGTACCATA CGTTTCCGTT ATTTTCACTG CCTTGTTTGG TTTATTGGGA TTCTTGGCCA CTTCCTCTAG AGGCAATGAA GCCTTCAATT GGTTGTTGAA CATCATTGCT ACTTCTGGAA TGATTGCCTG GGGTTTTATC AGTTACACAC ACATTAGATT CATGAATGTG TTGAAATCCA GAAACATCAG CCGTGACTCT TTGCCGTTCA AGGCCAAATT TATGCCTTTC AATGCTTACT ATGCTGCAAT TGCTATTTTC ATTTTGGTAT TTGTCCAAGG GTTCCAATGT TTCTTTGATT TCACTGCTAC TGGTTTCTTC ACCGCATATG TTTCGCTTAT 
TCTTTTTGTT GTGTGTTTCA TAGTGTTCCA CTTCGTTTTC AACGGCTTTG GAAGAAAATC CTTCGAATGG TCATCATTCT TGGTGCCTCT TGATGAATGT GATATTGATA CTGGAGTAAG AGAGATAGAA GAAATGGAAT GGGAAGAGAA CGAGCCAACA AATATATGGC AGAAATTCTG GAGCTGGGTT TCGTAA
|
Protein sequence | MDNDYEDAED AEDEGEDEEE DEDEVEDEDS EVIPKDEIKR SLKARQISMI AIGGSIGTGL FISTGSTLAD AGPVSALISF LFMTTLAFSV TQSLGEMATL IPVAGSFTQF VTRWCSPALG AANGWNYWFS WAITFALELS VVGQIIQKWT DAVPLEAWIS IFFVLLTVFN LFPVKYYGEV EFWIAFIKVI AVFGWIIYAF VMVCGGGETG PVGFRYWRNP GPWGPGILVE NKNTGRFLGW LASLISAAFT FQGTELVGIS AGESSNPRKT VPSAIRKVLF RILVFYVLCM LFIGLLVPYD DPKLSANGSY TASSPFIVAM QNSGTKVLPD VFNAVVLMTI VSAGNSNVYC GSRILYGLAQ SGVAPKVFKK TTKQGVPYVS VIFTALFGLL GFLATSSRGN EAFNWLLNII ATSGMIAWGF ISYTHIRFMN VLKSRNISRD SLPFKAKFMP FNAYYAAIAI FILVFVQGFQ CFFDFTATGF FTAYVSLILF VVCFIVFHFV FNGFGRKSFE WSSFLVPLDE CDIDTGVREI EEMEWEENEP TNIWQKFWSW VS