Gene PICST_30939 details


Gene Information

Locus tag: PICST_30939
Symbol: CAN1
ID: 4837878
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009043
Strand:
Start bp: 1321137
End bp: 1324942
Gene Length: 3806 bp
Protein Length: 552 aa
Translation table: 12
GC content: 41%
IMG OID: 640389193
Product: arginine permease
Protein accession: XP_001383542
Protein GI: 126134035
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0833] Amino acid transporters
TIGRFAM ID: [TIGR00913] amino acid permease (yeast)
            [TIGR01131] ATP synthase subunit 6 (eukaryotes), also subunit A (prokaryotes)
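
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only and is not part of the database record. It assumes the start and end positions are 1-based and inclusive (which matches the reported 3806 bp) and that the gene span includes the stop codon. The function names are hypothetical. Under those assumptions the coding part of a 552 aa protein is 1659 bp, which leaves 2147 bp of the span non-coding, consistent with the gene being marked as spliced.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature from 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def coding_length(protein_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode a protein, optionally counting the stop codon."""
    return protein_aa * 3 + (3 if include_stop else 0)


# Values copied from the record above.
span = gene_length(1321137, 1324942)   # reported gene length: 3806 bp
cds = coding_length(552)               # 552 codons plus one stop codon
non_coding = span - cds                # remainder of the span not accounted for by codons

print(span, cds, non_coding)
```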


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0328856
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.365315
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGACAATG ACTACGAGGA CGCCGAAGAT GCCGAAGACG AAGGTGAGGA CGAAGAGGAG 
GACGAAGATG AAGTCGAAGA TGAAGATTCT GGTAGATTAC ATGTTCCATC CGTCAAACCC
AGAAGTAGAT CACCCAGAAT CTCTGCTTTC AACTTATTGA AAAACGACAG CGATGACGAG
AACGAAGATT TATCGCCAGA GCAGTTGGAG GAAATCAAGA GAATGAAGTC GGTGTACAAA
GTCTACTTTG ATAAGGACAA TGAAAACAAT GGCCAGTTCA AACTGGAAGA TCATCCATTG
CCTAATATAG ATTTAAAGGG TCAGCGTATT AACAAACATT TAAATATGGA TACCGACTAC
TCTAAAAGAT TGACTACAAC ATCATCCATA TATAATGAAA ACCCCATCTA TCCACACGAA
GAGCAGTATT ACTATCAGCC AGAGGAAACC CAGTTTCAAG AACAGTACTA CCAAGATCAA
GGGTATCCTG AACCAGAGGC TAGACCATTG CCTCCGTTGC AGAAGTTGCG TAATGCCTCG
GATATTCGTC ATTCTACCTT GCAAACATAT ACTGATTTCC AACCTAGATC TAAAAACGCT
GCGCAAGCAA ATAGCAAGCT TCCATTTAAC CCCATCGAAA ACGATGGTGT TTGGTCGCCT
CCTATGAACT CACCAAGTTT ATCGCAATCG TCATTTTCGT TACAACAACA GCAAGGTGGC
CCATCCAGTA ATTATGCCGC TAGTGTAGAC CAAGTTCTTC CATCCAACGC AATTTCACCC
AACGGTGGAA TACCGTCTGC CACACAATTG GCCAGATCTT CTGTAGTCAT GTTGAACCCG
GTAACAGAAA TCACACACAA GAAGGCGTAC AAACCTGCTG GCTCCTTGCC TTCTAGTGGC
TCGAACAGTA ACTTCAACCA GGGCGGTCCT ATGGCATACG AAAATAGTCA GTCGATGCTC
AACGGACCTG AAAATGACTT GATACCAGGC AATAGAAAGA GTGATGTTAG ACGTATGATG
AACACCAACT TCTGACTCAA GAGTTACGTT ACTATGGGAT TTCTATATAT TTATTCACTT
CGAGGGGTCA TTATTAGCTT CATTGCGTAT ACATTCAATA AATATTGTAC GAACTAACAG
AACTAGCGTA GGAAAAACAG AAGAAGTTTT AGTACTAAAC AGGCACCCGC ATAACTCAAT
TTCGTACCGA ATTAAGGGCG GAATCGATTA TTAATTTTAT TTATTGCAAA TTCGAAAAGG
AAAACTTTGC CGAGCAGTAA AACATCAATA TCGTTCAGAA AAAAATCGTA AAAAACAGTA
AAAAATATTT AAAAATGAAA TAAAGGAATT AATGATTATA TTAAGGCGGG GGGTTTTCTG
CCCCCAAATG CCGTTACAAA CATAATAAGA ATCAAAAAAT TTGCACCGCC GTTTCCTTCA
GTTGCAGGTA GCTCATCTCG CTTGTCTTCA ACATTCACAA TAAAATAATT GAAAAAATAA
GCAATAAGGA AAGGTTATGG AAAATCGGAG TTCACCTAAA AGCTGTACAT CAAATTTTTT
GCACCTATTT GGATTGTCGA GATAAGATCT GTGAATCTGT CCATAGCCAC CTTACCCCAC
AAGTCGATAA CGCTGAGAAA GGATATGCAA ATGGTTAAAA CTTAGAGGGG TTTATCACAA
TCTAAGCGCT AACAGAAGGT GTGAACGAAG GTTGTGATGG TTTTGGGCTT TGGCAGAATT
GGTAAGTTTG CTAACAGCAA CCTTCAACGA AGAGGTCTAA ATTTTTGTTC GTGGCATTTT
TCTGTGAGCG GCTTTTTTGG AACCAAATGC CTTATGCGCA CCGCCCAAAT AGGCAACTTA
GTCGCAAAAT TGCACAAGGC TTCATGTCTC AGTTGCAGTT GCCTTTCAAC GATACTCTAT
GCTATCTGAA GCTCTTGCAT ATCATATACT GCAGCTTCTA AAGTGTAATC ACAGATTGCT
TAAGACTTTG CTCGGGCTCT TGCGACAGAT GGCCCTATAT AAAACGCCAA TCCCTCATGA
ATTTAATCAG AAGACATAGC TGTAGAATCT TAAGAATTGG CTTATCTTTG TAACATAAAG
AACACCAGGT CTCGACTACA ATGACTGCAA CTGAAAAAGA AGTCATACCC ATAATCTCCT
TTACCGAAAA CATTTCCACC TCGGAAGGAA AACCCTCTGA CATTCTTGAA TATGCTGGAA
GTGAGTCTTC CTTGGAAGAA GTAATTCCTA AGGATGAAAT CAAGAGAAGC TTGAAGGCTA
GACAAATTTC CATGATTGCA ATAGGTGGGA GTATTGGTAC TGGTTTATTC ATCTCTACAG
GTTCAACTCT TGCTGATGCG GGTCCAGTAT CAGCCCTTAT ATCTTTCTTG TTCATGACAA
CGTTAGCATT TTCAGTTACC CAATCTTTGG GAGAAATGGC AACTTTAATC CCCGTTGCTG
GATCTTTCAC CCAATTTGTT ACCAGATGGT GCTCCCCTGC TCTAGGTGCT GCCAATGGTT
GGAACTATTG GTTCTCGTGG GCCATTACTT TTGCCTTGGA ATTGTCTGTT GTTGGCCAGA
TTATTCAGAA GTGGACCGAT GCTGTTCCAC TCGAAGCTTG GATCAGTATC TTCTTCGTTT
TGTTGACAGT GTTCAACCTT TTCCCAGTCA AATACTATGG TGAAGTTGAG TTTTGGATTG
CTTTTATCAA AGTGATAGCA GTATTTGGTT GGATAATTTA TGCTTTCGTT ATGGTCTGCG
GAGGTGGTGA AACGGGTCCA GTTGGTTTCA GATACTGGAG GAATCCCGGC CCTTGGGGAC
CTGGTATTTT GGTTGAAAAC AAAAACACAG GCAGATTTTT GGGTTGGTTA GCTTCTTTGA
TCAGTGCTGC ATTTACATTC CAAGGTACCG AGTTGGTTGG AATTTCAGCT GGAGAAAGTA
GCAACCCTAG GAAAACTGTA CCATCTGCTA TCAGAAAGGT CTTATTCAGA ATTTTGGTAT
TCTACGTTTT GTGTATGCTC TTTATTGGAT TATTAGTACC ATACGATGAT CCAAAGCTTT
CTGCCAACGG AAGCTACACT GCCTCTTCCC CATTCATTGT TGCTATGCAA AATTCAGGAA
CTAAGGTTTT GCCAGATGTC TTCAATGCTG TAGTATTGAT GACAATTGTC TCAGCTGGAA
ATTCTAACGT TTACTGCGGA TCACGTATTT TGTATGGTTT AGCACAATCT GGTGTTGCAC
CCAAAGTTTT TAAGAAGACC ACAAAGCAAG GTGTACCATA CGTTTCCGTT ATTTTCACTG
CCTTGTTTGG TTTATTGGGA TTCTTGGCCA CTTCCTCTAG AGGCAATGAA GCCTTCAATT
GGTTGTTGAA CATCATTGCT ACTTCTGGAA TGATTGCCTG GGGTTTTATC AGTTACACAC
ACATTAGATT CATGAATGTG TTGAAATCCA GAAACATCAG CCGTGACTCT TTGCCGTTCA
AGGCCAAATT TATGCCTTTC AATGCTTACT ATGCTGCAAT TGCTATTTTC ATTTTGGTAT
TTGTCCAAGG GTTCCAATGT TTCTTTGATT TCACTGCTAC TGGTTTCTTC ACCGCATATG
TTTCGCTTAT TCTTTTTGTT GTGTGTTTCA TAGTGTTCCA CTTCGTTTTC AACGGCTTTG
GAAGAAAATC CTTCGAATGG TCATCATTCT TGGTGCCTCT TGATGAATGT GATATTGATA
CTGGAGTAAG AGAGATAGAA GAAATGGAAT GGGAAGAGAA CGAGCCAACA AATATATGGC
AGAAATTCTG GAGCTGGGTT TCGTAA
 
Protein sequence
MDNDYEDAED AEDEGEDEEE DEDEVEDEDS EVIPKDEIKR SLKARQISMI AIGGSIGTGL 
FISTGSTLAD AGPVSALISF LFMTTLAFSV TQSLGEMATL IPVAGSFTQF VTRWCSPALG
AANGWNYWFS WAITFALELS VVGQIIQKWT DAVPLEAWIS IFFVLLTVFN LFPVKYYGEV
EFWIAFIKVI AVFGWIIYAF VMVCGGGETG PVGFRYWRNP GPWGPGILVE NKNTGRFLGW
LASLISAAFT FQGTELVGIS AGESSNPRKT VPSAIRKVLF RILVFYVLCM LFIGLLVPYD
DPKLSANGSY TASSPFIVAM QNSGTKVLPD VFNAVVLMTI VSAGNSNVYC GSRILYGLAQ
SGVAPKVFKK TTKQGVPYVS VIFTALFGLL GFLATSSRGN EAFNWLLNII ATSGMIAWGF
ISYTHIRFMN VLKSRNISRD SLPFKAKFMP FNAYYAAIAI FILVFVQGFQ CFFDFTATGF
FTAYVSLILF VVCFIVFHFV FNGFGRKSFE WSSFLVPLDE CDIDTGVREI EEMEWEENEP
TNIWQKFWSW VS
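
The sequences above are printed in space-separated groups of ten residues, so they need to be joined before any downstream use. The following is a minimal sketch, assuming the blocks are pasted in as plain text. The helper names `normalize_sequence` and `gc_fraction` are hypothetical and do not belong to any database tool. It shows how to collapse the grouping and compute a GC fraction comparable to the "GC content" field.

```python
def normalize_sequence(block: str) -> str:
    """Remove all whitespace (spaces, newlines) from a grouped sequence block."""
    return "".join(block.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in an already normalized DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in dna if base in "GC")
    return gc_count / len(dna)


# First row of the gene sequence block, copied from the record.
first_row = "ATGGACAATG ACTACGAGGA CGCCGAAGAT GCCGAAGACG AAGGTGAGGA CGAAGAGGAG"
dna = normalize_sequence(first_row)
print(len(dna), round(gc_fraction(dna), 3))
```

Applied to the full gene sequence block, `len(normalize_sequence(...))` should agree with the Gene Length field if the block was copied completely.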