Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_70645 |
Symbol | |
ID | 4836654 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009042 |
Strand | - |
Start bp | 1027188 |
End bp | 1029097 |
Gene Length | 1910 bp |
Protein Length | 570 aa |
Translation table | 12 |
GC content | 41% |
IMG OID | 640387969 |
Product | predicted protein |
Protein accession | XP_001382971 |
Protein GI | 126132892 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG3104] Dipeptide/tripeptide permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.0413957 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.317652 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTACAATCCT TATTTTATTG TAATGTCCGT GAGTGCTCAA GACGAAAAGG TGGAATCCTT AGACAAGCAA TTCTCCAGTT CCGAGACTGA TCTCCAAGCT GCTACCAACC CTAGCGTTGA GGATGAAGGC AGAACTCCAA CTGAAGATGA AATGAAGACT TTGAGACATG TCTCTGAATC TATTCCTATT TCTTGTTGGT TAGTTGCAAT TGTCGAATTG GCAGAAAGAT TCTCCTACTA TGGTTTATCT GCTCCATTCC AAAACTATAT GCAAAATACC CCTCAAGATT CACCAAAGGG TGTCTTGGGT TTGAACCAAC AAGGTGCTAC AGCCTTATCT TACTTCTTCC AATTTTGGTG TTACGTTACC CCAATTTTGG GTGGTTGGAT TGCTGATACT TACTGGGGAA AGTACAAGAC TATCTTTGTT TTCTGTGTTA TCTACATCGT CGGTATTTTC ATTCTTTTCA TCACTTCTCT TCCTTCAATT ACTAGCCGTA CTACTGCTCT TGGTGGTTAC GTGGCTGCTA TCATCATTAT CGGTCTTGCT ACCGGTGGTG TCAAGTCAAA CGTCTCTCCT TTGATCGCCG ATCAAATTCC AAAAACTCAC CCTGTTATTA AGGTATTGAA GTCTGGTGAA AGAGTCATTC AAGACCCTAA TATTACCATT CAAAATGTTT TCATGTTCTT CTATCTTATG ATTAACATTG GTTCCATGTC TGTCATTGCT ACCACTCAAT TAGAAGCTCA CGTTGGTTTC TGGGCTGCTT ACTTATTGCC ATTTTGTTTC TTCTTTATTG CCCTTCTTGC CCTTGTTCTC GGCAGAAACC AATATGTCAA GGTTCCAGTT GGTGACAAAG TGATCAACAA AACCTTCAAG TGTGCCTGGA TTGGTTTGAG AAACGGTTTC AACATGGACG CTGCTAGACC ATCCATGAAC CCAGAAAAAG AATTCCCATG GAACGACCAT TTCGTTGATG AAGTTGTCAG ATCTGTTTAC GCTTGTAAGG TTTTTGTTTT CTACCCTATC TACTGGGTTG TTTACGGTCA AATGTTGAAC AACTTCGTCT CCCAAGCAGG TCAAATGGAA TTGCACGGTT TGCCTAACGA TATCTTGCAA GCTATCGATT CGCTTGTCAT TATTATCTTC ATTCCTATCT TTGAAAGACT TGTATACCCA TTCATCAGAA AATTCACTCC TTTCAAGGCT ATCACTAAGA TTTTCTGGGG TTTCATGTTT GGTGCTGGTG CCATGGTATA CGCCGCCGTC TTGCAACACT ACATTTACAA GACCGGCCCA TGTTACGACC ATCCAAAGGC TTGTGCTCCA CAATACCTTA ACGTTCCAAA CCGTGTTCAC GTTGCCATTC AAGCTCCAGC TTACTTCTTG ATTGCTATTT CTGAGATTTT GGCTTCTATT ACTGGTTTGG AATATGCCTA CACAAAGGCT CCAGTTTCCA TGAAGTCGTT CATTATGTCC CTTTTCTTGT TGATGAATGC TTTCGGATCT GCTCTTGGTA TTGCTTTGTC ATCCACTTCT GAGGACCCCA AGATGGTCTG GACCTACAGT GGATTGGCAG TTTCTTGTTT CATTGCCGGT ATTGCTTTCT GGTTGTGTTT CAAGCACTAC AACTACAAGG AAGACGAATT GAACGCTTTA GAATACGATG ACGAAGAAGA AAGAAACATT GTGCCTGTTA CTTCATTGTC ACACTCCGTC AAGAGTCTTG CATAAGGAAA TTTGACATTT CTTTCTACTT TACGACTATG CAAATCCGAG AGGACTACGC AATCGAGCGT TTTCATTGTA TTTATAAGGA CAGGCTTTCT TCATTCATAA CATTTCATTC ATAAAACTTC TATAGAATGT CCTCACATCT CTGTATAATA ATTAATAAAA TCAAGCATAA
|
Protein sequence | MSVSAQDEKV ESLDKQFSSS ETDLQAATNP SVEDEGRTPT EDEMKTLRHV SESIPISCWL VAIVELAERF SYYGLSAPFQ NYMQNTPQDS PKGVLGLNQQ GATALSYFFQ FWCYVTPILG GWIADTYWGK YKTIFVFCVI YIVGIFILFI TSLPSITSRT TALGGYVAAI IIIGLATGGV KSNVSPLIAD QIPKTHPVIK VLKSGERVIQ DPNITIQNVF MFFYLMINIG SMSVIATTQL EAHVGFWAAY LLPFCFFFIA LLALVLGRNQ YVKVPVGDKV INKTFKCAWI GLRNGFNMDA ARPSMNPEKE FPWNDHFVDE VVRSVYACKV FVFYPIYWVV YGQMLNNFVS QAGQMELHGL PNDILQAIDS LVIIIFIPIF ERLVYPFIRK FTPFKAITKI FWGFMFGAGA MVYAAVLQHY IYKTGPCYDH PKACAPQYLN VPNRVHVAIQ APAYFLIAIS EILASITGLE YAYTKAPVSM KSFIMSLFLL MNAFGSALGI ALSSTSEDPK MVWTYSGLAV SCFIAGIAFW LCFKHYNYKE DELNALEYDD EEERNIVPVT SLSHSVKSLA
|
| |