Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_66905 |
Symbol | |
ID | 4837310 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009042 |
Strand | - |
Start bp | 1024510 |
End bp | 1026351 |
Gene Length | 1842 bp |
Protein Length | 577 aa |
Translation table | 12 |
GC content | 41% |
IMG OID | 640388625 |
Product | predicted protein |
Protein accession | XP_001382970 |
Protein GI | 126132890 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG3104] Dipeptide/tripeptide permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.425094 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.705777 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCTAACT TATCGGACGA GAAACTCCCC ACAGATTCTG AATTGCAAAA ACAGGATAGT GACATCCTCC GCGACCCCAG TGTGATCTCC AATGACATTG ATGATGAGGG TAGAGAATTG CCATCTGAAG AGGAAATGAA GACCTTGAGA CATGTCTCTG GCAACATCCC CTTAAGATGT TGGTTAGTTG CAATTGTCGA ATTGGCAGAA AGATTCTCCT ACTATGGTTT ATCTGCTCCA TTCCAAAACT ATATGCAAAA CACTCCAGAA GATTCACCAA AGGGTATCTT GGGTTTGAAT CAGCAAGGTG CTACAGCATT ATCATACTTC TTCCAATTTT GGTGTTACGT TACCCCAATC TTTGGTGGTT GGTTGGCTGA TACTTACTTG GGAAAATTCA ATACCATCTT TGTTTTCTGT ATTGTCTACA TCATTGGTAT CTTCATTTTG TTCATTACAT CCATTCCTGC CATCACCTCT AAGACGACTG CTACTGGTGG TTTTATTGCT GCTATTATCA TAATTGGTTT TGCAACCGGT GGTGTCAAGT CTAACGTTTC CCCATTAATT GCCGATCAAG TTCCAAAGGT AAAACCACAC ATCAAGGTTT TGAAATCTGG AGAAAGAGTC ATTGTCGACC CTCACATCAC TATCCAGAAT GTTTTCATGT TCTTCTACCT TATGATTAAT GTTGGCTCTT TGTCAGTCAT CGCTACCACT CAATTGGAAC ATCACGTTGG ATTCTGGGCT GCCTACTTGT TGCCATTTTG TTTCTTCTTC ATCGCTCTTG CTGCTCTTGC CTTGGGAAGA AACCAATACA TTAAGACCCC TGTCAGTGAC AAGATCGTCA ACAAGACCTT CAAGTGTGCC TGGATTGGTT TGAGAAACGG TTTTAACTTG GAAGCTGCCA AGCCATCCAA CAACCCAGAG AAGAATTACC CATGGAGTGA CAAGTTTGTT GAAGAAGTCA GAAGAGCCAT TTACGGTTGT AAGGTGTTTG TCTTTTACCC TATCTACTGG GTCACCTATG GACAAATGAC TAACAATTTC ATTTCTCAAG CTGGTCAAAT GGAATTGCAT GGCTTGCCAA ACGATATTTT GCAGGCAATT AACTCGATGT CGATTATTGT ATTTATCCCT ATTTGTGAAA GATTTGTTTA CCCATTCATC AGAAGATTCA CTCCTTTCAA GGCTATCACA AAGATCTTCT TTGGTTTCAT GTTCGCTACA GGTGCTATGG TCTATGCCGC CGTCTTGCAA CATTACATCT ACCAGGCTGG TCCATGTTAC AACTTTCCAA AAGCTTGTGC ACCTGAGTTC AAGACTGTTC CAAACCACAT TCACGTTGCC ATTCAAGCTC CTGCTTACTT CTTGATTGCC ATGTCAGAAA TTTTTGCCTC CGTTACTGGT TTGGAATATG CCTACACAAA GGCTCCAGTT TCCATGAAGT CGTTTATCAC TTCTCTCTTT TTGGTTACAA ACGCTTTCGG ATCTGCTCTT GGTATTGCTT TGTCATCCAC TTCTGAAGAT CCAAAGATGG TCTGGACCTA CACTGGTTTG GCAACTGCCT GTTTCATTGC TGGGTGGATC TTTTGGTTCT GCTTCAAGCA CTACAACTAC AAGGAAGATG AATTCAACAG GTTGGAATAC GCAACAGAAG AAGAATACAA AAAGCCTACC CTCGATGGTC TTCAGCCAAT TCCTTCTGCT AATTCATACA AGGGACTTGC TTAGTACTTC CCGCATGCGT CGTACTTATT TATACACATA TATACTATAT TCGTAACACC GACTTGTGTT TAATAGTCAG AGATCTTAAT GCATTCATGA TTGAAATTTT AG
|
Protein sequence | MSNLSDEKLP TDSELQKQDS DILRDPSVIS NDIDDEGREL PSEEEMKTLR HVSGNIPLRC WLVAIVELAE RFSYYGLSAP FQNYMQNTPE DSPKGILGLN QQGATALSYF FQFWCYVTPI FGGWLADTYL GKFNTIFVFC IVYIIGIFIL FITSIPAITS KTTATGGFIA AIIIIGFATG GVKSNVSPLI ADQVPKVKPH IKVLKSGERV IVDPHITIQN VFMFFYLMIN VGSLSVIATT QLEHHVGFWA AYLLPFCFFF IALAALALGR NQYIKTPVSD KIVNKTFKCA WIGLRNGFNL EAAKPSNNPE KNYPWSDKFV EEVRRAIYGC KVFVFYPIYW VTYGQMTNNF ISQAGQMELH GLPNDILQAI NSMSIIVFIP ICERFVYPFI RRFTPFKAIT KIFFGFMFAT GAMVYAAVLQ HYIYQAGPCY NFPKACAPEF KTVPNHIHVA IQAPAYFLIA MSEIFASVTG LEYAYTKAPV SMKSFITSLF LVTNAFGSAL GIALSSTSED PKMVWTYTGL ATACFIAGWI FWFCFKHYNY KEDEFNRLEY ATEEEYKKPT LDGLQPIPSA NSYKGLA
|
| |