Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_34848 |
Symbol | |
ID | 4836761 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009042 |
Strand | - |
Start bp | 435200 |
End bp | 437215 |
Gene Length | 2016 bp |
Protein Length | 652 aa |
Translation table | 12 |
GC content | 43% |
IMG OID | 640388076 |
Product | predicted protein |
Protein accession | XP_001382847 |
Protein GI | 150864139 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.00044083 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACAAGCA GCTCGTTGAG ACATAGCATC GACAAGTTCC ACGAGTATGT CAAACCCGAG TATGTCAAGA AGGTGAATGA GTCACTATCA GTTTCTCTCA CGTTTATTGA GGATTCGCAG CCCGATCAGA CTATCCAGGA GTCAACAACA TGTGAATGTT TAATCAGTTT CCACAACGTT GATGAAAGAG GCATTACTGT CAATGAAACT GATGAGTCAT CATTGCGTCA GACTGAATCA AACGACTCTG GCTGGTTTGG AGGCTTTTTT GGCACAAACT CAAATAGTTC CGACTCACGA TTGTTAGTTA ACAGGACGTC TAGTGAGAAT GAGGACTTGA CTATATTTCT TGGATATATC CAGGTGGTTG GCTATGTAGT GCTTAATTAC AAGTTTGGGC TTAATAGTTC AACTTCGGCC ATTGATTCTG GCAAAAGTGC CCACTGGTGG GAGAATAAGG ATTACATCTA CAACTATAAA GATGCCGATG GAGAAGACGG CCAGTATGAA GATGAAAAAG CAGAGAAATT GGACCAAGTA CCCTATATAA AAGAAAATTT GGCTCATCCA CTAGTTGTTG GTGGAAAATT GGGTGGGGTC AGTGATTTAA TTGTAGAGCA TGGAAAGTCT TCTAATCACT GGATAGACGA AAAGGCTTTG TACTACGTCC ACGATCTTTT GTACCCTTTC AATTCTCGGG GACCTCCCAA TTTCAATCAA AATGCAGATC TTGACGGTAC TGCTGAGATC AAGATACCTG TCAAGGAGTT GTCAGACTCC ATCATACCAT TCTATTCTAC TTCGCAATCA CTTCTTTTCA CAGACTTACA TATCCCACCA AAGTCGACAA AGACATTCCA TATTAAGTTC CCCAGATCGA CCGACTTACC TCCCACATAC AATGCCAGGC TGACCGGACC AGTTTGTGAC CAGGGATTGG TCAGCATCAA GTACTCTCTA ATTGTAAGTC TTCTGCAAAG TTCTGGATCC AGTATGGATT CGCGTTCCAT TTACTTTCCG CTTAGCATCA AGGGCGAGAG AATAGGTTCG AACGAGAGAT ACATGCAACG GGACTACTTC GAATCGAAAA GTAAGATAGA TAAGAATTGG CAAGTAGAGG TCATAGAGGA AGAAGAAGCC ATAGAACCAC CCGTAGTGGG TAATGTTTCA GAGTCTCTTG AAACCAGAGA GGCATTCTTC GAAGACATTT CAAAATTGAT CAAATCGGAT TTGTACAATA TGCCAAAAAT GTCCACGAAC GAGAGAAAGA AGAGTATCCA TTCGTTAGAG TCCTGGGTGG ATGATGTACC TGTAGATGGG AAATATGTTC CACAGCTTCC GAGCCATTTG AAGACTCAGT TCCAGCTCAG AGTGAACAAT AATCAGCTTT GCCAAATTAG TTTATCGCGT CCATATTATC ATGTAGGAGA AGACATAAAC TTCATCTTGG ACATCAACCC CGAAGAACAT AGTCTCTCTA CAAAGGTAGT GGGGTTCATT GCACATTTGG AAGCTCATGA AATTTTCCAC ACAAACCACT CCAGTGCTCC ACAGGGCAAA GAGAACTCAG AAGCATTTAC GAATACCTAC CGAGTTTCAG GCAACATCAA GTACAATACA TTTATGCCCT CCCTTGCAAA TTCCATATTG CCAGATTCAG AACAGAGGCG AACTTTGATC AATGGATCGA TAAATATACC CAGACATTTA TCCCAGCAAT TCCAGAGCTC CAGTTTCATG GATCTAAAAT ACTTCATAGT GTTCAAGTTT AATTTGAACC AGTTCTCAGA ATTGCTGGAA GAGGTTAACG AAGAAGGTAA TGGGACCGAG GCGAACGGAA CCACCGGCCA CAACCCAACT GAACTTGTAG AAGAATCAGC CGATGTCGAG GCAAGCCTAC TTACTACTTC AAGTATAGTG CCTACGGATT CTGTACGCAC CAAAATCGAC GCCTTCAGGG CGAATAACTT TGGCACAGAG TTAAGGTTCA GACTTCCCCT ATACGTACTA CCATAG
|
Protein sequence | MTSSSLRHSI DKFHEYVKPE YVKKVNESLS VSLTFIEDSQ PDQTIQESTT CECLISFHNV DERGITVNET DESSLRQTES NDSGWFGGFF GTNSNSSDSR LLVNRTSSEN EDLTIFLGYI QVVGYVVLNY KFGLNSSTSA IDSGKSAHWW ENKDYIYNYK DADGEDGQYE DEKAEKLDQV PYIKENLAHP LVVGGKLGGV SDLIVEHGKS SNHWIDEKAL YYVHDLLYPF NSRGPPNFNQ NADLDGTAEI KIPVKELSDS IIPFYSTSQS LLFTDLHIPP KSTKTFHIKF PRSTDLPPTY NARSTGPVCD QGLVSIKYSL IVSLSQSSGS SMDSRSIYFP LSIKGERIGS NERYMQRDYF ESKSKIDKNW QVEVIEEEEA IEPPVVGNVS ESLETREAFF EDISKLIKSD LYNMPKMSTN ERKKSIHSLE SWVDDVPVDG KYVPQLPSHL KTQFQLRVNN NQLCQISLSR PYYHVGEDIN FILDINPEEH SLSTKVVGFI AHLEAHEIFH TNHSSAPQGK ENSEAFTNTY RVSGNIKYNT FMPSLANSIL PDSEQRRTLI NGSINIPRHL SQQFQSSSFM DLKYFIVFKF NLNQFSELSE EVNEEESADV EASLLTTSSI VPTDSVRTKI DAFRANNFGT ELRFRLPLYV LP
|
| |