Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_31039 |
Symbol | |
ID | 4838197 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009043 |
Strand | + |
Start bp | 1590341 |
End bp | 1592263 |
Gene Length | 1923 bp |
Protein Length | 640 aa |
Translation table | 12 |
GC content | 39% |
IMG OID | 640389512 |
Product | predicted protein |
Protein accession | XP_001383589 |
Protein GI | 126134129 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.0000183375 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.324549 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGAAAAC TCAAGAAAGG AAGAAGAAAC CAGAAAGCCA GACTCAATCC CATTGGAACC AACCCCAAGG CATCCAGCAA GGATGAGAAA AGAGACGAGA ATACGAGACA GTCCAAGATC TTGCCCTTGA TCAACAAGTT GAAGTCTACA GTAGCAAATG ATAGATCTAT GGCTTTGGGT GCTATTACTG TTTTGGCTGA AGACGATAGG ATGAGATCGT TGTTATTGAA AGAAAAGCTA GTTAGTATTG TCATGGAACA GTGTTTGAAC GATGCCAACG ACGAGATCAT AGTAGAATCA TTCGGCTTGT TGAGAAATTT GGGAATTGAG GAGGGCTATG ACGTGTTGAA ATACTACTGG AGATCCAATA TTTGGACTGC TATTGAAGCT GCCATAGCTA AAATTCAAAC TTCATACAAG TATTTGGTAG AAAATGGTGC CAATCCAGCA GGTAATAAAA ACGACAAGTC CAAAGTTCAA TTGTTGTACG ATTTCACGGA GAATATTTTG TCGCTTATCA TCGTATTGGC CAGTGGTTCA GATGATTTGT ACGAATCCAT TTTCGACAAG ATCGATCCAA TCTTGAAATT TGTCTCTGAC TTGCTAACTT CAGAAATTAG CACGACTTCT GGATTCAAGA TTTCCACTAA ACTTTTCAAC TCCTTATTAG AATTCATCTA TGAATTCTCC ACTGAGTCAG AAGATTTTAT CAGGAAGTTC AGTTTGGACA TCAACTTCGA TTTTGCCAAA TTGTTTGCAT ACGTAGAAGC CAAAGAGTCC CATTACAACA ACCTCACTAG AGTATATATC GAAGGTATCA AGTTCAACAA TTTCGAAGTA TTGAGCCAAG GAGAAAACAA GTACGAGGTC GGCACCCAAT TCTTGTCCAA CATCTTCGAA ATCATCGTAA CAACTGATTT AGAACTTCTC AAGAAAAACA TACATGCCAT TAAAAACCCA GACAATTCTA GCAAGCCTAT TCAGAAGGAT GCCAACACTT TGGAAACGGA CTTGTCGAAG GGTTACGACT TAGCATTACA AACAAAATTA GAATTGACCA CATTAGAAGT GGCCCTAGAC TTGATTACTT CTTTGTTGGA GTACTTGTCT ACCAACGATA ACGATGAACC TCTCAATTTG CCTACACCAA CAGTTGAGGT CTTATTAAAC AAGATCTATC CAAGTTTAGT CGAATTGAAC AACTTTGAAA ACGCCAATAA AGGTATGTTA TCATTGCGTG AAAAGATCTT GGTAGCATTG AACAATTTGA CGTGGCTCAT GTTGTCTAAC GAATCGCTTC CAGTTGCTTG GTTCGAAAAG TCACTTGCAT TATGGGATTT GATCATAAGT TCAGCAAATA CTTCTGACGT TGTTTTACAG AAAAACTGCC TCAACGTATT ATGGGGTATC ACTAAGTCGC TCGGGCCAGA AGTTAGAAGC AAATTGCAGC CTTCTATCGT TGACGAATTG ATTGCAAAAT GCCAAACTAT TATGGAAAAG CAAAATGAAG CCGACGAAAC TGATATAGAG TTGTACCTTT CTATTGTTGG TTTCCTTGGT AACTTGGCTC CAGTCGTAGG CAACACTCAA ATCACGTCTA AAATTTCCCA GTTCTTGATT ACTACCATCG AATTATTATG CAACTCTACA GTTACTACTG TCACCAGCCC GGCCAAGATA GAAGTTGTAT TGGAGGCCAT AAACGTCATT TACGAGATTT TCGGTGATAT CGAATTTGAC TACGACTACG AGATCTTTGT GCAACAAGGC TACTTGCAAA AATTGTCCCT TTTGGAACCA AAGGTCAAGG AATTGTACAA GAAGATCGAT AAAAATAAGT ATCCACATCT CAAAGCTAAA GGTGAAGAGA CGTGGATTAA TTTGGGCAGA TTTATCCAGT ATAAACAGAG TGAAAGAGCG TAG
|
Protein sequence | MGKLKKGRRN QKARLNPIGT NPKASSKDEK RDENTRQSKI LPLINKLKST VANDRSMALG AITVLAEDDR MRSLLLKEKL VSIVMEQCLN DANDEIIVES FGLLRNLGIE EGYDVLKYYW RSNIWTAIEA AIAKIQTSYK YLVENGANPA GNKNDKSKVQ LLYDFTENIL SLIIVLASGS DDLYESIFDK IDPILKFVSD LLTSEISTTS GFKISTKLFN SLLEFIYEFS TESEDFIRKF SLDINFDFAK LFAYVEAKES HYNNLTRVYI EGIKFNNFEV LSQGENKYEV GTQFLSNIFE IIVTTDLELL KKNIHAIKNP DNSSKPIQKD ANTLETDLSK GYDLALQTKL ELTTLEVALD LITSLLEYLS TNDNDEPLNL PTPTVEVLLN KIYPSLVELN NFENANKGML SLREKILVAL NNLTWLMLSN ESLPVAWFEK SLALWDLIIS SANTSDVVLQ KNCLNVLWGI TKSLGPEVRS KLQPSIVDEL IAKCQTIMEK QNEADETDIE LYLSIVGFLG NLAPVVGNTQ ITSKISQFLI TTIELLCNST VTTVTSPAKI EVVLEAINVI YEIFGDIEFD YDYEIFVQQG YLQKLSLLEP KVKELYKKID KNKYPHLKAK GEETWINLGR FIQYKQSERA
|
| |