Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_54094 |
Symbol | |
ID | 4837650 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009042 |
Strand | + |
Start bp | 1235443 |
End bp | 1237866 |
Gene Length | 2424 bp |
Protein Length | 769 aa |
Translation table | 12 |
GC content | 42% |
IMG OID | 640388965 |
Product | predicted protein |
Protein accession | XP_001382454 |
Protein GI | 150863842 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TCTACGTCTG GTAGTTCGTC TAAACCGAAA ATGAAAAAGA GAAAGTATTC CCGTGGTGGC TGTCGTGAGT GTAAAAGGCG GAAGATGAAG TGTGACGAAG GAAAGCCACA TTGCCATAAC TGTTCCCGGC TACTGAAGGT CTGCGTCTAT GAACAGAAAC AGAAGTTCCG ATTCGAAAAC ATCCAGACCG GTGGAACTAA TGACGGCGAT GAAAATCACA CTCTGAACCA AGAGCAAGAT AATGTCCACA ATATTCAACA TCTTCAGAGT GTTCAAAATC ACAGTCACAA AAACCAGAAC CAAAAGCAGC TCTCTCACGA ACAATTAATA CTGCCAAACG ATCAGAATAT AAGCGTACGA GGAATGATTC CAACTAATGG GAATGCGATT ACTATGCGTT TCTACAAGCC TGAGGATCGC GAGTCCGGGA ACTTCTCATC TTCGAACACT GAAATAGATA CTCATCCAAC CAACAATTCG AACCAGCACC TTTTGCAGAC GGACACTCCT CGAGGATCCC AGCCGAGGAC TCCCGGATAC AATTCCAATA CCTCGTCGCC CTACAACAAA AGAGTACCGT TCAAGAACAC TATATCAGAT ATACTTACTC CTACGGAAGA CGGAGTCACT ACGCCAGGCA TAAATAATGA CCAATTTAAT GTAGAAGTTA TCCAGACGCT TTTTGATGAT GCGCTGGTGT TGGTGAACGA TATAAATGGC TTTGTAGCAC TCGATTTTGC TGACTCAGTT ACAGGAGCTC TTGGAAATCT GGACATGCAT TACAAGACAG AATCCATTGC TGAAAACGAT TCCAACAAGT CAGAAAGTTA TGAAAACCAC CACTTCATCA TGGACGAATT TCTGAACAAA TTGAACGTTA ATACAAGCCC ATACGAAGAA AATGGAACTG ACCTGCTGCA GTCTTTCTTC ATTAAAGATG ACATAGACTT GTCGATATCG AATAGTGAAC TCGTTCACAA AACATTACAG CACTACGACT TGTCTGGACC TCATATCACC TACTTAAATT CACTTACCAA AACCGACTTG TCGTACCATA TGTTCCCGTT TGCATCATCT GTAGAATCCA ACGAAGTAAT TAAGATTCTC TTGAGGTATC TGAACGGTTG CCCCTACTTG TTAACTTCGT TGCTTGCCAT ATCTGCGACT TTCCAGTTCA ACCAGACTGG GAAACAGGTG CACGACTACA GTAGGAACAA ATATATAAAA GTGTGCTTGA AGACTCTAGG CGAGGCTTTT GCCAACAGCG ATAGACTCCC CAACAAACTA GCCAACAACA TTGAGCGATT GCTTCTCACT GTATTAGTAC TCACTTCGAA CTTTACGGCT ACAACTTACA GTCAGAATGG TGATTTGCTC AACTCTTGGA AAACACATTT GCGAGGAGCC AAGGACTTGA TGCTCAAATA TAGCCAGATA GAGAAGAATC AGAAAATCGT GTGTAATTCC GCAGGATTGG CTCTAGCTAA GTCGTGGTTC TTTGCTATCG AGTCAATGGC AGGTTTGAAT TCATCTTTGG GAGGGACATT GACTCAGAAA TCAGGTGAGC TTCCGGAAGA CTCTGATGCC AAAGATGACT GTAACAATAG CTTGTTTTCT GCCACGGGAT ACTACAACAG AGAAAGAAAT CCAAAATACC ACGATGCTCT TGTTGAGATC GGTTTGTTGA CCGAGTCCCA GAATAGATCC CTGACTCCAT TCAACCTTTT CATCGGATAT TCGGTGCAAG TGATAGCATT AATACAAGAA TTTATTAAGG CCCTAGATTT CTTGAGGGAC AACCATGGTG GGCAATTGGA TTCTTCAAAA ATAGCCAAGA TTATGTCATT GATCCATGAA ACAAGAAAGA ATGAAATCGC CCCTCAGGTT TCCAAAAAAA CTTATATCAT TCCTCCAACT AGTCCAGCAC ACCCCAACTA CCCAAAGAAC AGAGCTGATG CTGTCGTATT ACCTCTGTCC GCTTATGACC GACGCATAAG TTCGTCAGGA GAAGAAACGG TGTATTCTTG GTTTGATCTC AGTGAACAAT TACATGTAGA CGTCTTGTAT CTACGATTGT TATGTACCAA AGGTTTGCTC AAGCTTCCAA GATGTCACCG CCTCGTCAAA GATCTTGTTT CCAAGATATT GAAATCAACA TTCTTCATCT CGCTGAAGGA CTCAGAGGAG TATATAAAAG ATTCGCAATC CAACGACAAT TTGGTTGAAA CAGAACATTT TTATTTATCA AAGAAAGCAT TTGACACTCG GACCATCATG ATTCAGTCTC CTCTCAAACT TTGTTCAAGG TTGGTAGACG ATGAATTGGA CTTTGAGAAA ATCGAGATTT TCTTCCTTGG ATTAATCAAA CTAGGAAATG GAAGTGCGTT GACTTCACTT GACTTACTCT ACAAGTTTAG AGAA
|
Protein sequence | STSGSSSKPK MKKRKYSRGG CRECKRRKMK CDEGKPHCHN CSRLSKVCVY EQKQKFRFEN IQTGGTNDGD ENHTSNQEQD NVHNIQHLQS VQNHSHKNQN QKQLSHEQLI SPNDQNISVR GMIPTNGNAI TMRSQPRTPG YNSNTSSPYN KRVPFKNTIS DILTPTEDGV TTPGINNDQF NVEVIQTLFD DASVLVNDIN GFVALDFADS VTGALGNSDM HYKTESIAEN DSNKSESYEN HHFIMDEFSN KLNVNTSPYE ENGTDSSQSF FIKDDIDLSI SNSELVHKTL QHYDLSGPHI TYLNSLTKTD LSYHMFPFAS SVESNEVIKI LLRYSNGCPY LLTSLLAISA TFQFNQTGKQ VHDYSRNKYI KVCLKTLGEA FANSDRLPNK LANNIERLLL TVLVLTSNFT ATTYSQNGDL LNSWKTHLRG AKDLMLKYSQ IEKNQKIVCN SAGLALAKSW FFAIESMAGL NSSLGGTLTQ KSGELPEDSD AKDDCNNSLF SATGYYNRER NPKYHDALVE IGLLTESQNR SSTPFNLFIG YSVQVIALIQ EFIKALDFLR DNHGGQLDSS KIAKIMSLIH ETRKNEIAPQ VSKKTYIIPP TSPAHPNYPK NRADAVVLPS SAYDRRISSS GEETVYSWFD LSEQLHVDVL YLRLLCTKGL LKLPRCHRLV KDLVSKILKS TFFISSKDSE EYIKDSQSND NLVETEHFYL SKKAFDTRTI MIQSPLKLCS RLVDDELDFE KIEIFFLGLI KLGNGSALTS LDLLYKFRE
|
| |