Gene Information |
Locus tag | PICST_83452 |
Symbol | |
ID | 4838642 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009044 |
Strand | + |
Start bp | 226615 |
End bp | 227853 |
Gene Length | 1239 bp |
Protein Length | 403 aa |
Translation table | 12 |
GC content | 48% |
IMG OID | 640389957 |
Product | predicted protein |
Protein accession | XP_001384004 |
Protein GI | 126134960 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTTCCAG GCTCCGGTCA CAACACCTAC GGAGGATACC CTCCTCCACA AGGTCCTCCT CCTAACAATA ATGGCTACAA CTCTGGCCCC AATAATAGCT ACAGGCAACA GGGCTATTCT CGTCCACAGG GACCTCCTCC AGGTCAGTAT GATCAACAAT CCCAGTACTC TCAGCAATCT CAGTACTCTC AACAGCCTCA ATACTCTAGA CCATCTGCTC CTCCTCAAGG AGGCACTGGC TATGGCGACC AGAGTCAATG GGGACGTCCT ACAGGGCCCC CTCCATCTGG ATCTCAGTCC TTCGGTCAGA ATTCTGGCTA CACGTTCCAA TATTCGAACT GTAGTGGGCG TAAAAAGGCG CTTTTGGTAG GAGTAAACTA CTTTGGCTCA CCAAACGAAT TGCGGGGCTG CATCAACGAC GTCAAGAACA TGAGTTCGTT TCTTGTTGAC CATTGGGGCT ACCAGTGGAA CGACATTGTC ATTTTGACAG ATGACCAGAA CGATATATCT CGAGTTCCAA CCAAGAACAA CATCATCAGG GCGATGCAAT GGCTTGTTAA GGATGCACGT CCTAATGACT CGTTGGTATT CCACTATTCT GGTCACGGGG GTACAACAGC GGACACGGAT GGAGACGAAG AATCTGGTTA CGATGACGTT ATCTACCCTG TTGATTTCCA GCAAGCTGGT CATATAGTGG ATGATGACAT GCATGCAATT ATGGTAAGAC CTCTTCCTCC TGGTTGTCGT TTGACGGCTT TGTACGACTC TTGCCATTCT GGAACCGCTC TCGACTTACC CTATGTGTAC TCCACTAAGG GAGTAGTCAA GGAACCTAAT TTGTTGAAGG ATGCAGGCTC GGATGCACTT AATGCATTCA TTAGTTATGA GCGAGGCAAC ATTGGAGGTG CCATTTCGTC GCTTACTGGA TTGGTTAAGA AAGTAGCCCG CCAAGGCTCT ACCAACCAGG ACCAGGTAAG ACAAGCTAAG TTCTCTGCAG CTGATGTGAT CTCGATTTCT GGGTGTAAGG ATGACCAGAC TTCTGCTGAT GCAAAGGAAA ACGGCCGAGC CACCGGTGCT ATGTCGTGGT CGTTCATCAA AGTGTTGAAC GAGCTCCCCA ACCAGTCGTA CTTGTCTCTT TTGAACAATA TGAGAACGAT CTTGGCGGCC AAGTACTCGC AAAAGCCGCA ATTGAGTTGT TCTCATCCTC AGGATATGAA CATTCAATTC ATCATGTAA
|
Protein sequence | MFPGSGHNTY GGYPPPQGPP PNNNGYNSGP NNSYRQQGYS RPQGPPPGQY DQQSQYSQQS QPSAPPQGGT GYGDQSQWGR PTGPPPSGSQ SFGQNSGYTF QYSNCSGRKK ALLVGVNYFG SPNELRGCIN DVKNMSSFLV DHWGYQWNDI VILTDDQNDI SRVPTKNNII RAMQWLVKDA RPNDSLVFHY SGHGGTTADT DGDEESGYDD VIYPVDFQQA GHIVDDDMHA IMVRPLPPGC RLTALYDSCH SGTALDLPYV YSTKGVVKEP NLLKDAGSDA LNAFISYERG NIGGAISSLT GLVKKVARQG STNQDQVRQA KFSAADVISI SGCKDDQTSA DAKENGRATG AMSWSFIKVL NELPNQSYLS LLNNMRTILA AKYSQKPQLS CSHPQDMNIQ FIM
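Note on the length fields: the following minimal Python sketch checks how the coordinate and length values in this record relate to each other. It assumes 1-based, inclusive coordinates and that the gene span includes the stop codon (the gene sequence above ends in TAA). The helper names `span_length` and `coding_length` are hypothetical and not part of any database API. The 27 nt left over is consistent with the "Is gene spliced | Yes" field, but this is an inference from the numbers, not an annotation taken from the record.

```python
# Values copied from the record above.
START_BP = 226615
END_BP = 227853
PROTEIN_LENGTH_AA = 403


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_length(protein_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode the protein, optionally counting the stop codon."""
    codons = protein_aa + (1 if include_stop else 0)
    return codons * 3


gene_len = span_length(START_BP, END_BP)    # matches "Gene Length | 1239 bp"
cds_len = coding_length(PROTEIN_LENGTH_AA)  # 403 codons plus one stop codon
non_coding = gene_len - cds_len             # nucleotides not accounted for by the protein

print(gene_len)    # 1239
print(cds_len)     # 1212
print(non_coding)  # 27
```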