Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_33214 |
Symbol | |
ID | 4840516 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009047 |
Strand | + |
Start bp | 10475 |
End bp | 11830 |
Gene Length | 1356 bp |
Protein Length | 451 aa |
Translation table | 12 |
GC content | 45% |
IMG OID | 640391831 |
Product | predicted protein |
Protein accession | XP_001386023 |
Protein GI | 150866426 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTTGGAT TACAGAACAT TATTGACAAA ACCCTTGCCG AACAGTCTGG GCATTCTTTT AGAAGGTTTG TCGAAAGTTA TGCCACAGAT AGAAATGGTT ACAATAACGT AATGCTACAA CGAATGAGAG ACTGTCTGGC AGCCTGGCAT ACGTGTTTGG AGATGAGGAC TCTTCAGATA TTCTCGCCTA GTATATGGAT CGAGGACGAA CTACCGAAAT TTACGGGCCA GGAACTTATA GATGCCGGCA GACAGATGTT TGGGCCACAC TTTAAGTTCA AGAGGGACCA GCAACAGGCT ACTTCAGATG TGGCCAATTG CTTCAGTACG ACGGTGGCTA TTAACCTGGG AAAGACGACT TGTTGCCTCA TTGCGATGTT GGCGGAGCTT AATTTCAGTC GTCGCCAGTC CGATCGACAC AGAAAGATTT ACTTGGTTAC GATATTGGTG GTACCTTACG TTTCTACACT CAATTCCACC ATCCGGCAAT TAACAGACTT GGGGTTCAAG GTGCGTACGT TTCATGGGGC TGATGGACTG GTCACCGACT ACGATGTTCT TTTGATGCTG CTTGAAGACG GCCACCTTAG ATATCTTAAT CAGGTAGTCA GACATTTTGA GTCCGTTGCT CATCAAGGCA GGACCTATCT CCGGCGGGTG GTGGTTGACG ATGCCCATAT TTTGCAGATG AAAGATTTTG TGGTTGATGG AAACAGAAAT GTACCCGTCG TATTTTTGAC ATCATTCCTT CCCCAGGTTT CAGACAATTC GATGCGTACA TTATTTAGTC TCAATCGGTT GGGAAGGGTG TCGTCACAGG ATCCAATATT GCCTCACAAA GAATTCACTC TTGTCAAGAA GCCGAATGCC GCGTCAATAA ATCAGACGAT TTCATACAGA GTTAGGAGTT TGGGTTCGGC GCTTGTACTC GTTCAATCTT ATGAAAATGT GTTGTACTTG GAACATCGGT TGGCTGAGGA GGGGGTTCCA TGTATAGGTG TCTGTGGAAT TCCACAGCGC AGAACAGCTG CCTTAGAAAA GATGGCCAGT GAGAATATCA AGGTCGTGGT TACTACGGCA GAAGCCATGG TGGGTCTACA TCAGTTCGAG GCTACAACCG TGTTGTTTGC ATATTCGATT AGAAACCCTA TAGAATTGCT TGTAGCCAGT CAATTTAGCA GTGGGAAGGT GGAGATGGTT TTGTACAACA CCTGTTCGTC TGCGTTTCAA CGAGAATTGT CAAACAATTG TGTCAACTTA ATACTAATGA GACACATGCG TATGACGGAA GAAACCTGTT ATGATGCTGG CAAGGAGCAG TGTCATATTT GTGCCAACAA GAGAAGAAGA GGATAG
|
Protein sequence | MVGLQNIIDK TLAEQSGHSF RRFVESYATD RNGYNNVMLQ RMRDCSAAWH TCLEMRTLQI FSPSIWIEDE LPKFTGQELI DAGRQMFGPH FKFKRDQQQA TSDVANCFST TVAINSGKTT CCLIAMLAEL NFSRRQSDRH RKIYLVTILV VPYVSTLNST IRQLTDLGFK VRTFHGADGS VTDYDVLLMS LEDGHLRYLN QVVRHFESVA HQGRTYLRRV VVDDAHILQM KDFVVDGNRN VPVVFLTSFL PQVSDNSMRT LFSLNRLGRV SSQDPILPHK EFTLVKKPNA ASINQTISYR VRSLGSALVL VQSYENVLYL EHRLAEEGVP CIGVCGIPQR RTAALEKMAS ENIKVVVTTA EAMVGLHQFE ATTVLFAYSI RNPIELLVAS QFSSGKVEMV LYNTCSSAFQ RELSNNCVNL ILMRHMRMTE ETCYDAGKEQ CHICANKRRR G
|
| |