Gene Information |
Locus tag | PICST_32361 |
Symbol | |
ID | 4839541 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009045 |
Strand | + |
Start bp | 1313079 |
End bp | 1314776 |
Gene Length | 1698 bp |
Protein Length | 565 aa |
Translation table | 12 |
GC content | 46% |
IMG OID | 640390856 |
Product | predicted protein |
Protein accession | XP_001384943 |
Protein GI | 150865635 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG5207] Isopeptidase T |
TIGRFAM ID | |
|
|
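The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (the record does not state this, but it matches the reported gene length), and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information rows above.
start_bp = 1313079
end_bp = 1314776

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# The gene is not spliced, so the CDS should be a whole number of codons.
codons, remainder = divmod(gene_length, 3)

# Assumption: the final codon is a stop codon and is not part of the protein.
protein_length = codons - 1

print(gene_length, remainder, protein_length)
```

Under these assumptions the computed values agree with the "Gene Length" (1698 bp) and "Protein Length" (565 aa) rows.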
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.136395 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCTACAT CCAGAAAAAC CGTTCCTGGA GGAAAGTCCG TGCCGAATGC CGCTGGAACC TCGAATGGAA GTGTTTCTTC CAACGGCCAT GGTAAAAATG GTACTTCTGA ATTAGACTAT GAAAGGGAAT CCCAGCATAG GTTGCGACCA CAAGACAACT ACTCCACCAT CGCAGCCTGC TCCCACCTCA AGTCCGTGTT AGAATCATCA GCCCGAGAGA CGGCCCTCAT CACATACAGA CAAGCTGTCA ACATCTCTCG GCCAATCGAT AATGACCTTA TCTACACCGC GAAGAAGGAT GGCTCTGTAG TGTCGCACCA TCGTTTGTTA GTACGAAAAT CCTCATCCTT GCGGTGCACA GACTGTTCGC TCAACAACTT CCACCACAAT TTTACCTGTT TGCAGTGCCC GCATGTGGGC TGTTTCAATG ACGTCCACAA CCATGCTTAC ACCCACTATA AGCTCACCCA ACATGTCTTC GCTATCGACA GCCACTCGGG CTTGCTTTAC TGTTTTCCGT GCGGAACCTA TGTCAACCAT CCTGCTCTAG ATAAAGTCAG ACAGGAGGTG TTGTTGAGTG CTACAGACTA CAGCGATCTA ATTAAAAGCG AGGTAGATGA AGAATTTGAT TATAGTGATG TAGATGCTCA CTACTCAGAC CCCAGCCGCT TGGGCGTAGA CGGGTTGAAG GGCTTTGTCA ACTTGGGTGC CACTTGTTTC ATGAGTTCCA TCCTCCAGAC CTTCATCCAC AACCCCATCA TCAAAAACCA TTTTTTTAAC AACGACTTGC ATTACTTCAA CTGCGAAAAG AGCATGGCAC AGGGTTCGAC TCTCGACGAA AATAACGCAT GCATAACATG TAGCATCGAT AACATATTTC AGCTCTTCTA CACCTCTAAC AGCATTGAAG GCTTTGGAAT GACGAACCTC TTGACCACAG CGTGGTACAA GAAAAAGTCG TTGGCCGGAT TCCAAGAACA AGATGCCCAC GAGTTCTGGC AGTTTATCTT GAACGAGTTC CACTCAGACT ACGAAAGGAT CAGATCCAAC ACTGGTTTAA GTCCAATGTC AACTTCAGAC TGCAATTGCA TTACACACTC TACATTCTCA GGAGAACTAC AAAGCTCTAT AAGATGCCTC TCGTGCGAAT CTGTGACCAA GACTATCGAC CCGATGGTAG ACTTGTCGCT CGAAATCAAT CACTTGAAGC TGAACCATCC TGGAAGCCAG ATAGATTTGT ACGACTGCCT CGACCTCTTC ACCAGCGATG AGAAGTTAGA TGTTATGTAC ACCTGTCAAT CGTGTGGTGA CAAGACCAAG GCTATCAAGT CGTTGAGTGT CAAGTCGCTT CCGCCTGTTC TATCCATCCA GTTGAAGCGA TTCAAGCATA ATTCGTTGAA CGACACTTCG TCCAAAATCG AAACTCCTAT AAAGATTCCT CTCTATTTAA ACATGACTAG GTATTCTATA GGTCATGATC CCCACGATTC AGAGCAAATT GATGAAGACA AAATCTTCGA GCTCTTCGCC TTGGTGTGCC ACATCGGCTC GGTGAATACG GGCCACTACA TAGTACTCAC CAAAGATGGC AATGGCCAGT GGTTCAAATT CGATGACAGC GTTGTCTCGA TGGTTTCGCA AGAGGAGGTA ACCAATACAA ACGCATACTT GGTGTTCTAC ATCACCCACA AGATCTAG
|
Protein sequence | MSTSRKTVPG GKSVPNAAGT SNGSVSSNGH GKNGTSELDY ERESQHRLRP QDNYSTIAAC SHLKSVLESS ARETALITYR QAVNISRPID NDLIYTAKKD GSVVSHHRLL VRKSSSLRCT DCSLNNFHHN FTCLQCPHVG CFNDVHNHAY THYKLTQHVF AIDSHSGLLY CFPCGTYVNH PALDKVRQEV LLSATDYSDL IKSEVDEEFD YSDVDAHYSD PSRLGVDGLK GFVNLGATCF MSSILQTFIH NPIIKNHFFN NDLHYFNCEK SMAQGSTLDE NNACITCSID NIFQLFYTSN SIEGFGMTNL LTTAWYKKKS LAGFQEQDAH EFWQFILNEF HSDYERIRSN TGLSPMSTSD CNCITHSTFS GELQSSIRCL SCESVTKTID PMVDLSLEIN HLKSNHPGSQ IDLYDCLDLF TSDEKLDVMY TCQSCGDKTK AIKSLSVKSL PPVLSIQLKR FKHNSLNDTS SKIETPIKIP LYLNMTRYSI GHDPHDSEQI DEDKIFELFA LVCHIGSVNT GHYIVLTKDG NGQWFKFDDS VVSMVSQEEV TNTNAYLVFY ITHKI
|
| |
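The record lists translation table 12 (the alternative yeast nuclear code), in which the codon CTG is read as serine rather than leucine. As a minimal sketch of how the protein sequence relates to the gene sequence, the snippet below builds that codon table from the compact NCBI-style amino acid string and translates two short fragments copied from the gene sequence above. The names `CODON_TABLE_12` and `translate` are hypothetical helpers written for this illustration, not part of any library.

```python
from itertools import product

# Codons enumerated in NCBI order (T, C, A, G; third base varies fastest).
BASES = "TCAG"

# Amino acids for translation table 12. It differs from the standard code
# only at CTG, which encodes S (serine) instead of L (leucine).
AMINO_ACIDS_TABLE_12 = (
    "FFLLSSSSYY**CC*W"
    "LLLSPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)

CODON_TABLE_12 = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS_TABLE_12)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 0, ignoring spaces and any
    trailing partial codon. Stop codons are emitted as '*'."""
    dna = dna.replace(" ", "").upper()
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE_12[dna[i:i + 3]] for i in range(0, usable, 3))


# First 30 bases of the gene sequence, as spaced in the record.
first_fragment = translate("ATGTCTACAT CCAGAAAAAC CGTTCCTGGA")

# An in-frame internal fragment containing a CTG codon.
ctg_fragment = translate("CACTTGAAGCTGAACCATCCTGGAAGCCAG")

print(first_fragment)
print(ctg_fragment)
```

The first fragment reproduces the opening residues of the protein sequence (MSTSRKTVPG), and the second yields HLKSNHPGSQ, matching the protein row only because CTG is translated as S under table 12.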