Gene Information |
Locus tag | PICST_29646 |
Symbol | |
ID | 4837173 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009042 |
Strand | - |
Start bp | 766101 |
End bp | 767994 |
Gene Length | 1894 bp |
Protein Length | 618 aa |
Translation table | 12 |
GC content | 44% |
IMG OID | 640388488 |
Product | predicted protein |
Protein accession | XP_001382917 |
Protein GI | 150864192 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
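The coordinate fields above are mutually consistent if Start bp and End bp are read as 1-based inclusive positions, which is an assumption here rather than something the record states. The sketch below checks that reading and compares the gene length with the length implied by a 618 aa protein plus one stop codon. The helper names are hypothetical and used only for illustration.

```python
def span_length(start_bp, end_bp):
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def coding_length(protein_aa):
    """Nucleotides needed to encode the protein plus one stop codon."""
    return 3 * (protein_aa + 1)


# Values copied from the Gene Information table.
gene_bp = span_length(766101, 767994)   # matches "Gene Length | 1894 bp"
cds_bp = coding_length(618)             # 618 aa + stop codon
noncoding_bp = gene_bp - cds_bp         # bases not accounted for by the CDS

print(gene_bp, cds_bp, noncoding_bp)    # 1894 1857 37
```

The 37 bp difference is consistent with "Is gene spliced | Yes", although the record itself does not list intron coordinates.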
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.0379511 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCGTAC CACAGCCTCC GTTCAAAGTC AAAACCAAGG TGTCTTGGGC TGGAGAAGAG GATGGTGACT TGGGGTTTCT TGAAAACGAG ATTATTGAAG TGTACTCCAT CGTCGATGAA TCATGGTGGT CGGGAAAGCT CAGACGAAAC AGAGCTGAAG GTATCTTCCC CAAGGACTAC GTTGAGGTGA TTCCAGAGTT GAATAAATCA GTATCCAGTC ACTCCATTGG TACCCCCAAA GCACAGACTC CCGTCCAGAC TCCCATCCAG ACTCCCGTCA AAGAAGCTGA CTATGGTAAG TTCAAACATA GCAGAGCCAG CGGAACTCCT ACGGGAATGT ATAACTCTTA CAACGGTTCC AAGTCAGTAT CTCCCAAGAG ATACCACCAT GTCAATGTTG ATGCTGACGG CTCATTTGAA GTCGAAGATG CTGTCAATTG CACCTATGAT GGCATCTACG ATTCTTCTAA TGGCATGTCG CTGTCAAACC GAAGCTCACC CAAGAAAATC AGATCTGCCA ACAAGCTCGT TTCATCTACC CAACATCCTC GGTACCGAAA CAGCCATAAT GGGTACCAGC AGCAGCAACA CGAGGATTTT GACAGAGAAC GCGAAATGGA AAACTTTCGT ACCCTCCAAC AGCAACAGAA CTACCACATC AAGAAACTGC CTCTGCACAA TATGAAATCG CTGGTTCAAA CCCAACAGCA AGAATTCAGA CAATCCTCAT ATTCACCGGA AAATAGTAAC TCCATCTATG GAAAACATCA AGGAAAGTCC AATTCAAGAT CAGGCTCGCA ACAGAACTTG TCTCAATCTG CTGTCCAGGG ACATCCTCCG CAAGTAAAGT CGCAATCGTA TGTTGATATA CCCTACAGAA ATCAACAGAA TAACACCGTA GAAAGTTTTG TTTCCGAGGG ATCTCCCTCA CGTCAAGCTA GGAGGGCCAG ATACGACGCT GATGCCGTGA ATGAATACGA GATCATCTCC CAGAAAAGGG CCCAATTGGA GTTGGAGTTA CTGCAGTTGA AGCAGTTGGA AAGGTCTACC CAGAAGAAAA GAGTCCACAA CCCCCATCTT CAACACAAGC TGGGCCAAGT CAGTCGTGGC AGCAATGATG ATTCCTTAGT GAACTCATTA GAATCTAGCT ATATTAGCGA AGACCTCTTA TCTTCCAAGA AGAACTACTC TTCCAGAGAG GATCTTGGAA AGAAGCTTGC TAAGTTCGAA AGTGTTGACG ACGAAGATGA TGACTATTTC AACGATAACG AGTCGTCTCC ACCACCTCCT CCACCAAAAC ATACTACACC TATCAAGGCT ATTGAAGCTA TCAGAGACAA CGATAGTCCA CTTAGAAAAA GTGGAAACAA AGTTCCTTTT GATGCTGATG ACTTCAGGTT TTCTGGTTCA AATCGCCTCC ATCAGGGTCA AGTATCCGAA GAAGATGTCT ACAGAGTGTC CCAGCTTCAG CAGGATGACT TGAAGAACTC GATCAAGTCG TTGCAAAGTG ATGTCTTAAA TTTATCTGAA CTCAGTGCCA CAAGTGCTGG TTCTTTTATG AGACACAAGT ACGAGAGAGA CATCCAACAA CAAGAAATGA AGTTACACAG CTTATCCATC AATGAAGAAG AGGAAAAAAG ATCACCTAAG CAAAACGGGA AAGATGTTAT GGACTCCGTT TTCCAGGAAA AGAAATCGAG GCATCCTAAT ATTTTCAAGA TGTTGTTGCT GAAGAAGAGC GACAACGAGA TCAATGTTAT TGAAAGAAAG CTCCAAAAGG ACGAAGAAAT TGACTGGACC 
ACTTTCAAAA TGGATCTTAA TCGTATGAAC TCGTTGACTT CTCATGACAA ACAAGCCAGA ACCAGAAGAG TGGTCAGAGA AGAGGGCTCG TTGA
|
Protein sequence | MSVPQPPFKV KTKVSWAGEE DGDLGFLENE IIEVYSIVDE SWWSGKLRRN RAEGIFPKDY VEVIPELNKS VSSHSIGTPK AQTPVQTPIQ TPVKEADYGK FKHSRASGTP TGMYNSYNGS KSVSPKRYHH VNVDADGSFE VEDAVNCTYD GIYDSSNGMS SSNRSSPKKI RSANKLVSST QHPRYRNSHN GYQQQQHEDF DREREMENFR TLQQQQNYHI KKSPSHNMKS SVQTQQQEFR QSSYSPENSN SIYGKHQGKS NSRSGSQQNL SQSAVQGHPP QVKSQSYVDI PYRNQQNNTV ESFVSEGSPS RQARRARYDA DAVNEYEIIS QKRAQLELEL SQLKQLERST QKKRVHNPHL QHKSGQVSRG SNDDSLVNSL ESSYISEDLL SSKKNYSSRE DLGKKLAKFE SVDDEDDDYF NDNESSPPPP PPKHTTPIKA IEAIRDNDSP LRKSGNKVPF DADDFRFSGS NRLHQGQVSE EDVYRVSQLQ QDDLKNSIKS LQSDVLNLSE LSATSAGSFM RHKYERDIQQ QEMKLHSLSI NEEEEKRSPK QNGKDVMDSV FQEKKSRHPN IFKMLLSKKS DNEINVIERK LQKDEEIDWT TFKMDLNQPE EWSEKRAR
|
| |
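The record lists translation table 12, the alternative yeast nuclear code, in which the codon CTG is read as serine rather than leucine. The following is a minimal pure-Python sketch of how short stretches of the gene sequence above map to the protein sequence under that table. The function names are hypothetical, the code does not handle splicing, and the GC helper is a plain G+C fraction that may not match how the database derives its "GC content" field.

```python
from itertools import product

BASES = "TCAG"
# Amino acids of the standard code (table 1), codons ordered TTT, TTC, TTA, TTG, TCT, ...
STANDARD_AAS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def build_table(aas):
    """Map each of the 64 codons to its one-letter amino acid."""
    codons = ("".join(c) for c in product(BASES, repeat=3))
    return dict(zip(codons, aas))


TABLE_1 = build_table(STANDARD_AAS)
# Table 12 differs from the standard code in one sense codon: CTG -> Ser.
TABLE_12 = dict(TABLE_1, CTG="S")


def clean(dna):
    """Drop the spaces used to group the sequence in blocks of ten."""
    return "".join(dna.split()).upper()


def translate(dna, table):
    """Translate frame 1 of an unspliced fragment; a trailing partial codon is ignored."""
    dna = clean(dna)
    usable = len(dna) - len(dna) % 3
    return "".join(table[dna[i:i + 3]] for i in range(0, usable, 3))


def gc_percent(dna):
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First 30 bases of the gene sequence, as printed in the record.
print(translate("ATGTCCGTAC CACAGCCTCC GTTCAAAGTC", TABLE_12))  # MSVPQPPFKV
# An in-frame fragment from the gene sequence that contains a CTG codon.
print(translate("ATGTCGCTGTCAAAC", TABLE_12))  # MSSSN
print(translate("ATGTCGCTGTCAAAC", TABLE_1))   # MSLSN
```

The first output matches the start of the protein sequence, and the CTG-containing fragment gives the "MSSSN" stretch found in the protein only when table 12 is used.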