Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_81056 |
Symbol | SIP2 |
ID | 4851915 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009068 |
Strand | + |
Start bp | 3171846 |
End bp | 3173982 |
Gene Length | 2137 bp |
Protein Length | 623 aa |
Translation table | |
GC content | 43% |
IMG OID | 640393623 |
Product | Sip1p-Gal83p family protein |
Protein accession | XP_001386934 |
Protein GI | 126276009 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.896245 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGAAACA ACACTTCAGT ATCAGCGAAT ACTTCCACGC GGAAGTCTGC TGCCGCCATT CTCCACCAAC AGAATCTCGA TCTGGAGTCA CAAGTGTCGG TAGCCGGTTC ACGAAAGTCG TCTTCAGGAA GAAATGCTAC AGCTAAAGGA GACACTTCGC TAGATGAAGA TTTCAGTGAT TTGATTCTCC ACCAAGTAAA ACGCCAGGAT CTGACGAGCT CGTCGGGATA CCTTCCCACG ACTAATATCC TGGCCGGTGG GATCATTCCC GTCTCCTCTA ACATCGGAAG CACGCCTGCT GTAACGAATA CTAATGCTAA CGCTAACCTT GCTCATAACA CTAATGCTGA TCCATATTAT TCAGCTACGT ATAATCAATT TTCAAATGAC CTTATCGAAG ACGAGTTCAA TGAGTTGAAT GATGGATTAT TGGATCAGGA ACCAGAAGTC TCTCGTAATA TCTTGCCCGA TGATGCTAGT GAAATCACTA CGGAATCTAC AAACAGCGGA GCAAACGACG AAACTCAAGA CAATATGGAT GTGGATGAGG ACCATTTCAA AGCTGTAACA GACCAATCCA ATACTGATTC TGACTTGGAC ATGAATCATG CATCTGGCTT GTCGAAAGTA GACTTCACCA AAGTGACACC AGCCAACCAG CAGCCTCATG AAGTTTCTGT AAAAGTAGAT AACTCATATG TTCACCAGAG TAGAAAACGA CACAATCGTT CGGGGAATGC TAGTGCCAGC AACGTCACCA GCAACCTCAT TCCAGTGGAA ATTAAATGGG TGAACTCGTC TCGAGAGGTC ATAAACAAGA TCTCTATCAT TGGCTCGTTC ACCAACTGGA GGGACAGCAT TCCTTTGTCG CTTTCACCTT TTCATTCGAA CGAATACGTG ACCACCTTAA ACTTACCTCT TGGTGTCCAC AAGTTGTTGT ATATCATCAA TAACGAATAC CGAGTCAGTG ACCAGTTGCC TACTGCAACA GATCTGGAGG GAATTTTCTT CAACTGGTTC GAAGTCATAG ACGAAGCCCA TCTCTTCAAT CATTCATTAA ATCAACCAAA TCATATCGGT GCTTCTACAG ACTACGATGC CAACATAATC TCTCCGCCAT ACTACGACTA CAAGACAACG TCGTCTTTCT CAGTAAACCA CCAGGTACAG CCTCAAACTG CTGGCAAGTT TGAAGTGGAC CAAATCAACA GAAAATCTAA CAGCTTCTTG GCCAAGATCT CAAAAGAGAA CTCGTCCAAC TTCGAACATG TAGAATACGC GGAAGACAAA AACGACGACA TGAAGGACAT ACGGATGCAC GAAGAAGAAC AAAAGAGTTC CGAGAACTAT CCATATGGAA GTAACAAAGG CTCAGTGCCA GGCTCAACTC AATATGTGCC ATATAGCATG TCGTCTTCTA CATCTTTGCG AACGCCCACA ACTGAGAATG TACCAAAATT AGAATATTCC AGTGACATTC CAGAAATGTT TCAAAACTAT GACTACTTCA AGAATAAGAG TCCAAACTAC GAACTTCCTG AACCACCACA GCTCCCAGCA CACTTAAACA ACGTGTTATT AAACAAAATG TCGCAGACAT CGTCTCAAAG CTCCCAGAGC CACATTAGCA ATTCACAGAC AGCTCACAGC TCTTCTTACG GTTCCACGCA TCATCAGAGC CTCAAACCTC CCAATGCTGC ATTTGTTTCA GAATCTAGCC CAACTAACCA GTCTCATAAC AAAAGACCCA CCTTAAGAAG AGCCGACAGC TCATACTACG CTTCAAACAA AGAATCCTAC CACCAGTCAA TTCCCAACCA CGTGATCTTG AACCATTTGA TGACGACCTC CATTAGAAAC GACGTCTTAA CAGTTGCTTG TATAACAAGG TACTCCGGTA AGTTCGTTAC CCAAATCATG CATTCTCCAG CAGATAAATG AGATGAATGT TATGAATGCA AATGCTGCGA ATGCTAAGGG AAAGTAGAAT AGTGGAATTA CTGGCTAGTG TTTTTGTATG TATGCTTATT ATTGCTTTAT TTAACTTGTA TTTTTCTCTC GTTCTCGTTG ACTAGCTTAC GCGGTTAATT CATTCAATTC ATAAATAAAA CATGAAATAA ATAACAATAC AAACTAG
|
Protein sequence | MGNNTSVSAN TSTRKSAAAI LHQQNLDLES QVSVAGSRKS SSGRNATAKG DTSLDEDFSD LILHQVKRQD LTSSTPAVTN TNANANLAHN TNADPYYSAT YNQFSNDLIE DEFNELNDGL LDQEPEVSRN ILPDDASEIT TESTNSGAND ETQDNMDVDE DHFKAVTDQS NTDSDLDMNH ASGLSKVDFT KVTPANQQPH EVSVKVDNSY VHQSRKRHNR SGNASASNVT SNLIPVEIKW VNSSREVINK ISIIGSFTNW RDSIPLSLSP FHSNEYVTTL NLPLGVHKLL YIINNEYRVS DQLPTATDLE GIFFNWFEVI DEAHLFNHSL NQPNHIGAST DYDANIISPP YYDYKTTSSF SVNHQVQPQT AGKFEVDQIN RKSNSFLAKI SKENSSNFEH VEYAEDKNDD MKDIRMHEEE QKSSENYPYG SNKGSVPGST QYVPYSMSSS TSLRTPTTEN VPKLEYSSDI PEMFQNYDYF KNKSPNYELP EPPQLPAHLN NVLLNKMSQT SSQSSQSHIS NSQTAHSSSY GSTHHQSLKP PNAAFVSESS PTNQSHNKRP TLRRADSSYY ASNKESYHQS IPNHVILNHL MTTSIRNDVL TVACITRYSG KFVTQIMHSP ADK
|
| |