Gene Information |
Locus tag | PICST_67308 |
Symbol | PIB2 |
ID | 4837730 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009043 |
Strand | + |
Start bp | 850510 |
End bp | 853475 |
Gene Length | 2966 bp |
Protein Length | 836 aa |
Translation table | 12 |
GC content | 43% |
IMG OID | 640389045 |
Product | Zn finger protein |
Protein accession | XP_001383445 |
Protein GI | 150864573 |
COG category | |
COG ID | |
TIGRFAM ID | |
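The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the Start bp and End bp values are 1-based and inclusive, and that the coding region is the protein length plus one stop codon. The function names `gene_length` and `flanking_bases` are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def flanking_bases(gene_len: int, protein_len: int) -> int:
    """Bases in the gene span not accounted for by the coding region.

    Assumes an unspliced gene (as this record states) and counts one
    stop codon as part of the coding region.
    """
    cds_len = 3 * (protein_len + 1)
    return gene_len - cds_len


length = gene_length(850510, 853475)
print(length)                       # 2966, matching the Gene Length field
print(flanking_bases(length, 836))  # 455 bases outside the coding region
```

Under these assumptions the listed gene span is longer than the 2511 bp needed to encode 836 residues plus a stop codon, which is consistent with the gene sequence in this record starting upstream of the first ATG.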
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.106701 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | AGCAGCAGTC ACACAGAGTC GTCCCTCTGT TTCAGAAACT TTGTGGCTTG AGAGATTAGC TCATCGTCTA CTGACTCACC CGAACATCTA TAAGTTCAAT CGCTAGAATA CATATACTGG TTGTTTCACC AGGCATCTCC ATCAGTTAGC AATTAAACCA TAAGAAATTA TACAATAAAG TTTCTACAAT TCAAACCATC AACATATCCA ACATCTAAGT CCTTCAACAT TCAATATTTA CTTCTTCAAT ATAAAGTTTC TGGGTAATTG CTTCATATTT TACATCTATA CTCATTATAC AACATCTGAC TCTTCCTATC ATGTCCACCC CTCCGAGCAA ACAGCCGAAG TTCACTCTAG CGTCTCCCGA GAAAGACTCT AGCGACGACG AGCCAAGAGA AAACATAGCT ACTGTATTAG ACGAAACCAA TACAAATCTT TCCAATCTTT CCAACCCAAA TGCCAATACG AATCCGGATC CTACTAATAA CAATGCTATC AACGGCATCC CTAATGTTAA TTCTAATACA AATGGCAATA TCACTGCTAC CAACAACTCT CCTATCAACA TTATCACTGA CAACAATAAC GTAAACTCAG ACTCGGTAGA GTCTCCAGCT GACGACGAAA ACGGAAACGA AAACAAGCTC GACGGTTCTG ATTTGTCGTT CACCAAGAAA GATCCCCGGA AGTCACGTAC TCAGTCATTT CAGTCAGTGC TATCCACAGC TTCGTTAAAG TCATTGAAAC AACAGGCCCT AGCAAGTGTT CCCACAAATA ACAACGTCAA TATTGGCACT CGCAGCAGTA GTATTTCTCA CAACAATCTC AATAATCTCC ATAGCAACAG CAGTAATTCT AATCCCAATA TCAATGCTAC TGCTAGTCGC AACTTTCAGT CGTTCATCCA GGCTCCAGTT TTGTCTAGCA TCACGAATCT CAAGATGGAA GACGATATCG AGATCGGCCA GCAGTTGCCG TTTACTGAGT CAAAAGTGAC TTCTCAGCGC AGTCGTGATC GTGCTACGTC CTCCACAACT ACAGCTGCAA CTACAAATGC TGTTTCTTTA ACAAATACTA ATGCCACTTC TACTGTTCAG TCAAAATCTG CACTGGATCA TGATGAAGAA TATGAAGACA CGATTGTGCA ACAACAGAAG CTCACCTTGA ACGCTTTGAA GAAGCTCTCG TTGTCTCCTA TGCCTATAAA TACGGCTGAC AATAGTCTTA CAAGAAGACC ACTCAACAGA GCCAGTTCTA AGCCCAAGAT TGACATAGCT GAATCCAGCA AGGAATCAAT AAACTCTTCA ACATCTGCTC CAGGAAGCAA GACACGAACT CAAGAGCCTT ACCAGCCAGC AGAAGTAGAC TTGTCATCTT TTGCTAGTCT TACAAGACAA CCCAAAGTAG CGACGGAGAA GTTGCCTTCT CCAGGTGTTG CTTCCGCGTC GACTTCGGCA GTGACTGGGT TTGGTGAACG GGATTCTTCA ATCTTCAACC CTAATACTAG TACGACTTCA AATCCTGATT CGATCCCAAC TGGTACACCC AGCTTGTCTC AAAAGAGGTC ATTGCCTAGT TTACCAGAAG GAGAGCAGTT TGTAGAAAAT GTAGCAACTG AGTCAAATGC TGCTGTACGC AGCAATTCTG GTGATGTCAA TCAAACGTAC CAGCATCATC CACAAGCTCA CCAAGGTTCT CAGCAACAAT ACCAACATTC AAAGCAAGAC AGCCACCATA TGCAGCTTGG TAATAGGATC CCTTCTGCTG TAGTACCTCC CCAGAACATG AACACTAGAC 
GAGTTCCTCT GGGAGGATTC CAAGCGCCCA ATGTATACAA TTCTCATATG GCATCACAAA CACAACAGAA TCCACATCTT CATTACCCCA AGGCTAACAG GCAGTTGCAG CAGATTAAGG GATTCCGAAG TCCAATGTAT GTTCCAGCCG TGTTGCGGAT GTCTACTTTG AGTACTGTAT CACCTACTAA TTCAAATACG TCTGGATCTA ATCCCAACTC GCCTAACGAG TTGTCTACTT CTCCAAAGAA TGGTACACAC TACGAACACG ACCATTTGAA CTTGAACGAA CCTGCTGCTA CACCTAGATC GTCCAGTAGA GCCTCTGTAA AGTCGTTTGA TTCTGGAATC TCTGTAGAAT CGTCGAGCTC CACAACTAAC CAGCCAGGTT CGTCGCCTTT CCTCAGCTTG TTGGGCAAGA ATGGAAATCC CGAGTCATAC ATTTTCCGCG CTCCGCCTAC CAGAAAGCAT TGGGTTAAAG ATGAGGCCGT GTTAAAGTGT GGTATGCCCT TCTGTTCCAA AGTGTTCAAC TTCTTTGAAA GAAGACACCA TTGTCGGAAG TGTGGAGGGA TTTACTGTAA GGAGCATACG TCGCATTATC TCTATATCAA TCACTTGGCA CAGTTTACCA CGGGAGGCAG AGGTACTTTG TCTAAGGTGT GCGATTTGTG TATTGAAGAG TATAATGACT TCATACAGCA CGAGTTTGGA GTCAACATTG CACATTCGTC GTCGGAGAAC TCGTCTTACC ATTCTGCTGA ACACATTGCT CGTACTGCTA ATAGTAATAC TATTAGTAAT ACTGTTGCTA AGGGTAGTAG TGTTGGTGTT GGTGTTGGTT CAGCAGTAGA CACAGTTGGA CCCAGAAAGG ATACTAGATC GTCACAGACA CCCAATCCAC AGTATTTGCG GAATGGTATT AACCCAGGCA AGTCTCATTT GATTGGAGTC AATGCCAATG ACGAAACGAA CCAAAGAAGC GAACAAGCTG TAGGCAGTGT GCCTGCCAAC TGGAGTTGGA GTTCGTTCTG AGCTTAACTA CATCATTTAT TTAATTCATA CTGCAGAAGG ATCTGCCACG GAGAGCTTGA TATGACTCCA ATAATATGCA TTTTGATACT GAATAGAGGC GTTGTCATTT TATACTTGGT TTTATGAAGT TTTTCT
|
Protein sequence | MSTPPSKQPK FTLASPEKDS SDDEPRENIA TVLDETNTNL SNLSNPNANT NPDPTNNNAI NGIPNVNSNT NGNITATNNS PINIITDNNN VNSDSVESPA DDENGNENKL DGSDLSFTKK DPRKSRTQSF QSVLSTASLK SLKQQALASV PTNNNVNIGT RSSSISHNNL NNLHSNSSNS NPNINATASR NFQSFIQAPV LSSITNLKME DDIEIGQQLP FTESKVTSQR SRDRATSSTT TAATTNAVSL TNTNATSTVQ SKSASDHDEE YEDTIVQQQK LTLNALKKLS LSPMPINTAD NSLTRRPLNR ASSKPKIDIA ESSKESINSS TSAPGSKTRT QEPYQPAEVD LSSFASLTRQ PKVATEKLPS PGVASASTSA VTGFGERDSS IFNPNTSTTS NPDSIPTGTP SLSQKRSLPS LPEGEQFVEN VATESNAAVR SNSGDVNQTY QHHPQAHQGS QQQYQHSKQD SHHMQLGNRI PSAVVPPQNM NTRRVPSGGF QAPNVYNSHM ASQTQQNPHL HYPKANRQLQ QIKGFRSPMY VPAVLRMSTL STVSPTNSNT SGSNPNSPNE LSTSPKNGTH YEHDHLNLNE PAATPRSSSR ASVKSFDSGI SVESSSSTTN QPGSSPFLSL LGKNGNPESY IFRAPPTRKH WVKDEAVLKC GMPFCSKVFN FFERRHHCRK CGGIYCKEHT SHYLYINHLA QFTTGGRGTL SKVCDLCIEE YNDFIQHEFG VNIAHSSSEN SSYHSAEHIA RTANSNTISN TVAKGSSVGV GVGSAVDTVG PRKDTRSSQT PNPQYLRNGI NPGKSHLIGV NANDETNQRS EQAVGSVPAN WSWSSF
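The record lists translation table 12, the alternative yeast nuclear code, in which the codon CTG is read as serine rather than leucine. The following is a minimal sketch of how that affects translation, using two short fragments copied from the gene sequence above. The compact amino-acid strings follow the NCBI genetic code layout (bases ordered T, C, A, G); the names `build_codon_table` and `translate` are hypothetical helpers written for this illustration, not a library API.

```python
BASES = "TCAG"

# Amino acids for the 64 codons in NCBI order (TTT, TTC, TTA, TTG, TCT, ...).
AA_TABLE_1 = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
# Table 12 differs from table 1 only at CTG (index 19): S instead of L.
AA_TABLE_12 = "FFLLSSSSYY**CC*WLLLSPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def build_codon_table(amino_acids: str) -> dict:
    """Map each codon to its amino acid for a 64-letter NCBI-style string."""
    codons = [a + b + c for a in BASES for b in BASES for c in BASES]
    return dict(zip(codons, amino_acids))


TABLE_1 = build_codon_table(AA_TABLE_1)
TABLE_12 = build_codon_table(AA_TABLE_12)


def translate(dna: str, table: dict) -> str:
    """Translate DNA in frame 1; whitespace between blocks is ignored."""
    dna = "".join(dna.split()).upper()
    usable = len(dna) - len(dna) % 3  # drop any trailing partial codon
    return "".join(table[dna[i:i + 3]] for i in range(0, usable, 3))


# First ten codons of the coding region, as listed in the gene sequence.
start = "ATGTCCACCC CTCCGAGCAA ACAGCCGAAG"
print(translate(start, TABLE_12))  # MSTPPSKQPK

# An in-frame fragment from the gene sequence that contains a CTG codon.
fragment = "TCAAAATCTG CACTGGATCA TGATGAAGAA"
print(translate(fragment, TABLE_12))  # SKSASDHDEE
print(translate(fragment, TABLE_1))   # SKSALDHDEE
```

The table 12 result for the second fragment agrees with the listed protein sequence (SKSASDHDEE), whereas the standard code would place a leucine at that position.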