Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_80808 |
Symbol | SUR1 |
ID | 4850902 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009068 |
Strand | - |
Start bp | 378770 |
End bp | 381634 |
Gene Length | 2865 bp |
Protein Length | 802 aa |
Translation table | |
GC content | 40% |
IMG OID | 640392610 |
Product | putative zinc finger protein |
Protein accession | XP_001387717 |
Protein GI | 126273862 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.10969 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | AATTAAGAGA GAAGGATACA CCGAGTGCTT AGAAGAGTTA GAGAGTAGTA ATCAAATCTA CGCCACTTCC GAAGTCATAT AGTAAGTGCA CGTATACTTC CATCACAACT TCGGCCAATT CTAAAGATAA CGGTTATAAA AAAGTCCGAA CTGCAATCAG GAAAAGTTTG ATTTGGTTCG GGTAACGATA CGCATTCACA GAGGTTAGTT GAACTTCCTA TTCTCAGGTC AAACTTTTTT CCGTGTAATT TTCATTCATT ATAAAATAGT TTTTTTATAT TAAATTCATA TCGTTCTTCT TATTGCTGAT CAACGTTGAA TAAAGTGGTA CATTGGAACG AAATTTCTAA ATTGAATATT AATTACCGCC GTCTGCCATG AGTTCCATTT CTGTTTATAT TTCGACAGAC TCTGGAGGTT CTGGTTCTAC GGCTTCAAAT AATTCTGGTA CGTCTTTCGG GGCTTCTTCA GTTGATTCTA GTGTTGGTCA TGATCTGGTT CCGAGCAAAA ACGGAGCTTC TGACGCTACA GAGCTTCAGT CAGCCTCGGG TTCCAGTTCT GTTTCAGAAA GGTCGGACTC TAAGTTCACT TCCTTGACTT CGTCGTCAAA TGGCTCGGTG TCGCCGTTGG TGGAATCTGC CAATGTAAGT TTGCCAAAGT CATTCTACCA GAAGCTCACC AAAAGTGAAC AGCTCAACAG CTCAATAATT ACCCATGGTT ATGTACATGG ACATATTCAC AAACACGGAG ACCATACACA TATCCATGGC CATATCCATA ATCATGACCA CGACCACCAT AACAGCGTGA CCAAGCAACC ATCTACGACT ACGGTAGCCG ACGAATCTTG TCAGGAATTT GACGATCTCG ACTTGTGCAC TGACATTTTC TGCGATGAGT TGGACGACTG CTTCTTCTTG AACTGTGACG ACTCAAAAAA CTCCTCTTGT CAGAACCACG GATGTTGTGA TAACTCAGGA GACTACTTGG AAGAAATATG CTGTAACGAC ATCCACTGCA TTGAAGATAG CAGTTCTGAT ATACCTGCTG ATAAAAATAA TACTGACTTC TGCTGTGGCA ACTCAAATTG TCCCAGCTAC TCTGTCTGTC ATGGTGGCAC TGTCTCGTCG ACAAGCACTA CGGAAGCTAT TTGCAACGAT CCTCAGTGTG TTTCAGATTC CAACACGTCT ATAGTCTATG ATTGTTGCGA TACGACAACT CCAGGATATC CGAGTAACCA TCAGAATAAT CTATGCGATC TACAATTATC TAAGAAGCCC ATGTTTGCAA ACTTAATCAA TGGCGTCCAT CAAAATTTGG ACCAAATAGC AACGGAATCA GACGAAATAC AATTGCAAGC TGCAAAGAAA AGAAAGCTAG CCGATAAAAA CTTTGAAATC CATTTCCCAC ATCATTGCCA CCATTCCGAT AATCCAGAGC AATCAACAGG TCCAGTTTCT GATGAAGCTC CGGGCCATCA TCATTTCCAC CAATCCTGTT TTCACACAAC AATTCAAAAT GATTCCACTG CAGTAGCCTC TGATTCAGAA CCTGCTAACA AGTTGATGTC TGATTTTGAC TTCTACATTC AGTTCAATAA TTTCAATCAA TTTTTGAACA ACTCGCAACC ACTGCAAAAT AATAATCGAA CATTCCAGAA ACAAGAACCC TCTATTGAAT ATTTGCATAA TTTTCAAGGA GCGCCATTCG AGAGCAGCTC TGAACTGTAC TCTTGTAAAT GGGACAAGTG TTTCGAAAAA TTGACTGACG ATACATTTTT GAAGCATTTA ATCGGACAGC ACATTGGCCA AGAATATGGC ATGCCCACCA ACAATAATTC GAACAATAAT TCGAACAATA CAGTCAACCC AGCACAGACA TTATATCAAT GTGAATGGAA CGACTGTGAT TACATAAACT CCGATCTAAA TTCATTGATA GACCATATTG TCACACACAA GGGAGACAAG ATCCATGAAA TATATTCGCA AGATTCATTG GTTCCTCTCA AGAGTCAAAA TCATCTTTTG ACCCCAAGTT CCTCGAAATC TGGCTTATCC TCTCCTACAT TAATGACATC TCCACCAATA TTACCAAATA GTGTGGTAGA AAAGAAAGAA CCATCTTCTG AGCCTCGAAA ATTCGACGAC CTCAATATTA CGTCTATAAA AATTATGCCG AAAAAGAATA GTAGCAGCAG TATTCATCCA GAAGATCCCA ACTTCACTTG CAAGTGGCAA ATTGGTGTTG ATAGTGACAG AAATCCAATT CCCTGCAACA AAATTCATGC CAATGCGGGA GAATTACAAC AGCATTTGGT GGATGAACAT ATTGGCTCAG GAAAATCCAT CTACAGCTGT GACTGGATTG GCTGCGAAAG GCATAATGGG AAAATGTTCA CCCAGAGACA AAAGGTCTGG AGGCATATCC ATGTTCATAC CAATTTCAAG CCATGTAAAT GTGAAATATG CGGAGCTACT TTTGCTGTTG ATTCGATGTT GAAACAACAT ATGAGAACGC ACTCCGGTGA AAAACCATTC AGCTGTTCCA TATGTGGAAA GAAATTCGCA ACCAGTTCAT CGTTGTCTAT TCACAACCGA GTTCACACAG GAGAAAAGCC GTTGGCGTGC AAATGGCCTG GCTGCAATAA GCGGTTCAGT GAAAGTTCAA ACTTGACAAA GCACATGAAA ATACACTTTA AGACCTTCAA GTGTGAGATT TGTGACGAAG AATTCGAGAA GAAACCCGAT TTCACTAAGC ATATGAAGTC GCATAAGGTT GAAGGTGAGG ATTTCGAAGA TATGGAATTA AAATCGCAAT TGTCAAATAG CTGAAATCTA CAAGGAAATG TACATCGCTA AACCT
|
Protein sequence | MSSISVYIST DSGGSGSTAS NNSGTSFGAS SVDSSVGHDL VPSKNGASDA TELQSASGSS SVSERSDSKF TSLTSSSNGS VSPLVESANV SLPKSFYQKL TKSEQLNSSI ITHGYVHGHI HKHGDHTHIH GHIHNHDHDH HNSVTKQPST TTVADESCQE FDDLDLCTDI FCDELDDCFF LNCDDSKNSS CQNHGCCDNS GDYLEEICCN DIHCIEDSSS DIPADKNNTD FCCGNSNCPS YSVCHGGTVS STSTTEAICN DPQCVSDSNT SIVYDCCDTT TPGYPSNHQN NLCDLQLSKK PMFANLINGV HQNLDQIATE SDEIQLQAAK KRKLADKNFE IHFPHHCHHS DNPEQSTAPG HHHFHQSCFH TTIQNDSTAV ASDSEPANKL MSDFDFYIQF NNFNQFLNNS QPLQNNNRTF QKQEPSIEYL HNFQGAPFES SSELYSCKWD KCFEKLTDDT FLKHLIGQHI GQEYGMPTNN NSNNNSNNTV NPAQTLYQCE WNDCDYINSD LNSLIDHIVT HKGDKIHEIY SQDSLVPLKS QNHLLTPSSS KSGLSSPTLM TSPPILPNSP RKFDDLNITS IKIMPKKNSS SSIHPEDPNF TCKWQIGVDS DRNPIPCNKI HANAGELQQH LVDEHIGSGK SIYSCDWIGC ERHNGKMFTQ RQKVWRHIHV HTNFKPCKCE ICGATFAVDS MLKQHMRTHS GEKPFSCSIC GKKFATSSSL SIHNRVHTGE KPLACKWPGC NKRFSESSNL TKHMKIHFKT FKCEICDEEF EKKPDFTKHM KSHKVEGEDF EDMELKSQLS NS
|
| |