Gene Information |
Locus tag | PICST_83723 |
Symbol | |
ID | 4839230 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009045 |
Strand | + |
Start bp | 814833 |
End bp | 816902 |
Gene Length | 2070 bp |
Protein Length | 557 aa |
Translation table | 12 |
GC content | 41% |
IMG OID | 640390545 |
Product | predicted protein |
Protein accession | XP_001384821 |
Protein GI | 150865554 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0420] DNA repair exonuclease |
TIGRFAM ID | |
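The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that a coding region needs three nucleotides per residue plus one stop codon. The function names are illustrative and not part of the database. Under those assumptions the coordinates give the listed 2070 bp, while 557 aa would need only 1674 bp. The 396 bp difference may correspond to flanking non-coding sequence included in the reported span; that is an interpretation, since the record does not say so.

```python
# Values copied from the record above.
START_BP = 814833
END_BP = 816902
PROTEIN_LENGTH_AA = 557


def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a coordinate span, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def coding_length(protein_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode a protein, optionally counting a stop codon."""
    return 3 * protein_aa + (3 if include_stop else 0)


if __name__ == "__main__":
    gene_bp = span_length(START_BP, END_BP)
    cds_bp = coding_length(PROTEIN_LENGTH_AA)
    print(gene_bp)           # 2070, matches the Gene Length field
    print(cds_bp)            # 1674
    print(gene_bp - cds_bp)  # 396
```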
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.0000478126 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.12365 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | AGACCCTATA AATTGCCATT GGAAATATTT GTCTAGTCTA ATATTTGTAT ATTTAGGTGA TTTCAACCTC TTTAAGCCGC CTTAGTACAG CCTATTTTCA CAAATAGTAG CACTGAAGAT TTATTAAACT ACGAAGTATT CACTTTCTGT TTTCACTACG ATTTTCATTC GGCAATACTT AATCCGCATA TTCCTACGAA GTTTGCCAAT TTAACCAAGC ACCTTCATTC TAATTGTGGA GACTACTTCA GTTGCTAGTC TTATCTAAGG AACGACTATA GACTACTGTG AAATTTCTAT ATATTCAACC CAGGGCAAAC CCACTAGACC AAACACGCAT AATGAAGTCA TGTCGTACAT AGTAAAGAAA ATATTCATTC CCAAATCGTA CCGTAAGGTG CTCGTCACCA TATTCCTGGC CCTTACTATG GTATTGATAT ATAGCTACCT AAGAGCATCG ACCGCGGTGT ATCCAGATGT CAAGCTATTA CAGAACAATT TCGACCAGCC ACATACCAAA CTAGGAACGG TGATCACCGA TATCAAGATC ATCAAATGCT ACTACCGGAA TTGCAAAGCG CCCGTGGGCT ATTCCAAGAT CCAGCCTCCC ATGAATTACT ACGAAACAAC TGTAGACTCT TCTGCTACCA CCAAATTGAC ATCTCCCTTG TATTATATCG TCATTAAGGA GCAGCCCGTA GACACTGCCA CCAACATCCT TTTAGATCTC ACGTTTGACA AGCCTAAGGA TGACGCTGGG TTCGAACTCA TCAAAAGCGA CAACTACCAG TTGTACAAGA AGTTTTTCAG CACGAACATC AAGAACCCCA TTCCAAGTGA TTTGCCTTTG GTGCATTCAC TTGATTTGCT CTTTGGTTCA AACGACCTCA TCGACTCGCG TGTGAACCAT TTCTCTGTTC ACTCCTCAAT GACGGAAGAA AAGATTCATC CCATCATCTC GATGTTCAAA TTGCCTCAAT CGAAACAAGA CATCTGGACG CAAGATACTT CGCAATTTGT GATGATGCAA GAGACAAGCA TTCTTCGTAT TGATGAATCC GTGACCAAGT TCAAAGTGAT TCAGATGAGT GACTTGCACT TTGGACAAAG TCTTGGTAGA AAATGCGGTA AGGATCAGGA GTTGTGTACT TCAGACTTGA AAACCTTGAA ATTCATGGAG GACAGTATAC ACAAAGAGAA CCCAGATTTG GTGGTAATCA CAGGTGACTT GATAGACGTA GACCGTAGTG TCGACTATAA GTCGATAATT CTCAAGTCGT TACAGCCTAT ACTACAGACA AATACCAAGT TCATATTCAC TTTTGGTGAT GAGTTTGACG GCCAGGAAAA CCTCAGAGAG ATCAAATTGT CCTTGATTAA GTTTTTGCAA ACGTTGCCCA ATTGCTACAA CACTATAGAA GGAATTGACG ATAGTTTGCA TGGAGTTACT AACTACAACT TGAAAGTGAT AAGAGGCGAA AAGGAAGTAG CTCATGTTAC TGTTTTTGAC TCCGAAGATA AATATCTTGA TGAAACACAG ACCAATTTCT TGTACCGTAT CCATGCCGAG GACCCTGAGA AATTGTTTAA GTTGTTATTC TTCCATTTTC CCATTCCTCA GTTCCGTCCC ACTGGAAAGT TCAAGATAAT TGGAAGTTAC AATGAGAAGC ATCCTCTCAA TTCCAAGACG AAGCCGCAGG TGCTTGATGA TATCCGCAAC TGCGGTTATC AAGTAGTCAG TGTAGGACAC GAACATGAGA ATGATGCCTG TCTCCTAAAC GAAAAATCTA GTGCTTCAGG 
AGAACAGTCT ATCTGGCTCT GCTACAGTAG CGTTGCTGGA GATTCTGGAG TCACTGCTCT TGATGCCAAC TACGATCGTA AGCTCAGAGT GTATGAAATC GACTTTGAGA AGAGTATATT GTTGAGTTGG AAGAGAAGTG AAATGAAAAA GAAGGGATTT GACTACCAGC TGGTCTACAA GTTCCCATCC TTACCTGAGG CACCAAAGGA ACCCAAACCA TAGACATAAG TCGTTGAATC ATATAGGCAT ATGCAATCTA ATTATTATTT
Protein sequence | MSYIVKKIFI PKSYRKVLVT IFSALTMVLI YSYLRASTAV YPDVKLLQNN FDQPHTKLGT VITDIKIIKC YYRNCKAPVG YSKIQPPMNY YETTVDSSAT TKLTSPLYYI VIKEQPVDTA TNILLDLTFD KPKDDAGFEL IKSDNYQLYK KFFSTNIKNP IPSDLPLVHS LDLLFGSNDL IDSRVNHFSV HSSMTEEKIH PIISMFKLPQ SKQDIWTQDT SQFVMMQETS ILRIDESVTK FKVIQMSDLH FGQSLGRKCG KDQELCTSDL KTLKFMEDSI HKENPDLVVI TGDLIDVDRS VDYKSIILKS LQPILQTNTK FIFTFGDEFD GQENLREIKL SLIKFLQTLP NCYNTIEGID DSLHGVTNYN LKVIRGEKEV AHVTVFDSED KYLDETQTNF LYRIHAEDPE KLFKLLFFHF PIPQFRPTGK FKIIGSYNEK HPLNSKTKPQ VLDDIRNCGY QVVSVGHEHE NDACLLNEKS SASGEQSIWL CYSSVAGDSG VTALDANYDR KLRVYEIDFE KSILLSWKRS EMKKKGFDYQ SVYKFPSLPE APKEPKP
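The record lists translation table 12, the NCBI alternative yeast nuclear code, in which CTG is read as serine rather than leucine. The sketch below shows how that affects two short excerpts copied from the gene sequence above: the first 60 coding nucleotides, and a stretch near the 3' end that contains a CTG codon. The codon tables are built by hand from the standard NCBI ordering, and `translate` is an illustrative helper rather than any library's API. It only touches the CTG reassignment and ignores the other properties of table 12, such as its alternative start codons.

```python
# Standard genetic code in NCBI order (bases T, C, A, G at each position).
BASES = "TCAG"
STANDARD_AAS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
STANDARD_TABLE = dict(zip(CODONS, STANDARD_AAS))

# Table 12 differs from the standard code in one sense codon: CTG -> Ser.
TABLE_12 = dict(STANDARD_TABLE, CTG="S")


def translate(dna: str, table: dict) -> str:
    """Translate DNA in frame 1, ignoring spaces and any trailing partial codon."""
    dna = dna.upper().replace(" ", "")
    usable = len(dna) - len(dna) % 3
    return "".join(table[dna[i:i + 3]] for i in range(0, usable, 3))


# Excerpts copied from the gene sequence in this record.
CDS_START = "ATGTCGTACATAGTAAAGAAAATATTCATTCCCAAATCGTACCGTAAGGTGCTCGTCACC"
CTG_EXCERPT = "GACTACCAGCTGGTCTACAAG"

if __name__ == "__main__":
    print(translate(CDS_START, TABLE_12))          # MSYIVKKIFIPKSYRKVLVT
    print(translate(CTG_EXCERPT, TABLE_12))        # DYQSVYK
    print(translate(CTG_EXCERPT, STANDARD_TABLE))  # DYQLVYK
```

The table 12 results agree with the listed protein sequence, which begins MSYIVKKIFI PKSYRKVLVT and contains DYQ SVYK near its end. The standard code would place a leucine at the CTG position instead.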