Gene Information |
Locus tag | PICST_55251 |
Symbol | |
ID | 4837434 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009042 |
Strand | + |
Start bp | 2142773 |
End bp | 2143669 |
Gene Length | 897 bp |
Protein Length | 270 aa |
Translation table | 12 |
GC content | 41% |
IMG OID | 640388749 |
Product | predicted protein |
Protein accession | XP_001382636 |
Protein GI | 150863975 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG3663] G:T/U mismatch-specific DNA glycosylase |
TIGRFAM ID | [TIGR00584] mismatch-specific thymine-DNA glycosylase (mug) |
|
|
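The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes that coordinates are 1-based and inclusive and that the gene span includes the stop codon. Under those assumptions, the bases left over after the coding sequence is accounted for would correspond to intron sequence, which fits the "Is gene spliced | Yes" field.

```python
# Values copied from the Gene Information fields above.
START_BP = 2142773
END_BP = 2143669
PROTEIN_LENGTH_AA = 270


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def coding_length(protein_length_aa: int) -> int:
    """Bases needed to encode the protein plus one stop codon (assumed included)."""
    return (protein_length_aa + 1) * 3


def non_coding_length(start_bp: int, end_bp: int, protein_length_aa: int) -> int:
    """Bases within the gene span not explained by the coding sequence."""
    return gene_length(start_bp, end_bp) - coding_length(protein_length_aa)


if __name__ == "__main__":
    print("Gene length (bp):", gene_length(START_BP, END_BP))
    print("Coding length incl. stop (bp):", coding_length(PROTEIN_LENGTH_AA))
    print("Remaining (bp):", non_coding_length(START_BP, END_BP, PROTEIN_LENGTH_AA))
```

The computed gene length agrees with the "Gene Length | 897 bp" field. The remainder is only an estimate of total intron length, not an annotation of intron positions.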
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.0254148 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.277007 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCTGACA AGTTGAAGCT GCTTCAAGCT TTCAGATTCA AAGATAAGGA TGCCGAGAAA GTAGTCGCGA ATTCAGAGAA AATACAGGTT GACAAATTGG TTACCAGACC AGTTCCTCAT AAAGTCACTA AACCTACTAC GAAGAAAAGA GTCAAACCAA CTCTGAATAC ATACCCTGAA CAGTACAAAG ATGTCCGTCC CTCTCTAGTA GAGAATTTGA CTCTTCTATT TATAGGATTT AATCCCGGAA TGGAATCCTC ACTTCAACAA CATCATTATG CCCATTTCTC GAACTTGTTC TGGAAACTCT TCAACCAGTC GGGACTACTA TTGCAATGCT TGGGAGTACT AGACCCACAG TACTTACTGC ATAACTACGA TAATGATGAA TTACTACAGG TCCTCGTTAA AGATGGGACG ACGTACGCCA AACCAGAACA CGACTACGAG CTTATAAAGT ACAAGATCGG GTTTACAGAT TTGATATTGA GATGTACACG AACGGCACAG GAACTTCCAA TGGCCGAGAA GCTTGCCAAC GTCCCCAGAT TGATAGACGA GTTCAATATG TCTAGCAGCA AACACATTGT GTTTATTGGC AAAGGAATAT GGGAGGTTAT AGTGAAATAT GTGGAAGTAG AACTAGGAAT CAAGAAGGTA AAGCTTTCGA AAGAGACTTT TATGTGGGGA CTCCAAGACT CTCAGAGTGT ATCAGCTCTG AGAAGTGATA AAGACTCCAG GTTATATGCT CTAGTGCTAA AGAAGTTCCA GCTGAAGATA TCCGCTGATT CTAAGGTCTT TGTTTTTCCA AACACGTCTG GGTTGGTAGG TTCGTTGAAG TACGAGGAGA AATTGAAGTT GTGGCAGGAT TTGGCAGATG TCATCTCAAA AACTTGA
|
Protein sequence | MSDKLKSLQA FRFKDKDAEK VVANSEKIQK RVKPTSNTYP EQYKDVRPSL VENLTLLFIG FNPGMESSLQ QHHYAHFSNL FWKLFNQSGL LLQCLGVLDP QYLSHNYDND ELLQVLVKDG TTYAKPEHDY ELIKYKIGFT DLILRCTRTA QELPMAEKLA NVPRLIDEFN MSSSKHIVFI GKGIWEVIVK YVEVELGIKK VKLSKETFMW GLQDSQNSRL YALVLKKFQS KISADSKVFV FPNTSGLVGS LKYEEKLKLW QDLADVISKT
|
| |
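The record lists translation table 12, the alternative yeast nuclear code, in which the codon CTG is read as serine rather than leucine. The sketch below illustrates the effect on the first ten codons of the gene sequence above. It is a minimal pure-Python example, not the pipeline that produced this record: the helper names are hypothetical, and the table is built from the standard code with only the CTG reassignment applied.

```python
from itertools import product

# Standard genetic code in the conventional TCAG codon ordering
# (first base varies slowest, third base fastest).
BASES = "TCAG"
STANDARD_AAS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def build_table(ctg_as_serine: bool) -> dict:
    """Return a codon-to-amino-acid map; optionally reassign CTG to Ser."""
    codons = ("".join(triplet) for triplet in product(BASES, repeat=3))
    table = dict(zip(codons, STANDARD_AAS))
    if ctg_as_serine:
        table["CTG"] = "S"  # the reassignment used by translation table 12
    return table


def translate(dna: str, table: dict) -> str:
    """Translate complete codons from the start of dna; ignore trailing bases."""
    usable = len(dna) - len(dna) % 3
    return "".join(table[dna[i:i + 3]] for i in range(0, usable, 3))


# First 30 bases of the gene sequence in this record (spaces removed).
FIRST_CODONS = "ATGTCTGACAAGTTGAAGCTGCTTCAAGCT"

if __name__ == "__main__":
    print("Standard code:", translate(FIRST_CODONS, build_table(False)))
    print("Table 12     :", translate(FIRST_CODONS, build_table(True)))
```

With the CTG reassignment, the seventh residue is serine, which matches the start of the protein sequence listed above (MSDKLKSLQA). Because the gene is spliced, translating the full gene sequence directly would not reproduce the protein; introns would have to be removed first.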