Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_67940 |
Symbol | CDB4 |
ID | 4839620 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009045 |
Strand | - |
Start bp | 1482492 |
End bp | 1483911 |
Gene Length | 1420 bp |
Protein Length | 383 aa |
Translation table | 12 |
GC content | 44% |
IMG OID | 640390935 |
Product | Curved DNA-binding protein (42 kDa protein) |
Protein accession | XP_001385287 |
Protein GI | 126137527 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0024] Methionine aminopeptidase |
TIGRFAM ID | [TIGR00495] 42K curved DNA binding protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCACCA AGACTGGTAA GTAGCGAACG GAATATTCCG AGAGAAATGG CTATTTTAGA GTGTTGTGAA ATTGGAGAGA AATAGTGAAA CTTGTCCAAT ACCTCACATA CTGAAAATTA TGTCCAACTT TTCATTACTG TTCATTGATA CTAACATTCA CAGCTCCCGC TGCTACCCCA GACTACACCA TTGCCAACTC CGACGTTGTA TCCAAATACA AGACTGCCGG AGAAATCACC AACCGTGTGT TGGCTCAAGT CATTGCTTTG CTTGTTGATG GCGCTACCAC CTACGAAGTC TCTTCCAAGG GTGATGAGTT ATTGAACGAA GAATTGTCTA AGATCTACAA CTCCAAGAAG GCTTCCAAGA CTCCAAAGGG CATTGCATTC CCTACCTGTG TGAATCCTAA CCACATCCCA GCCCACTTGG CTCCTGTGAG TGAAGATGAT GCTGGTAACA TTACCTTGAA AAACGGCGAT GTAGTTAACG TGATGCTCGG TGTCCAGCTT GATGGGTTTC CATCCATTGT AGCTCAGACT ATTGTCATTG GAGCTACTAA GGAATCTCCT GCTGAAGGTA ACAAGGCTGA CTTACTCCAC GCTGCCTGGA CTGCTTCTGA GGCTGCTATC AGAACTTTGA GACCCAAGAA CAAGAACTGG GACACCACCA ACGTTGTAGC CAAGGTCGCC AAGGAATTCG ACACTACCCC AGTCGAAAGC ATGTTGTCTC ATAACCAAGA AAGAAACGTG TTGTACGGTC CTAAAGAAAT CATCATCAAC CCTACCAAGC AGAACAAGAG CCAGATGGAA ACCTTCAAGT TTGAAGAAAA CGAAGTCTAT GGCTTGGACA TCTTAATCTC TACTTCTAAG GATGGAAAAG TCAAGCCTTC TGACTACAGA ACCTCCTTGT ACAAGTTGAC AGGTAACAAC TACTCTCTCA AGATGAAGTT GTCGCACAAG GTTTTGGCCG AATTCAAAGC TAAGTGCAAC AACCAGCCTT TCCCTTTCAA CATCAGAAAC TTGGACGAAC CTAAGAAGTC TAGAGGTGGT TTGGCTGAAC CTTCAAACCA CAAGGTCATC TTGCCATACG ATATTGTCAC CGAAAAGGAA GGCGAATATG TTGCCCAGTT CTTTACGACA GTTGCTATCA CCAAGAACGG TCTTGTCAAG TACACCCAAC CAGAGTTTGA CCCTGAGCTC TACAAGACCG AGAAGAAGGT CGAGGACGAG GAAATTGTGC AATTGTTGAC TGAGCCTTTG AGAATCAAGA AGCAGTCTAA GAAGGAAGAA GCTAAGTAGC TCTGTAGCCT TCCAAAATTG ACATGACATG ATAGTATCAA AACATCGAAT ACCAAGATAT CTCCAGTATC CAACATTCAT ATATCCAAAA TATCAGAATC CAAAATACGT ATCTAGCCAG
|
Protein sequence | MSTKTAPAAT PDYTIANSDV VSKYKTAGEI TNRVLAQVIA LLVDGATTYE VSSKGDELLN EELSKIYNSK KASKTPKGIA FPTCVNPNHI PAHLAPVSED DAGNITLKNG DVVNVMLGVQ LDGFPSIVAQ TIVIGATKES PAEGNKADLL HAAWTASEAA IRTLRPKNKN WDTTNVVAKV AKEFDTTPVE SMLSHNQERN VLYGPKEIII NPTKQNKSQM ETFKFEENEV YGLDILISTS KDGKVKPSDY RTSLYKLTGN NYSLKMKLSH KVLAEFKAKC NNQPFPFNIR NLDEPKKSRG GLAEPSNHKV ILPYDIVTEK EGEYVAQFFT TVAITKNGLV KYTQPEFDPE LYKTEKKVED EEIVQLLTEP LRIKKQSKKE EAK
|
| |