Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_33431 |
Symbol | HMC4 |
ID | 4840448 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009047 |
Strand | + |
Start bp | 635382 |
End bp | 636530 |
Gene Length | 1149 bp |
Protein Length | 382 aa |
Translation table | 12 |
GC content | 38% |
IMG OID | 640391763 |
Product | hypothetical multicopy protein |
Protein accession | XP_001386131 |
Protein GI | 150866503 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.613875 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCAACT TATCTACAAA TGTTTTGTCG GCTGGTAGAT CGGCACCGCC AGAAAGTCAC GGCTCCAAAA CTCTTACTGT AGATAATATC GCAGAATTGA TTAAGATACA ATTAGAGGAT TATGAAGCTA AGTTTCTAAA GCTTTATGCT CAACAGCAAT CTCAAGTTAA TGAACTTATA CTGATTATAT TAACCAAGAA GAGCGATTCT GAGAAAGATA CCGGTGATTC CGCCATTACG AGTACTGATA GTGATATTAA TCTTATTACG AGTATTGAGT CCAGTTCTGT GCTTGCTGGT AAAGACATTA CTGAAAACAC CAGTGTTCCA CCTAAAATTG AAACCTTTCC ATCGCTTAGA ACAAGTCAAA CCACTGCTCT AGATGGATTC GATTTTTTTG AGCGAACTCA AAGGGTGATT AGAAAATCTC AGAAACATAT TCATGAATAT TGGAAAAAAT TACCCCCGCT CAATGAGACC AGCGCTGAAC TCTGGTCCAG GGCTATTCAG GATTTGAACA ACGAGATGGA TTATAGAGCC TTATCTAAAG CCAATTTCAA AGTCGACTGG AACACTTTCC AATCCAAAAC CGGACTCCGT GGTGATAAAT TAGAATATTT TTACGAGTGC TGGAAGGATG CTCTTATTGG ACGTTATCGC AATAACACTT TGCGTATCCT TGCTGTCAAT CGAGACCATA TTATTACTCT TGAAGATCTA CTTGAGTACA CATCGCAAAA TGCAGACTAC GACAAAACCA ATTCTATACT TGAGGAAGTG CAAAGACGCC GCCGAATCAA TCCAATGTGT CAAGACTATA TCTCAGAATT TAGAGGTACA AACATTCATG ATTATGACCG TATAATTCAA TTTCTTAACG GCCATCCCGC CGATCTCTAT TGTGCCATTA GTCACTTCTG TAACCAAAAA CATGAAGGCA ATCGTACTAT TGCTGCCGCT ACGGTCAACT TCTATTATCA GGATTTTATG ACCAAGGACA ATTTTCAATA CCCTTCAGTC AACGCTTTTG AAAAGAAAAT GAAAAGTACA CTTGGTTACT CTTGTAAATT TTATTCTGAT TCATCTAAAT TAAGAACGAA TTCAAAACAC AGAAGGGGTA AGAGTAATAG TAATCATTAT TCACAATAA
|
Protein sequence | MSNLSTNVLS AGRSAPPESH GSKTLTVDNI AELIKIQLED YEAKFLKLYA QQQSQVNELI SIILTKKSDS EKDTGDSAIT STDSDINLIT SIESSSVLAG KDITENTSVP PKIETFPSLR TSQTTALDGF DFFERTQRVI RKSQKHIHEY WKKLPPLNET SAELWSRAIQ DLNNEMDYRA LSKANFKVDW NTFQSKTGLR GDKLEYFYEC WKDALIGRYR NNTLRILAVN RDHIITLEDL LEYTSQNADY DKTNSILEEV QRRRRINPMC QDYISEFRGT NIHDYDRIIQ FLNGHPADLY CAISHFCNQK HEGNRTIAAA TVNFYYQDFM TKDNFQYPSV NAFEKKMKST LGYSCKFYSD SSKLRTNSKH RRGKSNSNHY SQ
|
| |