Gene Information |
Locus tag | PHATRDRAFT_47392 |
Symbol | MAT1 |
ID | 7202438 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011681 |
Strand | - |
Start bp | 514658 |
End bp | 515788 |
Gene Length | 1131 bp |
Protein Length | 294 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002181741 |
Protein GI | 219122830 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | AGAGAGTGTA GCGTCTTGGT GAATAAGCAT TCTTGCTAGT CATGGAAGAG GATCCTCAAG ACGATCTTTT TCGATGCGCC TTGTGCGGGA CCGCGGAAGG TGATTCATCC AATCTATCAT CTCATACATC GCTACAAACG AATGCCACGG TCCGGTGTGG ACATCAATTG TAAGTAGAGT TACTTTGGCT GTTCTTGGAC AATAGTGTCT ATACCATCAG ACAACGTCAT CATTCCCATT GTGATGTATT GCCTGGCCAG CCGAAAACAA CAGCCGCAGG GGATTTCAGC ATATACGGAA AAGACTTGCT CCACGAAAAT ATCCGACACA CTTCTAATCT CGTTGGGCTC TGCTTCCACT GTCTATTCAC ACAGCTGTAA CTCCTGCATC GATCGAGAAC TCGTCCGCAA GCGTGAATTT CCCTGCCCTG TATGTCAAAC CCCCGTCAAA CGTGTGACGC TAACCGTCCG GAGTCTAGAC GATGTCCAGT GCGAAAAGGA CACATCTTGG CGTCGACGGG TACTGAAAGT CTTCAACAAA ACTGAGCCAG ACTTCTCTTC TCTGCTGGAA TTTAACAATT ATCTGGAACA AGTTGAAGAC ATGATCTACT CTATTGTCAA CGAAGAGCCA GATGCTGAAG CTTGTAAGGC GAAGATTAAG GAGTATGAAA ACGCTCACAA GACAGAAATC GTCATTCGGC AATCCCAACG CGCCGATGAG GAACGTTCCA TCCAGGATCG GATCGCCGCT GAGCAAAGAA GCACCGAGCG ACTCCGGCGT GAAGCGTTTG ATGAGGAAAA GGCTGTCGCT AACGCTAAGA AGCGTCTAAA GCAGGAAAGT ACGCAGGTGT TGCTAGGAGA ACGTGAAGAA GTATCGGCGG AGCTGCGACA AGCCCAGATG CAAGGGTACC GCAATGAGTT AAAGAGGCAG TCGAGAGGTA AAAAAAGCAG CGACTTTGTC TCGCCACGCG TTCGGGAACC AGCCGATGGT TGGAAAAAGG AAACACTGGA TCGGCAGCTG TATTTAAAAC GGCAAGCAGC GGGTGGGGGA ATACCGACGG GAAGTATTGC ATCACTGGAA CGCAACTGGA ACGAAACGGT ACAATCACTT TTTGCCAGAA TGAAAGCCTA A
|
Protein sequence | MEEDPQDDLF RCALCGTAEG DSSNLSSHTS LQTNATVRCG HQFCNSCIDR ELVRKREFPC PVCQTPVKRV TLTVRSLDDV QCEKDTSWRR RVLKVFNKTE PDFSSLLEFN NYLEQVEDMI YSIVNEEPDA EACKAKIKEY ENAHKTEIVI RQSQRADEER SIQDRIAAEQ RSTERLRREA FDEEKAVANA KKRLKQESTQ VLLGEREEVS AELRQAQMQG YRNELKRQSR GKKSSDFVSP RVREPADGWK KETLDRQLYL KRQAAGGGIP TGSIASLERN WNETVQSLFA RMKA
|
| |
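The length fields above can be cross-checked against the coordinates and the sequence rows. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive positions on the replicon and that the spaces in the sequence rows are display grouping only. The helper names (`normalize`, `span_length`, `gc_percent`) are hypothetical and not part of any database API.

```python
def normalize(seq: str) -> str:
    """Remove the whitespace between the 10-character display groups."""
    return "".join(seq.split()).upper()


def span_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    bases = normalize(seq)
    if not bases:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in bases) / len(bases)


# Coordinates taken from the Gene Information table.
print(span_length(514658, 515788))  # 1131, matching the Gene Length field

# The first two display groups of the protein sequence row.
print(len(normalize("MEEDPQDDLF RCALCGTAEG")))  # 20
```

Passing the full Gene sequence row to `gc_percent` and the full Protein sequence row to `normalize` gives values to compare with the GC content and Protein Length fields. The gene is marked as spliced, so the gene span is expected to be longer than three times the protein length.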