Gene Information |
Locus tag | PHATRDRAFT_39282 |
Symbol | |
ID | 7195025 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011687 |
Strand | - |
Start bp | 114948 |
End bp | 116426 |
Gene Length | 1479 bp |
Protein Length | 492 aa |
Translation table | |
GC content | 47% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002183423 |
Protein GI | 219126351 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
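The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive and that the gene length includes one terminal stop codon; the helper names `span_length` and `coding_codons` are hypothetical and not part of the source database.

```python
# Values copied from the Gene Information table.
START_BP = 114948
END_BP = 116426
GENE_LENGTH_BP = 1479
PROTEIN_LENGTH_AA = 492


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(length_bp: int) -> int:
    """Amino-acid count, assuming the CDS length includes one stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


print(span_length(START_BP, END_BP))   # 1479
print(coding_codons(GENE_LENGTH_BP))   # 492
```

Under these assumptions both derived values agree with the Gene Length and Protein Length fields listed in the record.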
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.218149 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTGCAATA CTTTTAAGGA ATGGCATACC AGTGCATTTT TAAGGTTACG TTTGCTAGTT GGACTATTAT TTCTGCTTTC GGTCGGCGCC AATTCAAGCC CAACGTGGGA TGAACTCGAT GCGAGGACGA ACCCTAGGTG GTATGACGAG GCGAAGTTTG GTATTTTTAT ACATTGGGGT CTCTTCAGTG TTCCAGGATA TCTGTCTCCA TGGTATCAAT CCTACTGGCA GGGACACTGG GAAGGGCCCT ACGGGAGCTG GAAGCAATAC GACTCCTTTG TAAACGAGAC GGAGCGATCG AATTTTGCCT ACCAGGATTA CGCCCATCGT TTCTTGGCAG AGCTCTACCG ACCGGACTAC TGGGCAGACG CCTTTGCTAA ATCTGGGGCG CAATATGTTG TACTTACAAG TAAGCATCAC GAAGGATACT GTATGTGGAA CTCGACCAAT ATTGCTACGA CTTGGAACTG GAACGTCATG GACACAGGAC CACGTAGGGA CCTCTTGGGA GATTTGTCGA AGGAAGTTAA GAAAGCGGTC AGTCCCTATA CCAATCGTAC TTTGAAGTTT GGTATCTATC ATTCCTTGCT GGAATGGTTC AATCCCCTTT ACCATCATGA TATGAAGAAC AACTGGACTA CTCAAACGAT GGTGGACTTA AAAGTTCTGC CAGAACTGTA CGATCTTGTG AAGCGCTACC AACCCGACTT ATTATGGTCC GATGGAGCTT GGGAAGCAGA TAGTACATAC TGGAAAGCAA CGGAATTTTT ATCCTGGTAC GGATACAACA GCTCGGTAGC GGAGACAGCT GTCTGGAACG ATCGTTGGGG TACTGACGCC ACGTGTGCAC ATGGAAGCTA CCTTACTTGC AATGATCGCT ACCAACCTGA TAGTTTTCAA GAAAGGAAAT GGGAAGATGC TACGACTATG GATACAAGCT CTTGGGGATA CAATCGCAAT GCTACCGCGC AGGATTTTAT GAGCGTCAAG GAGCTCATCG ATCAATTGAT TAGGGTGGTG GCTTACGGTG GAAATCTGTT ATTGAATGTT GCGCCTGCAG GCGACGGTAC GATCCACCCG TTGTACATTG ACCGACTGAT GGGTATAGGA GAGTGGCTGG GCGTGAACGG TAGAAGTGTC TATGGCACAA GACCCTGGAG CGTTTGCCAA ACCGAATCAG ACTCGAACGT ATTCTATACG AGAAGCAACG AAAGTTTGTT TGCACATATC ACCCAATGGC CAGCAGAGAG TGTCCTTGTT CTAAAGTGTC CTCATCCCAG CTCTCGAACT AAATTTCGAA TGTTGGGATT GACAGAGGCC TCTCTAGAAT GGAAGAGACA AATGCATACT GAGACTTCAG GATTCGATGT AATAAGGCGA TTTCCAGACA TTGCCATTGC ACTTCCCAAA CTAACTCCAG ATATTTTGCC ATGCCATCAC GCATGGGTCC TGGAAATCAG TGAAGTAGAA AACCTGTAG
|
Protein sequence | MCNTFKEWHT SAFLRLRLLV GLLFLLSVGA NSSPTWDELD ARTNPRWYDE AKFGIFIHWG LFSVPGYLSP WYQSYWQGHW EGPYGSWKQY DSFVNETERS NFAYQDYAHR FLAELYRPDY WADAFAKSGA QYVVLTSKHH EGYCMWNSTN IATTWNWNVM DTGPRRDLLG DLSKEVKKAV SPYTNRTLKF GIYHSLLEWF NPLYHHDMKN NWTTQTMVDL KVLPELYDLV KRYQPDLLWS DGAWEADSTY WKATEFLSWY GYNSSVAETA VWNDRWGTDA TCAHGSYLTC NDRYQPDSFQ ERKWEDATTM DTSSWGYNRN ATAQDFMSVK ELIDQLIRVV AYGGNLLLNV APAGDGTIHP LYIDRLMGIG EWLGVNGRSV YGTRPWSVCQ TESDSNVFYT RSNESLFAHI TQWPAESVLV LKCPHPSSRT KFRMLGLTEA SLEWKRQMHT ETSGFDVIRR FPDIAIALPK LTPDILPCHH AWVLEISEVE NL
|
| |
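The sequence fields are printed in space-separated blocks of ten characters. The sketch below shows one way to strip that grouping, compute GC content, and translate the coding sequence. It assumes the standard genetic code (the Translation table field in the record is blank, so this is an assumption) and that the gene sequence is already given in coding orientation; the function names are hypothetical.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), assumed here because
# the record does not state a translation table. Codons are enumerated
# in TCAG order to match the amino-acid string.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the whitespace grouping and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the Gene sequence field.
print(translate("ATGTGCAATA CTTTTAAGGA ATGGCATACC"))  # MCNTFKEWHT
```

The printed peptide matches the first block of the Protein sequence field. Passing the full Gene sequence value to `translate` and `gc_percent` would allow the same comparison against the complete protein and the GC content field.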